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empirical analysis guided by a command of theory. His work and 
professional style have set the standard of empirical economics, espe- 
cially in the field of labor economics, where he has made major 
contributions to the understanding of the determinants of earnings and 
the labor force participation of married women. 

His book Schooling, Experience and Earnings is a classic that has 
enormously influenced the quantitative study of earnings in countries 
throughout the world. He based his analysis on the theory of invest- 
ment in human capital, a framework that continues to motivate most 
empirical earnings studies. He showed that a few variables— especially 
education and measures of labor force experience— explain a signifi- 
cant fraction of differences in earnings among males. 

He has subsequently extended this analysis to consider the earnings 
of women, job separations, differences between the effects on earnings 
of job tenure and general employment experience, and other variables. 

Mincer was also among the first to examine the labor force participa- 
tion of married women in a model that highlights the relation between 
the decisions of husbands and wives, and the effect of family size on 
labor force participation. His approach continues to be followed and 
extended as social scientists study further the remarkable growth in 
participation of married women in all developed and many developing 
countries. 
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Lifetime Earnings and the Vietnam Era Draft Lottery: 
Evidence from Social Security 
Administrative Records 


By Josua D. ANGRIST* 


The randomly assigned risk of induction generated by the draft lottery is used to 
construct estimates of the effect of veteran status on civilian earnings. These 
estimates are not biased by the fact that certain types of men are more likely than 
others to service in the military. Social Security administrative records indicate 
that in the early 1980s, long after their service in Vietnam was ended, the 
earnings of white veterans were approximately 15 percent less than the earnings 


of comparable nonveterans. (JEL 824) 


A central question in the debate over mili- 
tary manpower policy is whether veterans 
are adequately compensated for their service. 
The political process clearly reflects the de- 
sire to compensate veterans: since World 
War II, millions of veterans have enjoyed 
benefits for medical care, education and 
training, housing, insurance, and job place- 
ment. Recent legislation provides additional 
benefits for veterans of the Vietnam era. Yet, 
academic research has not shown conclu- 

` sively that Vietnam (or other) veterans are 
worse off economically than nonveterans. 
Many studies find that Vietnam veterans earn 
less than nonveterans, but others find posi- 
tive effects, or effects that vary with age and 


*Department of Economics, Harvard University, 
Cambridge, MA 02138. Grateful thanks go to Warren 
Buckler, Cresston Smith, Ada Enis, and Bea Matsui for 
their assistance in producing the Social Security data; to 
Chester Bowie for his help in producing the SIPP data; 
and to Mike Dove for providing DMDC administrative 
records. Special thanks also go to David Card and 
Whitney Newey, from whose instruction and comments 
I have benefited greatly, and to Alan Krueger and an 
anonymous referee, whose careful reviews of an earlier 
draft led to substantial improvement. Data collection 
for this project was funded by the Princeton Industrial 
Relations Section. Funds for computation and financial 
support of the author were provided by the Industrial 
Relations Section, the Princeton Department of Eco- 
nomics, the Sloan Foundation, and the Olin Founda- 
tion. 


313 


schooling. Regarding the general position of 
veterans, a member of the Twentieth Cen- 
tury Fund's Task Force on Policies Toward 
Veterans concludes that “Within any age 
group, veterans have higher incomes, more 
education, and lower unemployment rates 
than their nonveteran counterparts."! 

The goal of this paper is to measure the 
long-term labor market consequences of mil- 
itary service during the Vietnam era. Previ- 
ous research comparing civilian earnings by 
veteran status may be biased by the fact that 
certain types of men are more likely to serve 
in the armed forces than others. For exam- 
ple, men with relatively few civilian opportu- 
nities are probably more likely to enlist. Es- 
timation strategies that do not control for 
differences in civilian earnings potential will 
incorrectly attribute lower civilian earnings 
of veterans to military service. The research 
reported here overcomes such statistical 
problems by using the Vietnam era draft 


The quote is from Michael Taussig (1974, p. 51). 
Legislation pertaining to veterans benefits is outlined in 
Veterans Administration (1984) and in other annual. 
reports of the Veterans Administration. Studies by 
Sherwin Rosen and Paul Taubman (1982), Saul Schwartz 
(1986), and Jon Crane and David Wise (1987) find that 
Vietnam veterans earn less than nonveterans. Dennis 
DeTray (1982) and Mark Berger and Barry Hirsch 
(1983) find some positive effects for different age and 
Schooling classes, and Veterans Administration (19812) 
researchers find an overall positive effect. 
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lotteries to set up a natural experiment that 
randomly influenced who served in the mili- 
2 


Section I describes the Social Security ad- 
ministrative records used in the empirical 
work and provides background on the draft 
lotteries. In each lottery, priority for induc- 
tion was determined by a Random Sequence 


Number (RSN) from 1-365 that was as- . 


signed to birthdates in the cohort being 
drafted. Men were called for induction by 
RSN up to a ceiling determined by the De- 
fense Department, and only men with lottery 
numbers below the ceiling could have been 
drafted. Therefore, men with lottery num- 
bers below the ceiling are referred to here as 
“draft-eligible.” 

The empirical analysis begins in Section II 
with estimates of the effect of draft eligibility 
on earnings. If draft eligibility is correlated 
with veteran status but uncorrelated with 
other variables related to earnings, then 
earnings differences by draft-eligibility status 
can be attributed to military service. In Sec- 
tion III, information on the proportions of 
draft-eligible and draft-ineligible men who 
actually served in the military is used to 
convert estimates of the effect of draft eligi- 
bility into estimates of the effect of military 
service. The assumptions underlying this 
procedure are those that justify instrumental 
variables estimation; in principle, any func- 
tion of the RSN provides a legitimate instru- 
ment for veteran status. In the second part 
of Section III, an instrumental variables esti- 
mation strategy is developed which is more 
: efficient than one based solely on draft-eligi- 
bility status. Results in Section III indicate 
that white veterans earn approximately 15 
percent less than nonveterans as much as ten 
years after their discharge from the military. 


?A candid assessment of the problems caused by 
nonrandom selection for military service is given by 
Crane and Wise (1987), who note they were unable to 
use econometric sample selection models to generate 
robust estimates of the effects of military service on 
civilian earnings. The first researchers to use the lottery 
to solve the selection problem were Norman Hearst, 
Tom Newman, and Stephen Hulley (1986), who present 
lottery-based estimates of delayed effects of military 
service on mortality. 
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Section IV tests the hypothesis that veter- 
ans earn less than nonveterans because they 
have less civilian labor market experience. 
Results in this section suggest that the earn- 
ings loss to white veterans is equivalent to a 
loss of two years of civilian labor market 
experience. Section V reviews some of the 
potential pitfalls in estimation based on the 
draft lottery. Section VI offers conclusions 
and indicates directions for future research. 


J. Background and Data 
A. National Random Selection? 


There were five draft lotteries during the 
Vietnam War period. The 1970 lottery cov- 
ered 19- to 26-year-old men born in 1944—50, 
although most of the men drafted in 1970 
were born in 1950. Other lotteries were re- 
stricted to 19- and 20-year-olds. The 1971 
lottery covered men born in 1951, the 1972 
lottery covered men born in 1952, and so on, 
through 1975. However, no one was drafted 
after 1972, and congressional conscription 
authority expired in July 1973. 

Draft lottery RSNs were randomly as- 
signed in a televised drawing held a few 
months before men reaching draft age were 
to be called.^ Draft-eligibility ceilings—RSN : 
195 in 1970, RSN 125 in 1971, and RSN 95 
in 1972— were announced later in the year, 
once Defense Department manpower needs 
were known. As a consequence of this delay, 
many men with low numbers volunteered for 
the military to avoid being drafted and to 
improve their terms of service. (Angrist 
1989b). There was even a behavioral re- 
sponse to the lottery in enlistment rates for 
the 1953 cohort, although no one born in 
1953 was drafted. In the analysis that fol- 
lows, the “draft-eligibility ceiling" for men 
born in 1953 is set at RSN 95, the highest 
lottery number called in 1972. 


3This section draws on Curtis Tarr (1981) and the 
Selective Service System (1986). 

Men born from 1944-49 were already of draft age 
when the 1970 lottery was held on December 1, 1969. 
For nonveterans ia this group, subsequent liability for 
service was determined by 1970 lottery numbers. 
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Only the initial selection process was based 
on RSN order. Subsequent selection from 
the draft-eligible, nondeferred pool was 
based on a number of criteria. The most 
important screening criteria were the pre- 
induction physical examination and a mental 
aptitude test. In 1970, for example, half of 
all registrants failed pre-induction examina- 
'tions and 20 percent of those who passed 
were eliminated by physical inspections con- 
ducted at induction (Selective Service Sys- 
tem, 1971). Of course, the fact that armed 
forces selection criteria were ultimately not 
random does not mean that the initial prior- 
ity for induction was not randomly assigned 
by RSN. 

The year 1970 was the last time men over 
the age of 20 were drafted. In principle, 
nonveterans born between 1944 and 1949 
continued to be at risk of induction in the 
1970 lottery, but the majority of men who 
ended up serving from these cohorts had 
already entered the military by the time of 
the 1970 lottery drawing. Veterans born from 
1944—49 who managed to avoid service until 
1970 may not constitute a representative 
sample. Therefore, the analysis here is re- 
stricted to men who turned 19 in the year 
they were at risk of induction. This sample 
includes men who were born between 1950 
and 1953. 


B. Social Security Earnings Data 


Earnings data used in this study are drawn 
from the Social Security Administration's 
(SSA) Continuous Work History Sample 
(CWHS). The CWHS data set, described in 
detail in the Appendix, is a one percent 
sample drawn from all possible Social Secu- 
rity numbers. The CWHS includes two earn- 
ings series: the first contains information on 
the 1964—84 earnings of men in employment 
covered by FICA (Social Security) up to the 
Social Security taxable maximum. It also 
includes FICA taxable earnings from self- 
employment. 'The second series, beginning in 
1978, contains total compensation as re- 
ported on Internal Revenue Service Form 
W-2, excluding earnings from self-employ- 
ment. In principle, the W-2 earnings data are 
neither censored nor limited to earnings from 
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Social Security taxable employment. How- 
ever, because SSA procedures for the collec- 
tion of W-2 forms are relatively new, W-2 
earnings data are probably less reliable than 
the FICA data. 

The original CWHS data set does not 
contain information on date of birth. SSA 
programmers matched date of birth vari- 
ables to the CWHS in a special extract cre- 
ated for this project. Lottery numbers were 
then matched to dates of birth, using tables 
published in the 1969-73 Semiannual Re- 
ports of the Director of Selective Service. 

The Internal Revenue Service limits dis- 
closure of data collected for tax purposes. 
To adhere to these disclosure requirements, 
the SSA could release only aggregate data. 
The aggregate data set contains sample 
statistics for cells defined by year of earn- 
ings, year of birth, race, and five consecutive 
lottery numbers. Cell statistics include 
means, variances, fraction: with earnings 
equal to the taxable maximum, fraction with 
earnings above the taxable maximum, frac- 
tion with zero earnings, and number of ob- 
servations in each cell. 


U. The Effect of Draft Eligibility 
i on Earnings 


Figure 1 shows the history of FICA tax- 
able earnings for draft lottery participants 
born between 1950 and 1953.? For each co- 
hort there are two lines drawn: one for 
draft-eligible men, and one for men with 
lottery numbers that exempted them from 
the draft. 

The impact of draft eligibility on the earn- 
ings profiles is striking. There appears to be 
no difference in earnings until the year of 
conscription risk in the draft lottery. Subse- 
quently, the earnings of draft-eligible white 
men born in 1950-52 fall below the earnings 
of draft-ineligible white men born in 1950- 
52. 'The earnings of draft-eligible nonwhites 
also fall below the earnings of other non- 


Earnings are in 1978 dollars. The deflator used for 
all tabulations is the CPI on p. 313 of The Economic 
Report of the President (Council of Economic Advisors, 
1988). 
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: Notes: The figure plots the history of FICA taxable earnings for the four cohorts born 
1950—53. For each cohort, separate lines are drawn for draft-eligible and draft-ineligible, 
men. Plotted points show average real (1978) earnings of working, men born in 1953, 
real earnings + $3000 for men born in 1950, real éarnings + $2000 for men born in 1951, 
and real earnings + $1000 for men born.in 1952. 


FIGURE 1. SOCIAL SECURITY EARNINGS PROFILES BY DRAFT-ELIGIBILITY 
STATUS 


whites, but the gap appears to narrow and 
become positive for some nonwhite cohorts 
in later years. The fact that earnings do not 
differ by draft-eligibility status before the 
lotteries is a consequence of the random 
assignment of draft eligibility. The only thing 
that distinguishes draft-eligible men from 
draft-ineligible men is the higher conscrip- 
‘tion risk faced by eligible men after the 
lottery. 

: Figure 2 presents a magnified view of the 
effect of draft eligibility on earnings. This 
figure plots the time-series of differences in 
earnings by draft-eligibility status for each 
cohort. As in Figure 1, Figure 2 shows no 
difference during the years before the year of 
conscription risk, while in subsequent years, 
the earnings histories diverge. Figure 2 also 
shows that the loss of earnings to draft-eligi- 
ble white men was largest during the period 
they were most likely to be in the service. 


However, the earnings of draft-eligible white 
men continued to lag behind the earnings of 
draft-ineligible white men through 1984. 

The picture for nonwhites is less clear. 
The earnings of draft-eligible nonwhites born 
in 1950 and 1951 exceed those of draft- 
ineligible nonwhites in some of the later 
years. On the other hand, for nonwhites born 
in 1952, time-series variation in earnings 
differences by eligibility status is similar to 
that of whites. The general impression for 
the three older cohorts of nonwhites is that 
the earnings of draft-eligible men at least 
had caught up with the earnings of draft-in- 
eligible men by 1984. 

Earnings of white men born in 1953 do 
not appear to differ by draft-eligibility sta- 
tus. The earnings of draft-eligible nonwhites 
born in 1953 generally exceed the earnings 
of nonwhites who were not draft-eligible. 
Differences between the effect of draft eligi- 
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NONWHITES EARNINGS DIFFERENCE 


66 68 70 72 74 76 78 80 82 84 
YEAR 


BORN 1950 
BORN 1951 


BORN 1952 
BORN 1953 


Notes: The figure plots the difference in FICA taxable earnings by draft-eligibility 
status for the four cohorts born 1950-53. Each tick on the vertical axis represents $500 


real (1978) dollars. 


FIGURE 2. THE DIFFERENCE IN EARNINGS BY DRAFT-ELIGIBILITY STATUS 


bility on men born in 1953 and the effect on 
the three older cohorts might be explained 
by the transition to an All-Volunteer Force 
in 1973. Men who volunteer for the military 
are probably less likely than draftees to suf- 
fer a career disadvantage from their service. 

Estimates of the effect of draft eligibility 
are reported in Table 1 for both FICA earn- 
ings and W-2 earnings. Standard errors asso- 
ciated with the estimates are reported in 
parentheses. The statistics in Table 1 show 
that the loss in FICA earnings to draft-eli- 
gible white men is sometimes statistically 
significant and amounts to 2-3 percent of 
earnings. Estimated W-2 earnings losses are 
similar, but tend to be larger and more vari- 
able than the estimated losses in FICA earn- 
ings. In contrast, differences in earnings by 
draft-eligibility status for nonwhites rarely 
exceed their standard errors. 

Elsewhere (Angrist 1989c), I have shown 
that draft-eligible white men are less likely 
to have earnings above the FICA taxable 
maximum than draft-ineligible white men. 


The effect of draft eligibility on nonwhites’ 
probability of being at the taxable maxi- 
mum, although imprecisely measured, also 
appears to go in the same direction as the 
effect of draft eligibility on mean earnings. 
These results are worth noting because, when 
the effect of draft eligibility on the probabil- 
ity of being censored has the same sign as 
the effect on earnings, estimates tabulated 
using censored data tend to underestimate 
the true effect. 


SThe effect of censoring on estimated treatment ef- 
fects is discussed in the appendix. Angrist (1989c) also 
reports estimates of the effect of draft eligibility on the 
probability of having no recorded earnings. These tabu- 
lations indicate that draft-eligible whites were somewhat 
more likely to have had FICA earnings during the years 
in which they were in the service, and that draft-eligible 
nonwhites are more likely to have had no earnings in 
recent years. There is no statistically significant evi- 
dence for either race, however, of any lasting effect of 
draft eligibility on the probability of having zero earn- 
ings. 
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TABLE 1— DRAFT-ELIGIBILITY TREATMENT EFFECTS FOR EARNINGS 





Whites 
FICA Taxable Earnings . Total W-2 Compensation 
Year 1950 1951 1952 1953 1950 1951 1052 1953 
66 —21.8 f 
(14.9) 
67 — 8.0 13.1 
(18.2) (164) 
68 —14.9 123 —8.9 
(24.2) (95) (192) 
69 —2.0 18.7 11.4 -40 
(34.5) (264) (22.7) (83) 
70 —233.8  —448 —5.0 32.9 
(39.7) (36.7) (29.3) (24.2) 
71 —325.9 —2982  —294 27.6 
(46.6) (41.7) (40.2) (303) 
72 —203.5 —1974 —261.6 2.1 
(55.4) (511) (46.8) (429) 
73 — 226.6 228.8 —357.7 56.5 
(67.8) (616) (562) (548) 
74 —243.0 —1554 —4027  -150 
(81.4) (75.3) (68.3) (68.1) 
75 —295.2 Á—992 -3045  —283 
(94. (89.7 (850) (79.6) 
16 —3142  Á—86.8 —3707 -145.5 
(106.6) (102.9 (982) (93.0) 
TI —262.6 —2742 —3969  —85.5 
(117.9 (1122) (1L1) (107.1) 
78 —205.3 —203.8 —4671  —653 1,0593 2332 1753 —1,974.5 
(132.7 (127.0) (127.3) (123.1) (2,159.3) (1,609.4) (1,567.9) | (912.1) 
79 —263.6 —60.5 —236.8 89.2 —1,5887 523.6 —580.8 —557.9 
(160.5) (152.3) (153.9) (148.7) (1,575.6) (1,590.5) (736.7) (150.1) 
80 —339.1 -2679 —3121 -—93.8 —-1,0281 85.6 —5813  —4287 
(183.2) (175.3) (1782) (170.7) (756.8) (599.8) (309.1) (341.5) 
81 — 435.8 —358.33 —342.8 343  —5896  —71.6 —4405 -109.5 
(210.5) (203.6) (206.8) (199.0) (299.4) (423.4) (2650) (2452) 
82 —320.22 —1173 —235.1 2904  —3055 -72.7 —5147 18.7 
(235.8) (229.1) (232.3) (222.6) (345.4) (3721) (296.5) (2819) 
83 —349.5 —3140 -—4377  Á —96.3  —5129 —896.5 -—915.7 30.1 
(261.6) (2532) (257.5) (248.) (4412) (4263) (3952)  Q181) 
84 — 484.3 —398.44 —4360 —228.66 —1,143.3 —809. -7672 | —-1642 
(286.8) (2792) (281.9 (2722) (4922) (380.9) (376.0) (3660) 


on Earnings 





IIL The Effect of Military Service 


| À. Estimates Using Draft Eligibility 


Estimates of the effects of military service 


are based on a simple linear model for earn- 
ings. Denote the earnings of man i in cohort 
c at time £t by y,,,, and let s, be an indicator 
of veteran status. Then we may write 


(1) 


Yeti = B, 9, sa us, 


where f, is a cohort effect, Ó, is a period 
effect common tc all cohorts, and u; is a 
residual. The coefficient a is the effect of 
military service on civilian earnings. If s, is 
correlated with the unobserved components 
of the earnings equation, then a will not be 
consistently estimated by Ordinary Least 
Squares (OLS). For example, correlation be- 
tween s; and u;, may arise because the armed 
forces' eligibility criteria are correlated with 
earnings, but not accounted for by the. 
econometrician, or because veterans are 
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TABLE 1—CONTINUED 


Nonwhites 
FICA Taxable Earnings Total W-2 Compensation 
Year 1950 1951 . 1952 1953 1950 1951 -1952 1953 
66 —11.8 
(27.6) 
67 129 | —40 
j (34.2) (30.6) 
68 —29.5 -62  —120 
(44.5) (373) (85.0) 
69 —541 67.8 34 —42.4 
(66.8) (534) (434) (364) 
70 — 99.8 62.2 24.7 -9.0 
(78.5) (75.7) (622) (44.9) 
71 —1648 —1443 —25.0 182 
(92.7) (864) (851) (60.7) 
72 —188.8 —156.77 -2082 60.4 
(113.6) (105.7) (1042) (92.8) 
73 —85.7 —1348 -175.6 115.5 
(137.7) (127.0) (290) (119.4) 
74 —179.3  —967 -1814 2165 
(165.0) (160.1) (155.6) (145.1) 
75 —190.3 —2361 -183.77 111.6 
(189.3) (186.8) (185.8) (166.9) 
76 —105.3 —333.7 —3089  —464 
(14.7 (215.4) (216.5 (199.3) 
TI 1124 —206.8 —2511 1535 
(238.5) (240.4) (248.5) (233.5) 
78 163.6 —108.6. —4249 381.9 -1,145.0 2,9782 —4,676.2  —482.7 
(272.6) (269.2) (279.4) (275.7) (2,395.6) (2,869.6) (1,393.1) (2,206.0) 
79 1870 —2103 -—3917 3120 4005.4 1,545.0 —4947 -—1,043.3 
(317.2) (323.0) (324.8) (326.3) (2,7212) (2,191.1) (2,683.8) (1,6602) 
80 203.2 48 —2126 3440 790.3 3764 -2927 288.6 
(363.1) (368.4) (372.5) (370.3) (648.1) (533.6) (440.9) (4164) 
81 534.5 313.2 — 305.8 717.8 802.5 415.9 -—272. 784.4 
(413.5) (419.1) (429.1) (433.7) (5246) (745.1) (492.8) (503.1) 
82 285.1 175.4 —262.5 810.4 3260 —2443 —160.2 675.1 
(461.2) (471.6) (476.7) (486.3) (608.9 (647.8) (590.0) (5641) 
83 96.0 4195 —1773 543.6 3154 | 2543 -53.6 462.3 
(512.6) (5381) (531.5) (5232) (720.0) (761.5) (643.4) (638.9) 
84 —76.8 —-2231 -1233 6413 -—2874 -—7186 -2880 827.3 
(548.2) (562.8) (568.5) (568.2) (8040) (77.5) (7210) (716.8) 


Notes: Standard errors in parentheses. 


The table shows the difference in earnings by lottery- determined draft-eligibility status. 
Eligibility ceilings are RSN 195 for men born in 1950, RSN 125 for men.born in 1951, 
and RSN 95 for men born in 1952 and 1953. 
Earnings data are from the Social Security Administration CWHS, described in the text 
and the paren 


self-selected on the basis of unobserved 


characteristics. 


The draft lottery facilitates estimation of 


(1) because functions of randomly assigned 
lottery numbers provide instrumental vari- 
ables that are correlated with s,, but orthog- 


onal to the error term, u,,. For example, one 
such instrument is a dummy variable, d, 
that equals one if the ith individual wa: 
draft eligible. Suppose that attention is re 
stricted to a single cohort. Then, use of d 
and a constant as instrumental variable: 
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leads to the following estimator for a: 


Q) &=(¥*— 9") /( °- P"), 


where p is the proportion of the cohort 
actually entering the military, y is mean 
earnings, and superscript e and superscript 
n denote the draft-eligible and draft-ineligi- 
ble samples. Note that the numerator of (2) 
consists of estimates of the effect of draft 
eligibility plotted in Figure 2. 

Intuitively, equation (2) simply adjusts 
earnings differences by draft-eligibility status 
for the fact that not all draft-eligible men 
actually served in the military, while some 
men who were not draft eligible voluntarily 
enlisted for service. The justification for esti- 
mation of the effects of military service in 
this manner is clear: it is assumed that noth- 
ing other than differences in the probability 
of being a veteran is responsible for differ- 
ences in earnings by draft-eligibility status. 
This formula may also be recognized as an 
application of Abraham Wald’s (1940) 
grouping method, where the data have been 
grouped by draft-eligibility status. Applica- 
tions of this formula will therefore be re- 
ferred to here as * Wald estimates." 

In addition to draft-eligibility treatment 
effects, implementation of the Wald estima- 
tor requires estimates of f° and f". These 
estimates are tabulated from a special ver- 
sion of the 1984 Survey of Income and Pro- 
gram Participation (SIPP). The SIPP data 
used here were matched to an indicator of 
draft-eligibility status from information on 
. birthdates included in the Census Bureau's 
' in-house version of the SIPP file. Additional 
details on the SIPP data are provided in 
Section 7 of the Appendix. 

In the upper panel of Table 2, the columns 
labeled p* and p" show probabilities of 
veteran status tabulated using the SIPP. Be- 
cause of the small number of observations 
available for single-cohort statistics, each 
SIPP estimate is actually the average for 
three consecutive cohorts. For example, SIPP 
estimates assigned to men born in 1951 are 
based on data for men born in 1950, 1951, 
and 1952. The last column of Table 2, la- 
beled p*— p", shows the difference in the 
probability of military service by draft-eligi- 
bility status. Estimates of the effect of draft 
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eligibility on veteran status for whites born 
1950-52 range from 0.10 to 0.16. Thus, a 
rule of thumb for conversion of draft-eligi- 
bility treatment effects into estimates of the 
effects of military service is to multiply by 
1/0.15 = 6 2/3. 

Wald estimates of the effect of military 
service for selected cohorts and years are 
presented in Table 3. The sample is re- 
stricted to the subset of whites born 1950—52 
because the results in Table 2 suggest that 
this is the group for whom draft eligibility is 
most likely to be a useful instrument. Earn- 
ings variables are for 1981-84 because the 
impact of military service in these years rep- 
resents a long-term effect. Furthermore, as a 
practical matter, both FICA and W-2 earn- 
ings data are likely to be more reliable in 
recent years—the FICA data because of in- 
creased employment coverage and the W-2 
data because of improvements in data collec- 
tion procedures. 

Table 3 reports three sets of estimated 
draft-eligibility effects for use in the numera- 
tor of the Wald estimator. Column (1) pre- 
sents estimates for FICA earnings and col- 
umn (3) presents estimates for W-2 earnings; 
the figures in both of these columns are 
copied directly from Table 1. In addition, 
column (2) reports estimates for an earnings 
series constructed by applying a simple 
non-parametric correction for censoring to 
the FICA earnings data. The correction pro- 
cedure is described in detail in Section 6 of 
the Appendix. Briefly, data are adjusted for 
censoring by using the fraction with recorded 
earnings at the taxable maximum, combined 
with mean earnings above the taxable maxi- 
mum estimated from Current Population 
Surveys, to estimate population’ mean earn- 
ings from censored mean earnings. Note that 
the effects estimated using the adjusted data 
are usually bracketed by the effects esti- 
mated using the unadjusted FICA and W-2 
data. Therefore, only Wald estimates con- 
structed from the adjusted data are reported 
in the table. 

Wald estimates for adjusted FICA earn- 
ings, reported in column (5) of Table 3, 
indicate that white veterans suffered an an- 
nual earnings loss of roughly $2000 constant 
(1978) dollars of $3,500 current dollars. This 
is approximately 15 percent of annual W-2 
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TABLE 2— VETERAN STATUS AND DRAFT ELIGIBILITY 


Whites 

Data Set Cohort Sample  P(Veteran) pE P p p" 
SIPP (84)* 1950 351 0.2673 0.3527 . 0.1933 0.1594 
(0.0140) — (0.0325) (0.0233) (0.0400) 

1951 359 0.1973 0.2831 0.1468 0.1362 

(0.0127) — (0.0390) (0.0180) (0.0429) 

1952 336 0.1554 0.2210 0.1257 0.1053 

(0.0114) — (0.0473) (0.0146) (0.0495) 

1953 390 0.1298 0.1581 0.1153 0.0427 

(0.0106) — (0.0339) (0.0152) (0.0372) 

DMDC/CWHS* 1950 — 16119 0.0633 0.0936 | 0.0279 0.0657 


(0.0019) (0.0032) (0.0019) (0.0037) 
19531 16768 0.1176 0.2071 0.0708 0.1362 

(0.0025) — (0.0053) (0.0024) (0.0059) 

1952 17:03 0.1515 02683 01102 01581 

(0.0027) (0.0065) (0.0027) (0.0071) 

i. 1953 17749 01343 01548 01268 0.0280 
(0.0026) — (0.0053) (0.0029) (0.0060) 


Nonwhites 

Data Set | Cohort Sample  P(Veteran) p p" b-wF 
SIPP (84)* 1950 70 0.1625 0.1957 0.1354 0.0603 
(0.0292) — (0.0699) (0.0491) (0.0854) 

1951 63 0.1703 0.2014 0.1514 0.0500 

(0.0292) — (0.0827) (0.0448) (0.0940) 

1952 52 0.1332 0.1449 — 0.1287 — 0.0161 

(0.0275) — (0.1040) (0.0373) (0.1105) 

1953 55 0.1749 0.1980 0.1612 0.0367 
(0.0305) (0.0865) (0.0470) (0.0984) 

DMDC/CWHS? 1950 5447 0.0417 0.0548 0.0271 0.0276 


(0.0027) (0.0042) (0.0032) (0.0053) 
19531 5258 0.0794 01173 .0.0599 0.0574 

(0.0037) — (0.0076) (0.0040) (0.0086) 

l 1952 5493 0.0953 0.1439 0.0794 0.0644 
c. (0.0040) — (0.0095) (0.0042) (0.0104) 
1953 5303 0.0925 0.0984 0.0904 0.0080 

(0.0040) — (0.0079) (0.0046) (0.0092) 


Notes; Standard errors in parentheses. D^ is the probability of being a veteran 
conditional on being draft eligible; p" is the probability of being a veteran conditional 
on being ineligible. i 

*Wave I, Panel I of the 1984 Survey of Income and Program Participation. 
Probabilities are for service in the Vietnam era. Estimates are weighted by the SIPP 
sampling weight and smoothed over 3 cohorts. 

Defense Manpower Data Center Administrative Records’ information on acces- 

sions, from 1970-73, combined with information on cohort size from the Social 
Security Administration Continuous Work History Sample. 


compensation for white men between 1981 statistically significant at conventional 
~-- and 1984. The similarity of coefficient esti- levels.” 

mates across cohorts and years suggests that 

the Wald estimates provide a robust measure 

of the impact of military service. Taken indi- TThe asymptotic standard error of the Wald esti- 
vidually, however, few of the estimates are mates is derived from the limiting distribution of 
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TABLE 3— WALD ESTIMATES 


Draft-Eligibility Effects in Current $ 


FICA Adjusted FICA Total W-2 


: Earnings Earnings 
Cohort Year (1) (2) 
1950 1981  —435.8 — 487.8 

(210.5) (237.6) 

1982  —3202 —3961 
(235.8) . Q81.7) 

1983 —349.5 —450.1 
(261.6) (302.0) 

1984  —4843 — 638.7 
(286.8) (336.5) 

1951 1981  —3583 —428.7 
(203.6) (224.5) 

1982 —117.3 —278.5 
(229.1) (264.1) 

1983  —3140 ~ 452.2 
(253.2) (289.2) 

1984  —3984 ~ 573.3 
(279.2) (331.1) 

1952 1981  —342.8 — 392.6 
(206.8) (228.6) 

1982  —2351 — 255.2 
(232.3) (264.5) 

1983  —4377 — 500.0 

à (257.5) (294.7) 
1984  —4360 — 560.0 
(281.9) (330.1) 


Notes: Standard errors in parentheses. 


Columns (1) and (3) are taken from Table 1. 


Service Effect 


Earnings f*— f" in 1978$ 
(3) (4) (5) 
—589.6 0159 2,195.8 
(2994) (0040) (1,069.5) 
— 305.5 —1,6783 
(345.4) (1,193.6) 
—512.9 —1,795.6 
(441.2) (1,204.8) 

-1143.3 — 2,517.7 
(492.2) (1,326.5) 
-716 0.136  —2,261.3 
(423.4) (0.043) (1,184.2) 
2023 —1,386.6 
(372.1) (1,312.1) 
— 896.5 -2,181.8 
(426.3) (1,395.3) 
—809.1 —2,647.9 
(380.9) (1,529.2) 
—440.5 0105 —2,5023 
(265.0) (0.C50) (1,556.7) 
—514.7 —1,626.5 
(296.5) (1,685.8) 
—915.7 —3,103.5 
(395.2) (1,829.2) 
~167.2 — 3,323.8 
(376.0) (1,959.3) 


Column (2) reports draft-eligibility treatment effects on earnings adjusted for censoring 
at the FICA taxable maximum. The adjustment procedure is described in the Ap- 

. pendix. Column (4) reports SIPP estimates of the effect of draft eligibility on veteran 
status, taken from Table 2. Column (5) reports estimates of the effect of military service 
on civilian earnings is implied by columns (2) and (4). 


B. Efficient Instrumental 
Variables Estimates 


The Wald estimator is based solely on 
earnings differences by draft-eligibility sta- 
tus. A more efficient estimator exploits ail 
the information on RSNs in the aggregate 
data by fitting earnings model (1) to obser- 
vations on mean earnings for each group of 


YnCy* — y")/Cf* — P^). The standard error is therefore 
equal to 1/( p* — p^) times the standard ‘error of the 
numerator because the numerator has a nondegenerate 
limiting distribution, while (f° — f”) converges to 4 
constant. The same standard error formulas arise from 
application of conventional Instrumental Variables for- 
mulas. ` : 


d 


five consecutive lottery numbers. Consider 
the following grouped version of (1), where 
Jaj i$ mean earnings for members of cohort 
c at time ¢ with lottery numbers in group j, 
and f,, is the fraction of cohort c with 
lottery numbers in group j who served: 


Baj = Be + à Ba Us 


Intuitively, estimation of (3) simply general- 
izes Wald's method to grouped data with 
more than two groups. 

Generalized. Least Squares (GLS) esti- 
mates of (3) may easily be shown to have 
an instrumental variables interpretation 
(Angrist, 1988). In this case, the instrument 
set includes dummy variables that indicate 


VOL. 80 NO. 3 


groups of five consecutive lottery numbers 
for each race, cohort, and year of earnings. 
There are 73 dummy variables for a particu- 
lar race, cohort, and year; the first indicates 
men with lottery numbers 1-5 and the 73rd 
indicates men with lottery numbers 360—365. 
Furthermore, the quadratic form minimized 
by the GLS estimator is an overidentifica- 
tion test statistic associated with the use of 
dummy variables as instruments. This statis- 
tic tests the exclusion of lottery number 
group dummies from equation (1). It may 
also be viewed as a measure of the goodness- 
of-fit of the cell means to equation (1).? 

In principle, implementation of the esti- 
mation strategy based on (3) is straightfor- 
ward—the estimates are simply coefficients 
from GLS regressions of mean Social Secu- 
rity earnings on estimates of f... The SIPP 
sample is too small to allow accurate estima- 
tion of a full set of p,;, however. . Thus, a 
second set of probabilities was estimated 
from a combination of Defense Manpower 
Data Center (DMDC) administrative records 
and CWHS data on cohort size. Detailed 
descriptions of the DMDC administrative 
records may be found in Angrist (1989b). 
Briefly, the DMDC data show the total num- 
ber of new entrants to the military by race, 
cohort, and lottery number from July 1970 
through December 1973. 

DMDC and CWHS administrative records 
are used to estimate f.., by first counting the 
number of entrants to "the military by race, 
cohort, and lottery number interval. These 
numbers are the numerator of the P. . Esti- 
mates of overall cohort size, to be used in 
the denominator of f,;, are derived from the 
CWHS. Recall that the CWHS is a one 
percent sample, so that if the CWHS sam- 
pling frame is identified with the populauon 


TA general reference on overidentification testing is 
Whitney Newey (1985). See also Angrist (1988), where 
GLS on grouped data is shown to be the minimum 
variance linear combination of all the Wald estimators 
that can be computed from any division of grouped 
observations into linearly independent pairs. The overi- 
dentification test statistic for dummy variable instru- 
ments is also shown to be the same as the Wald statistic 
for equality of alternative Wald estimates. 
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at risk, an estimate of total cohort size is 
simply 100 times the CWHS cohort size. For 
example, to estimate the probability of being 
a veteran conditional on being draft eligible, 
the number of draft-eligible men in the 
DMDC data is divided by 100 times the 
number of men in the CWHS with lottery 
numbers below the induction ceiling. Stan- 
dard errors for these estimates are computed 
by applying the usual formula for a binomial 
proportion.? 

For comparison with the SIPP estimates, 
DMDC/CWHS estimates of f* and f” are 
reported in the lower panel of Table 2. These 
figures show that, with the exception of the 
1950 cohort, the SIPP and DMDC/CWHS 
procedures give reasonably similar estimates 
of p*— p". Inaccuracy of the DMDC/ 
CWHS estimates for 1950 is a consequence 
of the fact that DMDC administrative 
records are unavailable before July 1970. 
Therefore, despite the limitations of the SIPP 
data, the SIPP must be used to construct 
probabilities for the 1950 cohort. The SIPP 
sample is too small to allow estimation of a 
complete set of p,, for all lottery number 
cells in 1950. Consequently, SIPP estimates 
for 1950 are computed for only two cells, 
defined by draft-eligibility status, and CWHS 
earnings data for men born in 1950 are also 
grouped by draft eligibility. Thus, for each 
race and year, the sample used to estimate 
equation (3) includes 73 cell means for each 
of the three cohorts born from 1951-53, plus 
two cell means for the 1950 cohort. | 

A graphical version of equation (3) is de- 
picted in Figure 3, which shows the relation- 
ship between probabilities of veteran status 
(£.;) and mean W-2 compensation in 1978 
dollars ( y4;) between 1981 and 1984. Plot- 
ted in the figure are the average (over four 
years of earnings) residuals from a regression 


?The formula used is y[ £(1— 6)/n,], where ĵ is the 
estimated proportion of servers and n, is the number in 
the CWHS cohort. For example, 5749 draft eligible 
white men in the CWHS were born in 1951, and DMDC 
administrative records show that 119,062 draft-eligible 
white men born in 1951 served between July 1970 and 
December 1973. p° is therefore 119,062/574,900 = 0.21, 
with estimated variance equal to (0.21 *0.79)/5749. 


324 l E THE AMERICAN ECONOMIC REVIEW 


WHITES EARNINGS IN 1978 DOLLARS - 


YEAR 





66 68 70 72 74 76 78 80 82 84 


JUNE 1990 





66 68 70 72 74 76 78 80 82 84 
YEAR 


NONWHITES EARNINGS IN 1978 DOLLARS 


DRAFT 


ELIGIBLE INELIGIBLE 
COHORT 1950 +++ 


1954 e. 


1952. ———— 
1953  ———--. 





Notes: The figure plots mean W-2 compensation in 1981-84 against probabilities of 


veteran status by cohort and groups of five consecutive lottery numbers for white men 
born 1950-53. Plotted points consist’ of the average residuals (over four years of 
earnings) from regressions on period and cohort effects. The slope of the least squares 
regression line drawn through the points is — 2,384 with a standard error of 778, and is 
an estimate of a in the equation 


Jaj 7 By + 8, + Pojat uy. 


FIGURE 3. EARNINGS AND THE PROBABILITY OF VETERAN STATUS BY LOTTERY NUMBER 


of earnings and probabilities on period and 
cohort effects.!° Thus, the slope of the ordi- 
nary least squares regression line drawn 
through the points corresponds to an esti- 
mate of a. This slope is equal to —2,384 
dollars, with a standard error of 778 dollars. 
An interesting feature of the figure is the 
apparent heteroscedasticity of the earnings 
residuals. Dispersion around the regression 
line is reduced for cells with high probabili- 


10There are 221 points plotted in the-figure: 4 years 
of earnings times 3 cohorts with 73 cells plus 4 years of 
earnings times one cohort with 2 cells (men born in 
1950) = 884, divided by four to compute the average 
over years. 


ties of veteran status. This heteroscedasticity 
also appears in comparisons (not shown here) 
of cell variances by draft-eligibility status; 
draft-eligible men have somewhat less vari- 
able earnings. 

As pointed out earlier, estimation of (3) i is 
the same as instrumental variables estima- 
tion of (1) using dummy variables as instru- 
ments. However, inference for the case where 
the estimation strategy is implemented by 
regressing CWHS mean earnings on 
DMDC/CWHS or SIPP probabilities is 
complicated by the use of data from multiple 
samples. Assuming that the samples used to 
calculate mean earnings and the sample used: 
to calculate ., are independent, the optimal 
Two-Sample Instrumental Variables (TSIV) 


VOL. 80 NO. 3 


estimator has a simple form that may be 
briefly described as follows. Let y denote the 
vector of Ya; P denote the vector of P,;, 
and u(@) denote the vector of usj, where 0 
in parentheses represents the dependence of 
residuals on the parameter vector. Also, let 
V( ) denote the covariance matrix of the 
argument. Then the optimal TSIV estimator 
chooses @ to minimize 


-u(0)'[V(9) o? (9)] “a 4) = m0), 


which is also the GLS minimand for (3).!! 

The minimized value of m(@) is an overi- 
dentification test statistic for the validity of 
dummy variables as instruments. If some of 
these dummy variables are correlated with 
the regression error, then m(@) should be 
large relative to a chi-square distribution with 
degrees of freedom equal to the difference 
between the number of instruments and the 
number of estimated parameters. 

Table 4 presents two sets of TSIV esti- 
mates of equation (3) for 1981—84 earnings 
in 1978 dollars. Model 1 allows the effect of 
veteran status on earnings to vary by cohort, 
while Model 2 restricts estimated service ef- 
fects to be the same across cohorts. Note 
that, as in Table 3, the heading "adjusted 
FICA earnings" refers to FICA-taxable 
earnings adjusted for censoring at the tax- 


See Angrist (1989d) for details. Estimates of a for 
use in the formula for ® were computed by weighted 
least squares using the inverse of the sampling variance 
of the y,,; as weights. Estimates of V( y) and V( f) are 
discussed in Section 5 of the Appendix. Note that the 
TSIV estimator may also be motivated as an application 
of Optimal Minimum Distance (OMD) techniques such 
as those described by Gary Chamberlain (1982). Ignor- 
ing period and cohort effects, OMD estimates for the 
current problem are tabulated by choosing « and. p to 
minimize 


q(«, p) = 
pT Le sol Dl 


By concentrating out the estimate of p, it is easy to 
show that g(a, p) = m(0). 
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able maximum -using the procedure de- 
scribed in section 6 of the Appendix. 

The results in Table 4 show that white 
veterans born from 1950-52 suffered an an- 
nual earnings loss of between $1,500 and 
$2,100 constant (1978) dollars. These results 
generally are similar in magnitude to the 
Wald estimates reported in Table 3. Also, as 
in Table 3, regression estimates for adjusted 
FICA earnings tend to be bracketed by the 
results for unadjusted FICA and W-2 earn- 
ings. Although many of the estimates for 
individual cohorts in Model 1 are not signif- 
icant, the combined estimates for whites in 
Model 2 are substantially larger than twice 
their standard errors. In contrast, results for 
nonwhites show no evidence of a statistically 
significant earnings loss to veterans. 

The overidentification test statistics re- 
ported in Table 4 take on values less than 
their degrees of freedom, suggesting that the 
residuals in equation (1) are not correlated 
with lottery-based instruments. It should 
be noted, however, that low values of the test 
statistics may indicate low power in a test 
with so many degrees of freedom. On the 
other hand, without a particular alternative 
hypothesis in mind, it seems natural to re- 
port the omnibus goodness-of-fit test. 

Subtracting the test statistic for Model 1 
from the test statistic for Model 2 gives a 
chi-square test for the restriction of equal 
treatment effects across cohorts. The set of 
restrictions imposed by equal treatment ef- 
fects has three degrees of freedom. None of 
the chi-square statistics for Model 2 are 
larger than the corresponding statistics for 
Model 1 by as much as three, indicating that 


P Degrees of freedom for the overidentification tests 
are calculated as follows. For each race, the data consist 
of four years of earnings for three cohorts with 73 
lottery number cells each. The fourth cohort, men born 
in 1950, has four years of earnings with 2 lottery num- 
ber cells each. This gives a total of 884 cells or, equiva- ` 
lently, 884 categorical instruments. Model 1 includes 4 
cohort dummies, 3 year dummies, and 4 treatment 
effects. 884 minus 11 parameters gives 873 degrees of 
freedom. Model 2 has 3 fewer parameters than model 1 
and consequently the chi-square statistic for model 2 
has 876 degrees of freedom. 
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Model 1 
1950 


1951 
1952 
1953 
x?(873) 
Model 2 


1950-53. 


x^(876) 


Cohort 


Model 1 
1950 


1951 
1952 
1953 
x?(873) 


Model 2 
1950-53 


x^(876) 
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TABLE 4— TWO-STAGE INSTRUMENTAL VARIABLES ESTIMATES 


Whites 
FICA Adjusted FICA Total W-2 
Taxable Earnings Earnings Compensation 
—1709.2 — 2093.7 —1895.0 
(946.8) (1108.8) (1333.1) 
— 1457.1 — 1983.7 — 2431.4 
(959.3) (1036.1) (1152.1) 
= 1724.0 — 1943.0 — 2058.7 
: (8634) (927.2) (1001.9) 
1223.8 ` 900.7 — 488.6 
(3232.1) (3505.3) (3936.0) 
5783. 630.3 569.5 
—1562.9 —1920.4 — 2094.5 
(521.8) (575.9) (646.3) 
579.1 631.0 569.7 
Nonwhites 
FICA Adjusted FICA Total W-2 
Taxable Earnings Earnings Compensation 
3893.7 3891.9 5711.8 
(5358.5) (6244.5) (7206.0) 
— 891.3 — 333.4 2609.0 
(4397.1) (4664.2) (4894.6) 
—3182.9 . —3457.7 — 3068.0 
(3997.4) (4195.2) (4229.2) 
— 5928.3 —8571.4 — 6325.8 
.(10296.3) (10697.1) (11410.6) 
6167 ` 681.7 693.6 
— 643.3 — 999.7 366.7 
(2407.5) (2602.5) (2734.2) 
618.4 683.4 695.6 


Notes: Standard errors in parentheses. 
The table shows estimates of the effect of military service on average 1981-84 earnings 
in 1978 dollars. The estimation method is optimally weighted Two-Sample Instrumental 
Variables, described in the text. FICA and W-2 earnings are from the Social Security 
CWHS. The adjusted FICA series is described in the Appendix. 


the estimated treatment effects are not statis- 


tically different across cohorts. 


IV. Military Service and Loss of Labor 
Market Experience 


The simplest explanation for a veteran 
earnings penalty is that military experience 


is a poor substitute for lost civilian labor 


market experience. As evidence for this hy- 
pothesis, Griliches and Mason (1972) show 


` that the longer they were in the military, the 


less veterans earn relative to nonveterans. A 
test of the loss-cf-experience hypothesis is 
developed here using the functional form 
commonly employed in empirical studies of 
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human capital. The earnings function moti- 
vated by the theory of human capital is 
loglinear in years of schooling and log- 
quadratic in years of labor market exper- 
ience. This functional form puts testable 
restrictions on the time-series of earnings 
differences by veteran status.P? 

Adapting the human capital earnings 
function for the problem at hand, the earn- 
ings of individual i in cohort c at time t 
may be written 


(4a) 


Yeri = 8, + wido + BoX ict 


+ YXer + 
where y, now denotes log earnings, ô, is a 
time-varying intercept, fy, y, and ô are 
parameters. X; is the civilian labor market 
experience of man i in cohort c at time 1, 
taken here to be equal to [t — (c +18)— w, — 
5,1], where w; is the deviation of i's school- 
ing from the sample mean level of schooling, 
and l is years of military experience for 
veterans. As before, s, is a dummy variable 
that indicates military service. 

To focus on parameters that can be esti- 
mated using Social Security data, equation 
(4) is rewritten as 


(4b) Yeti = 8, 


+ Bo( x. i sl) ty(x, E sl)? 


— (2yW,xq—2ylw,s;) + Uys 


where x, 15t —(c +18) and §,,= ô, + w,(d9 
— By) + Yw?. Now, as in the previous analy- 
sis, assume. that schooling does not vary by 
lottery number. Assume also that schooling 
is independent of cohort—this seems rea- 
sonable for the small cohort range consid- 
ered here. Finally, to focus solely.on the loss 
of labor market experience, assume that 


Psee Mincer (1974) for theoretical justification of the 
human capital earnings function. A recent survey of the 
human capital literature is Willis (1986). 
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schooling is independent of veteran status. 
Then using dummy instrumental variables to 
group equation (4b) by cohort, year, and 
lottery number, average log earnings for 
members of cohort c at time ¢ in lottery- 
number cell j are 


(5) Yaj = 5, + Box, YX2 


e [ Bol y yl?| Be; 
a [2y0]( B.j%er) + Her 


where 6, now includes the period mean of 
6;,- 

A generalization of model (5) allows the 
linear experience term to vary with veteran 
status by letting the slope for individual i be 
B; = By + B,5;. In this case, mean cell earn- 
ings are characterized by 


(6) Yaj = 8, + Box, + ve 
- [Bol — v? + Bu] By 
a [2y] EH B B.j%er) + Wa. 


Models (5) and (6) both have the follow- 
ing reduced form in terms of unrestricted 
regression coefficients: 


(7) Ver = 6,+ BoX ce Tyx2 
* T; Bj + vy BEX 1) + us. 


Note that the reduced form veteran effect is 
Qo = T + TX Thus, these models parame- 
terize a time-varying veteran status coeffi- 
cient as a linear function of labor market 


VAveraging over c, t, and j eliminates (Oywjx,, — 
dyw,s;) because w; is orthogonal to x,, and 5 by 
assumption. Using ‘the fact that E(s;|c, j) = E(s? le, D 
= Pej» (4) simplifies to (5). Note that (5), (6), and (7) are 
not estimable if allowance need be made for cohort as 
well as period effects. Qualitatively similar estimates to 
those reported below were obtained when 6, was 
dropped in favor of cohort effects, although the good- 
ness-of-fit test leads to rejection of models without 
period effects. 
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TABLE 5— EARNINGS FUNCTION MODELS FOR THE VETERAN EFFECT, 
WHITES BORN 1950-52 : 
Model (6): Model (7): 
Model (5): Loss of Experience, Unrestricted 
Loss of Experience Reduced Growth Rate Reduced Form 

Parameter (1) (2) (3) 
Experience Slope, fj, 0.1022 0.1016 0.1016 

(0.007) (0.007) (0.007) 
Experience Squared, y — 0.0027 — 0.0025 — 0.0025 

(0.0003) (0.0003) (0.0003) 
Veteran Effect on Slope, fj — 0.0035 

(0.0023) 

Veteran Loss of Experience, / 2.08 1.84 

(0.38) . (0.43) 
m = — [Bol — yl? + Bil] — 0.189 

(0.052) 
m = — [Zyl - £1] 0.096 
(0.004) 

Age at Which Reduced Form 50.1 
Veteran Effect ( + 2x,,) =0 (15.9) 
x^ (dof) 1.41(1) 813.57(1247) 


Notes: Standard errors in parentheses. 


The table reports estimates of experience-earnings profiles that include parameters for the effect of veteran status. 
Estimates are of equations (5), (6) and (7) in the text. The estimating sample includes FICA taxable earnings from 
1975-84 for men born 1950, 1976-84 earnings for men born in 1951, and 1977-84 earnings for men born 1952. The 
estimation method is optimally weighted Two-Sample Instrumental Variables for a nonlinea model in columns (1) 


and (2), and for a linear model in column (3). 


experience. Excluding the time-varying inter- 
cept, model (6) contains four structural pa- 
rameters; By, i y, and /, and four reduced 
form parameters; Bp, y, 7, and m, Model 
(5) imposes one testable restriction on the 
reduced form by setting f, = 0. 

Table 5 shows results from nonlinear GLS 
estimation of (5) and (6), and results from 
Linear GLS estimation of (7), using data on 
the real FICA earnings of white men born 
from 1950 to 1952. The weighting matrix 
used in estimation was derived in a manner 
similar to the weighting matrix used to con- 
struct the estimates in Table 4.16 Because 


Sa third model is derived by letting both y and f 
vary with veteran status. This model leads to a reduced 
form similar to (7), with the only modification being the 
addition of a linear term of the form m,(f,, * x2). In 
the empirical work, however, no evidence was found 
that such a term belongs in the earnings function. re- 
duced form. 

The only modification arises from the fact that for 
equations (5)-(7), reduced form treatment effects ap- 
pearing in the weighting matrix are time varying. Let 
IL, denote the vector of o,, for the time-series of 


earnings functions are commonly fit in logs, 
the dependent variable is taken to be the log 
of mean earnings for each cell. The log of 
the mean is not the same as the mean of the 
log, but the CWHS data set does not contain 
the mean of log earnings. If earnings are 
approximately lognormally distributed, use 
of the log of the mean will provide a reason- 
able approximation. In practice, estimates of 


earnings by cohort c and suppose that each time-series 
' is of length T. Then the second term in the optimal 


weighting matrix has the following block corresponding 
to the time series of earnings for lottery number cell j 
of cohert c: 


(1/n,;) II, : 


x { ferer-J@{ [851 7 Bey) ]} nz. 


where ep is ‘a vectcr of T 1’s. In practice, II, is 
replaced by weighted least squares estimates (weights 
are the inverse sampling variances of y,;) of the re- 
duced form equation, (7). Estimates of models for the 
log of earnings replace V(y)' with delta-method esti- 
mates of the covariance matrix of log( y). 
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models in levels resulted in inferences simi- 
lar to those arising from estimates of models 
in logs. 

The sample used to estimate equations 
(5)-(7) begins in the fifth year after the 
lottery in which members of the cohort were 
drafted. For example, the sample begins in 
1975 for men born in 1950. This allows for 
three years of service and one year of read- 
justment to civilian life. Median length of 
service of Vietnam era veterans was 37 
months (Veterans Administration 1981b, p. 
16). Evidence from Table 4 suggests that 
veteran effects are essentially zero for the 
1953 cohort, so it was excluded from the 
estimation. 

Estimates of models (5), (6), and (7) are 
presented in columns (1), (2), and (3) of 
Table 5. The chi-square statistic in column 
'(3) is an overidentification test statistic for 
the overall goodness-of-fit of the reduced 
form and the chi-square statistic in column 
(1) is for the restriction. B, = 0. Restricting 
B, to be zero does not affect the overall fit. 
The effect of veteran status on earnings 
growth, although negative, is not statistically 
significant. The sum of the test statistics in 
columns (1) and (3) is an overidentification 
test statistic for the overall goodness-of-fit of 
model (5). This statistic also takes on a value 
less than its degrees of freedom, indicating 
that the simplest loss-of-experience model is 
not at odds with the data. 

The loss of experience estimated using 
model (5) is approximately 2 years. This is 
'somewhat low relative to the median length 


of service, suggesting that military experi- 


ence may be a partial substitute for civilian 
‘experience. Table 5 also shows the age at 

` which the reduced form veteran effect finally 
reaches zero. This occurs when x,,— — m/ 
fiz, SO that a,,=0. The reduced form esti- 
mates in column (3) imply that the loss of 
earnings to veterans decays to zero around 
age 50 with a standard error!" of 16. 


Delta method standard errors are given by the 
square root of (9o,/72)— (20157 /m) + (&5]/ 3 ), 
where c, 0, and: 9,2 are the elements of the covariance 


matrix of the estimated 7, and 7. 0,2 is estimated to 


be — 0.00019, and the square roots of a, and o; appear 
in! Table 5. 


ANGRIST: LIFETIME EARNINGS AND VIETNAM ERA DRAFT LOTTERY 329 


V. Caveats 


The consistency of lottery-based estimates 
of the effects of military service turns on two 
key assumptions. First, earnings model (1) 
must be an accurate representation of the 
impact of military service. Second, functions 
of the draft lottery must be valid instru- 
ments for s; in the earnings regressions. 
Three models leading to the failure of these 
assumptions are discussed briefly below. The 
first two, incorporating treatment effect het- 
erogeneity and missing covariates, merely re- 
sult in a reinterpretation of the estimates. 
The third model, allowing for earnings-mod- 
ifying draft-avoidance behavior, calls into 
question the assumption that earnings dif- 
ferences by lottery number can be attributed 
solely to veteran status. 


A. Treatment Effect Heterogeneity 


The effect of veteran status can only be 
estimated for those who served in the armed 
forces. If veterans are more or less likely to 
benefit from military service than the rest of 
the population, then the estimated veteran 
Status coefficient does not characterize the 
impact of veteran status on a random sam- 
ple. This problem is formalized in the ran- 
dom coefficients model for treatment effect 
heterogeneity: 


Yeri = Be + 8, + 540; ug 


a; = Ay t Ejs 


where a; is the effect of military service on 
the earnings of person 7, with population 
mean equal to ap. If e; is uncorrelated with 
the.instruments, an instrumental variables 
estimator of the random coefficients model 
identifies ag + E(sjs; 1) and not a, 
(James Heckman and Richard Robb, 1985). 
Related to the problem of treatment effect 
heterogeneity is the fact that not all Vietnam 
era accessions to the military were induced 
by the draft. A substantial fraction of enlist- 
ments were made by "true volunteers" who 
would have volunteered for service in the 
absence of a draft (Tarr, 1981). Suppose that . 
the impact of military service on the earn- 
ings of true volunteers differs from the im- 
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pact on the earnings of draftees and men 
who enlisted because of the draft. Then us- 
ing functions of the draft lottery as instru- 
ments will only identify the effect of military 
service for the latter group. To see this, let f, 
be an indicator for the veteran status of 
draftees and draft-motivated volunteers and 
let g, be an indicator of veteran status for 
true volunteers. Write 


0,5, = af, + 0,8; 


for the treatment. effect experienced by i. 
Note that functions of draft lottery numbers 
wil only be correlated with f, Conse- 
quently, a, is identified by lottery-based IV 


estimators, but «,g, becomes part of the: 


regression error. 
B. The Absence of Covariates 


An additional problem arises because So- 
cial Security data contain no information on 
covariates other than race and age. This may 
be of concern if the impact of veteran status 
is primarily through its effect on covariates. 
For example, veterans might have a higher 
level of educational attainment because of 
financial aid available through the GI Bill. 
In the absence of data on education, esti- 
mated veteran effects confound the.“ pure” 
effect of military service with its effect on 
education. Formally, the need for covariates 
may be represented by replacing 6, with x,6 
in (1). In this case, instrumental variables 
estimates identify 


Lis [E(xils; 71) Ni E(x;|s,=0)]8+ a. 


In many applications, however, it may be a. 


that is actually the parameter of interest. For 
example, the fact that veteran status influ- 
ences civilian earnings primarily through its 
influence on third variables might be of little 
importance for issues related to veterans 
compensation. 


C. Earnings-Modifying Draft 
Avoidance Behavior 


- Perhaps the most serious problem arises if 


the risk of induction affected earnings for 
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some other reason than through an effect on 
the probability of military service. For exam- 
ple, it is sometimes argued that during the 
Vietnam era, students went to college to 
avoid the draft and that educational stan- 
dards were reduced to avoid having to flunk 
students out of school. Lawrence Baskir and 
William Strauss (1978) claim that Vietnam 
era college enrollment -was 6-7 percent 
higher than normal because of the draft. 

If draft-avoidance behavior is correlated 
with lottery numbers and with variables re- 
lated to-earnings besides veteran status, then 
lottery-based instruments will be correlated 
with the regression error in equation (1). 
Such correlation will bias estimates of the 
effects of military service constructed using 
the lottery. But in a previous study using 
micro data (Angrist, 19892), specification 
tests provided no evidence of a relationship 
between lottery numbers and characteristics 
other than veteran status. The overidentifi- 
cation test statistics reported here also show 
no evidence of omitted variables bias. Fi- 
nally, even if having a low lottery number is 
correlated with a tendency to stay in school, 
the fact that earnings rise with schooling 
implies that lottery-based estimates of the 
effect of veteran status will tend to underes- 
timate the true effect. 


VI. Conclusions 


Estimates based on the draft lottery indi- 
cate that as much as ten years after their 
discharge from service, white veterans who 
served at the close of the Vietnam era earned 
substantially less than nonveterans. The an- 
nual earnings loss to white veterans is on the 
order of $3,500 current dollars, or roughly 
15 percent of vearly wage and salary earn- 
ings in the early 1980s. In contrast, the esti- 
mated veteran effects for nonwhites are not 
statistically significant. 

In light of the results reported here, some 
more conventional estimates of the effect of 
Vietnam era veteran status do not appear to 
be too far off the mark. Rosen and Taubman 
(1982) report estimates close to these, finding 
a 19 percent annual earnings loss to Vietnam 
era veterans. Crane and Wise (1987) find an 
11 percent reduction in 1979 weekly earn- 


- 
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ings. On the other hand, Mark Berger and 
Barry Hirsch (1983) find essentially no effect 
on 1977 weekly earnings, and Angrist (1989a) 
reports OLS coefficients of zero when using 
the National Longitudinal Survey (NLS) to 
estimate the effects of veteran status on 
hourly wages. In contrast to the OLS esti- 
mates, lottery-based estimates from the NLS 
indicate that white veterans had lower hourly 
wages in 1981 than their nonveteran coun- 
terparts. Similar results from the SIPP are 
reported in Angrist (1989c). Thus, lottery- 
based estimates from a variety of sources 
provide conclusive evidence that white Viet- 
nam veterans were disadvantaged by their 
service.!? 

This paper also proposes a simple expla- 
nation for the loss of earnings to white veter- 
ans: they earn less because their military 
experience is only a partial substitute for the 


civilian labor market experience lost while in 


the armed forces. Goodness-of-fit tests sug- 
gest that for whites, the time-series of vet- 
eran status coefficients is consistent with this 
hypothesis. Experience-earnings profiles esti- 
mated using Social Security data imply that 
white veterans suffered an earnings reduc- 
tion equivalent to the loss of two years civil- 
ian labor market experience. 

The analysis reported here leads naturally 
to further research on a number of topics. 
One of these is the question of alternatives 
to the loss-of-experience explanation for the 
reduction in white veterans' earnings. Vet- 
eran status may be a screening device, as 
suggested by DeTray (1982), or there may be 
cohort size effects such as those discussed by 
Finis Welch (1979). Because the Social Secu- 
rity data include information on variances, 
testable implications of these theories might 
also include restrictions on second, as well as 
first, moments. Another question for future 
research is whether draft eligibility affected 
educational and career plans independently 
of its effect on military service. The lottery 


187 ottery-based estimates from the SIPP and NLS 
suggest that nonwhite veterans may have higher monthly 
and hourly earnings than comparable nonveterans. The 
estimates for nonwhites are too imprecise, however, to 
be viewed as conclusive. 
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may provide a useful tool for research on 
changing educational attainment in the 1960s 
and 1970s. 

Finally, there remains the question of rec- 
onciling the loss of earnings to Vietnam era 
veterans with the apparent benefits of mili- 
tary service to veterans of World War II and 
other eras (Rosen and Taubman, 1982; 
Berger and Hirsch, 1983). Elsewhere, Alan 
Krueger and I have argued that the need for 
reconciliation is, at least in part, illusory 
(Angrist and Krueger, 1989). Although OLS 
regressions usually show that the. effect of 
World War II veteran status is large, posi- 
tive, and significant, these results may actu- 
ally be a consequence of selection bias. By 
exploiting the fact that World War II vet- 
eran status is also correlated with exact date 
of birth, we have implemented an instrumen- 
tal variables estimation strategy similar in 
spirit to the one used here. The results of 
this procedure indicate that the true impact 
of World War II veteran status on earnings 
is no larger than zero and may well be 
negative. 


APPENDIX: DATA SOURCES AND METHODS 


1. CWHS Data Collection? 

The Social Security Administration maintains the 
earnings histories of covered employees in a data base 
known as the Summary Earnings Record (SER). Ap- 
proximately one year after the SER has been updated 
with the latest year's earnings, a one percent sample of 
earnings histories is extracted. The sampling frame con- 
sists of all issued Social Security numbers, and the 
sample is stratified using the-regional information coded 
in the numbers. 

Prior to 1978, the FICA taxable earnings of employ- 
ees were reported to the SSA by employers on a quar- 
terly basis. Self-employed workers report their earnings 
annually on schedule SE of Internal Revenue Service 
(IRS) Form 1040, which is forwarded to the SSA by the 
IRS. Since 1978, employers have no longer been re- 
quired to make quarterly reports. Instead they file IRS 
form W-2 with the SSA, as well as with the IRS, on an 
annual basis. 

After 1978, all earnings, idodi those above the 
FICA taxable maximum, are to be reported to the SSA 
on form W-2. Furthermore, all employers are required 


19-This section draws on Warren Buckler and Cresston 
Smith (1984), U.S. Department of Health and Human 
Services (1987), and personal correspondence and con- 
versations with Buckler and Smith. 
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to file W-2s with the SSA regardless of whether their 
employees are engaged in FICA taxable employment. 
In practice, however, many employers do not report the 
earnings of those engaged in non-FICA-taxable employ- 
ment. A further shortcoming of the W-2 series is the 
poor quality of the data during the first years of annual 
reporting. ` 


2. Coverage and Truncation of the CWHS Earnings 
Series 

FICA coverage includes most wage and salary and 
self-employment earnings. For the sample period used 
here, the most important coverage exceptions are the 
majority of federal civilian employees, some state and 
local government employees, some agricultural and do- 
mestic workers, and the employees of some nonprofit 
organizations. A view of coverage by industry in 1981 is 
given in Robert Meyer (1985, Table 2.1). Meyer’s fig- 
ures for state, local, and civilian federal government 
employees imply that by 1981, roughly 58 percent of all 
civilian government workers were covered. 

FICA taxable maximums are reported in Appendix 
Table Al. The combined effects of limited coverage and 
censoring at the taxable maximum are conveniently 
summarized by the percentage of all earnings that are 
reported to the SSA. These statistics, reported in De- 
partment of Health and Human Services (1987, Table 
30), show that after 1981 over 90 percent of wage and 
salary earnings and over 75 percent of self-employment 
earnings were reported to the SSA. 

The W-2 earnings series excludes earnings from 
self-employment. Unpublished estimates indicate that 
roughly 6.4 percent of men born between 1944 and 1953 
with nonzero earnings in 1984 had self-employment 
earnings only (figures from private correspondence with 
SSA employees). Other differences between FICA and 
W-2 earnings coverage are described by Mary Millea 
and Beth Kilss (1980). 


3. Matching Dates of Birth to the CWHS- 

In the CWHS, information on race and sex is ob- 
tained from a computerized record of applications for a 
Social Security number called the NUMIDENT file. 
For this project, dates of birth were also matched from 
the NUMIDENT file to the CWHS. Draft lottery num- 
bers were then matched to the dates of birth using 
lottery number tables in Selective Service System Semi- 
annual Reports for 1969-73. A small number of indi- 
viduals ‘were discarded from the final data set because 
there was no information on either their sex, race, or 
exact date of birth. 


4. CWHS Descriptive Statistics 

Descriptive statistics for the CWHS cohorts studied 
here are reported in Table Al. The combined statistics 
for four cohorts were constructed by computing 
weighted averages of cohort means. Unless otherwise 
noted, statistics in the table refer to men with positive 
earnings. Sample sizes decline over time due to attrition 
from mortality. 

The descriptive statistics indicate that after 1972 the 
fraction of men in the sample with zero FICA earnings 
varies roughly between 15 and 22 percent for whites and 
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between 33 and 36 percent for blacks. To evaluate these 
figures, note that the author's tabulations show that 
roughly 10 percent of white men in these cohorts have 
zero recorded wage and'salary earnings in the late 
1970s’ Current Pcpulation Surveys. Suppose that of the 
90 percent who work, 12 percent are in the uncovered 
sector so that only 79 percent of the cohort may be 
expected to have positive FICA wage and salary earn- 
ings. Adding an estimated 5 percent who only have 
FICA self-employment earnings implies that 16 percent 
should have zero FICA earnings of any type. Thus, 
14-18 percent of CWHS earnings being zero for whites 
between 1973 and 1980 does not seem unreasonable. . 

In recent years, the fraction with zero earnings ap- 
pears to be too high to be accounted for by labor force 
participation or employment in the uncovered sector. 
This is probably because of the long delay in filing and 
recording Social Security taxable earnings. In their anal- 
ysis of Social Security data, Card and Sullivan (1988) 
also note the problems caused by filing delay. Problems 
of undercoverage and filing delay may be especially 
severe for nonwhites. The 15 percent of CPS nonwhites 
with zero wage and salary earnings is not large enough 
to explain the approximately 34 percent of nonwhites 
with zero FICA earnings in the CWHS. 

The fraction with FICA earnings at the taxable max- 
imum is more- variable than the fraction with zero 
earnings, ranging from 3 to 15 percent for whites and 
between 2 to 10 perzent for blacks. The FICA earnings 
of men with multiple employers can exceed the taxable 
maximum because reported earnings are censored by 
source. The fraction of men with FICA earnings above 
the taxable maximum is around 1-2 percent. The W-2 
and FICA earnings series show roughly equal fractions 
at or above the FICA taxable maximum, suggesting that 
both variables are drawn from the same underlying 
distribution. But problems with early years of the W-2 
series are clearly reflected in the sample statistics. For 
example, the standard deviation of whites’ W-2 earnings 
in 1978 is six times the mean and does not fall below 
the mean until 1981. Another disturbing feature of the 
W-2 series is that nominal earnings appear to fall from 
1978—80. The W-2 series also has a substantially higher 
fraction of zeros than the FICA series does. However, 
part of this difference is caused by the inclusion of 
self-employment earn:ngs in the FICA series. Also, gen- 
erally there are some individuals with FICA taxable 
earnings but no federally taxable compensation (Millea 
and Kilss, 1980). 


: 5. Covariance Estimates for Social Security Earnings 


and for Pej 

Information on second moments in the aggregated 
CWHS is restricted to variances. Therefore, off-diagonal 
elements of the covariance matrix of Social Security 
earnings must be estimated. Recall that y denotes the 
vector of y, where c indexes cohort, ¢ indexes year of 
earnings, and j indexes lottery number cells. The co- 
variance matrix of y is block diagonal, with nonzero 
elements for the covariance between y,, and y,,;, and 
zeros everywhere else. Thus, there is only correlation 
between elements of the time-series of earnings for a 
particular cohort and lottery number group. The in- 
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Racé Year 


White 69 


81 
82 


Nonwhite 69 


84 
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TABLE A1— DESCRIPTIVE STATISTICS FOR MEN BORN 1950-53 


N 
68,407 
68,339 
68,244 
68,154 
68,053 
67,966 
67,882 
67,194 
67,691 
67,598 
67,503 
67,413 
67,316 
67,265 
67,190 
67,114 
21,514 
21,501 
21,475 
21,443 
21,412 
21,380 
21,339 
21,310 
21,275 
21,243 
21,200 
21,167 
21,130 
21,109 
21,077 


21,042 


Taxable 
Maximum 
7,800 

7,800 
7,800 
9,000 
10,800 


13,200 


14,100 


15,300 
16,500 
17,700 
22,900 
25,900 
29,700 
32,400 
35,700 
37,800 

7,800 

7,800 

7,800 

9,000 
10,800 
13,200 
14,100 
15,300 
16,500 
17,700 
22,900 
25,900 
29,700 
32,400 
35,700 


37,800 


FICA 
Earnings 


1,473 
(1,457) 
1,977 
(1,825) 
2,581 
(2,233) 
3,614 


FICA at 
Limit^* 


0.003 
0.010 
0.030 
0.040 
0.046 
0.034 
0.053 
0.078 
0.107 
0.149 
0.092 
0.092 
0.089 
0.092 
0.089 
0.103 
0.002 
0.007 
0.020 
0.025 
0.029 
0.021 
0.031 
0.049 
0.071 
0.096 
0.061 
0.063 
0.063 
0.067 
0.067 


0.077 


FICA 
Zeros 


0.321 
0.241 
0.194 
0.157 
0.141 
0.145 
0.169 


0.167 


` 0.163 


0.165 


0.219 
0.464 
0.417 
0.388 
0.352 
0.330 
0.336 
0.359 
0.350 
0.341 
0.338 
0.329 
0.331 
0.335 
0.354 
0.358 


0.353 


W-2 


Earnings? 


15,435 
(91,032) 
14,786 
(65,359) 
14,561 
(28,180) 
16,363 
(15,295) 
17,907 
(16,298) 
19,595 
(22,470) 
21,595 
(20,856) 


13,439 
(63,722) 
13,581 
(63,244) 
11,716 
(13,895) 
13,421 
(14,644) 
14,983 
(16,112) 
16,271 
(19,089) 
17,905 
(20,784) 


W-2 at 
Limit? 


0.167 
0.097 
0.096 
0.092 
0.093 
0.089 
0.101 


0.116 
0.067 
0.067 
0.063 
0.065 
0.065 
0.073 
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W-2 
Zeros* 


0.244 
0.261 
0.272 
0.272 
0.282 
0.275 
0.281 


0.391 
0.393 
0.397 
0.388 
0.408 
0.396 
0.387 


Notes: Statistics are from the Social Security Administration CWHS. Standard deviations of earnings in parentheses. Amounts are in current 


dollars. Sample statistics are weighted averages of cells for each race, year of birth, and five consecutive lottery numbers. 


“Fractions at limit are fractions of nonzero earnings at or above FICA taxable maximum. Fractions zero are fractions of all nondecedents 


with zero earnings, 


"FICA earnings are wage and salary and self-employment earnings in Social Security taxable employment. FICA taxable earnings are 
censored at the taxable maximum except for those with multiple sources. Multiple sources are censored by source. 
*Ww-2 earnings are total W-2 form wage and salary compensation, not censored at the Social Security taxable maximum. W-2 2 earnings do not 
include earnings from self-employment. 
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tertemporal correlation structure is assumed constant 
across lottery number groups so that the 73 cells avail- 
able for each race and cohort may. be used to estimate 
correlation matrices for all race-cohort combinations. 
Correlations are converted to covariances using the 
within cell variances available from the CWHS data. 
This procedure is also applied to the adjusted FICA 
series described in Section 6 of the Appendix, with the 
modification that adjusted standard errors (diagonal 
elements of Q) are used to convert correlations to 
covariances. 

The covariance matrix of P is also block diagonal, 
with elements equal to the variance of f,, in every 
element of the block corresponding to the time series 
for cohort c and lottery cell j. The variance of f,, is 
estimated using the standard formula for an estimated 
proportion. The sample size in the formula is taken to 
be the size of the SIPP cohort for those born in 1950 
and the size of the CWHS cohort for those born from 
1951 to 53. 


6. FICA Earnings Adjusted for Censoring 

. For economy of notation, in this section all cells are 
indexed by j. The relationship between the expectation 
of censored earnings and the expectation of uncensored 
earnings for cell j is given by 


w= pi pli - Ly), 


where po is the mean of uncensored earnings, 5 is the 
mean of censored earnings, ul, is the mean of earnings 
above the taxable maximum, L; is the taxable maxi- 
mum, and pj is the fraction with earnings at or above 
the taxable maximum.” 

The FICA earnings series is adjusted for censoring 
by applying this formula using estimates of p^ tabulated 
from March Current Population Surveys (CPS) for each 
year, race, and cohort. Although this procedure involves 


This formula ignores the fact that the censored 
earnings of men with multiple employers may be above 
the taxable maximum. The formula may be used to 
analyze the bias in treatment effects estimated from 
censored data. Suppose that cell j is for a sample of 
draft-eligible men and that cell k is for a sample of 
draft-ineligible men. The draft-eligibility treatment ef- 
fect estimated from the difference between CWHS cen- 
sored mean earnings in cells j and & is the sample 
analogue of 


i-i = (ufo) + (n rh) s) 
+ nh i). 


Assuming that the effect of draft eligibility on both p’ 
and p! is of the same sign as the effect on p°, this 
expression shows that treatment effects estimated from 
the censored data differ from the true treatment effect 
by terms that are opposite in sign from the true effect. 
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no parametric distributional assumptions, the adjust- 
ment is only approximate because the CPS estimates of 


. ul, do not vary by lottery number. However, the adjust- 


ment does incorporate variation in p! by lottery num- 
ber. 

Data on earnings above the FICA taxable maximum 
(ui) are taken from the Mare-Winship March CPS 


- Uniform files. Cohort was determined on the basis of 


age in 1985; ages 34-35 were assigned 1950, ages 33-34 
were assigned 1951, ages 32-33 were assigned 1952, and 
ages 31-32 were assigned 1953. The CPS reports wage 
and salary earnings in the year preceding the survey 
year. To compute the adjusted 1981—84 earnings series 
used in Tables 3 and 4, data from the CPSs for 1982-85 
were used to compute separate a p} by cohort and race 
for each year. Additional details and summary statistics 
for the CPS data are reported in Appendix A of Angrist 
(1989c). It should be noted that CPS earnings data are 
also censored—at $75,000 for 1981—83 earnings and at 
$100,000 for 1984 earnings. 

Standard errors for the adjusted series are calculated 
as follows. Let m= [mi m4 m4} denote the vector of 
sample moments corresponding to u^, p’, and p! and let 
2, ;(i, j =1,2,3) denote the corresponding blocks of the 
covariance matrix of m. The covariance matrix of m is 
assumed to be given by 


IQ EA 0 
Z2 Žv 0 
0 0 Z4 


The delta-methed covariance matrix for the vector of 
adjusted earnings, m; + m5(m3 ~ L), is 


Q= Fy, +22 (my ~ L) 
+ In (m — Ly + m23. 


Only the diagonal elements of Q are estimated using the 
above formula. Estimates of diagonal elements of Zi 
and È, are available from the CWHS cell statistics, 
while diagonal elements of 2, are estimated from the 
CPS micro data. An estimator for diagonal elements of 
24, is easily shown to be (Angrist, 1989c) 


0,7 [ (5 7 #5) | /n,- 


A simplified procedure, described in Section 5 of this 
Appendix, is used to estimate the nonzero off-diagonal 
elements of Q directly from the adjusted cell means. 


. 7. Matching Draft-Eligibility Status to the SIPP 


The Survey of Income and Program Participation is 
a Census Bureau longitudinal survey of approximately 
20,000 households in the civilian noninstitutional popu- 
lation (U.S. Department of Commerce, 1985). Data for 
the first wave of the first SIPP panel were collected from 
four rotation groups in 1983 and 1984. 

The SIPP public-use tapes contain year and month 
of birth. A Census Bureau "in-house" version of the 
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SIPP contains information on day of birth, which is not 
released to the public. At the author’s request, this 
information was used to match a dummy variable for 
draft-eligibility status to the public use version of SIPP 
Panel I. Draft eligibility was determined by the official 
RSN ceiling for men born from 1944-52 and by RSN 
95 for men born in 1953. Vietnam era veteran status is 
coded from the SIPP variables VETSTAT, which records 
veteran status, and from U-SRVDTE, which records the 
period of service. ` 
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Unintended Impacts of Public Investments 
on Private Decisions: 
The Depletion of Forested Wetlands 


By ROBERT N. STAVINS AND ADAM B. JAFFE* 


By affecting relative economic returns, public infrastructure investments can 
induce major changes in private land use. We find that 30 percent of forested 
wetland depletion-in the Mississippi Valley has resulted from private decisions 
induced by federal flood-control projects, despite explicit federal policy to preserve 
wetlands. Our model aggregates individual land-use decisions using a parametric 
distribution of unobserved land quality; dynamic simulations are used to quantify 
the impacts on wetlands of federal projects and other factors. (JEL 717) 


Private land-use decisions can be affected 
dramatically by public investments in high- 
ways, waterways, flood control, or other in- 
frastructure. The large movement of jobs 
from central cities to suburbs in the postwar 
United States and the current destruction of 
Amazon rain forests have occurred with ma- 
jor public investment in supporting infra- 
structure. As these examples suggest, private 
land-use decisions can generate major envi- 
ronmental and social externalities. Hence, 
the extent to which major investment pro- 
grams create “secondary impacts” through 
their effects on private decisions is a matter 
of great public concern. In this paper, we 
demonstrate that the depletion of forested 
wetlands in the Mississippi Valley (an im- 
portant environmental problem and a North 


*John F. Kennedy School of Government, Harvard - 


University, Cambridge, MA: 02138, and Department 
of Economics, Harvard University, Cambridge, MA: 
02138, respectively. Helpful comments on earlier ver- 
sions of the paper were provided by: Zvi Griliches, 
Partha Dasgupta, Christopher Cavanagh, Joseph Kalt, 
Peter Timmer, Robert Pindyck, Hendrik Houthakker, 
Mark Watson, Jon Goldstein, Lawrence Goulder, Marie 
Lynn Miranda, James Hines; seminar participants at 
Harvard, Stanford, Michigan, Northwestern, the Rand 
Corporation, Resources for the Future, and the World 
Resources Institute; and two anonymous referees. The 
authors alone, however, are responsible for any remain- 
ing errors. Financial support was provided by grants 
from Resources for the Future, the U.S. Department of 
the Interior, and the Environmental Defense Fund. 
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American precursor to the loss of South 
American rain forests) has been and is cur- 
rently exacerbated by federal water-project 
investments, despite explicit federal policy to 
protect wetlands. 

We begin with a structural model of an 
individual landowner’s decision of whether 
or not to convert a parcel of land from its 
natural, forested state to agricultural use. 
This problem can be characterized as an 
optimal stopping problem, and is similar to 
those investigated by Glenn Gotz and John 
McCall (1980 and 1984), Ariel Pakes (1986), 
John Rust (1987), and Mark Rosenzweig 
and Kenneth Wolpin (1988). Unfortunately, 
such models require individual data for esti- 
mation; but in the present context data on 
land-use status and other characteristics of 
individual parcels would be prohibitively ex- 
pensive to obtain over an area large enough 
to contain significant variation in the extent 
of federal investment. Therefore, we need a 
model that will explain the proportion of 
some aggregate (counties, in our case) that 
will be converted; this leads to the problem 
of the appropriate aggregation of individual 
optimal decisions to the county level. 

If the land in a county could be assumed 
to be homogeneous, then the required aggre- 
gation could be accomplished simply by 
modeling a "representative parcel" and as- 
suming that variations within counties were 
purely random. Such an assumption, how- 
ever, is untenable. Some land in a county is 
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of higher (potential) agricultural quality and 
will therefore be developed first. If we ignore 
this reality and estimate a model on a panel 
of counties over time, any variable whose 
value is initially high, and then falls, will 
appear to cause conversion. More generally, 
if we are to predict, based on the past, what 
would happen if we change policy in the 
future, it is necessary to take into account 
the fact that the marginal acre today is dif- 
ferent from the marginal acre of the past. 
We solve the problem by positing a para- 
metric distribution for the unobserved qual- 
ity of land within a county. The parameters 
of this distribution are then estimated jointly 


with thé parameters of the individual-level, ' 


structural model. This allows us to quantify 
the effects of federal programs and policies, 
via dynamic simulations with the estimated 
distributional and structural parameters.! 

The first section of the paper describes the 
problem of wetland depletion. The second 
section develops the theoretical framework, 
including the modeling of unobserved het- 
erogeneity of land parcels. Section III pre- 
sents econometric results, used in Section IV 
to carry out dynamic simulations. Conclud- 
ing comments are found in Section V. 


I. Forested Wetlands and Public Policy 


Forested wetlands are among the world's 
most productive ecosystems, providing im- 
proved water quality, erosion control, 
floodwater storage, timber, fish and wildlife 
habitat, and recreational opportunities. Their 
continuing depletion is a serious land-use 
problem; preservation and protection of wet- 
ands have been major federal environmental 
policy goals for at least twenty years. The 


'The problem of estimating a transition process for 
individuals on the basis of aggregate data arises in 
many areas. In natural resource economics, in particu- 
lar, it has been identified as a major gap between the 
theoretical literature on optimal extraction and the em- 
pirical literature on resource depletion (Douglas Bohi 
and Michael Toman, 1984). Also related: is work in 
demography (for example, Richard Gill, 1986) and tech- 
nological diffusion (Zvi Griliches, 1957; Paul David, 
1966; and Richard Pomfret, 1976). 
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largest remaining wetland habitat in the con- 
tinental United States is the bottomland 
hardwood forest of the Lower Mississippi 
Alluvial Plain. Originally covering 26 million 
acres in seven states, this resource was re- 
duced to about 12 million acres by 1937. 
Since then, another 6.5 million acres have 
been cleared, primarily for conversion to 
cropland. 

The owner of a wetland parcel faces an 
economic decision involving revenues from 


- the parcel in its natural state (primarily from 


timber), costs of conversion (the cost of 
clearing the land minus the resulting forestry 
windfall), and expected revenues from agri- 
culture. Agricultural revenues depend on 
prices, yields, and, significantly, the drainage 
and flooding frequency of the land. Needless 
to say, landowners typically do not consider 
the positive environmental externalities gen- 
erated by wetlands; thus conversion may 
occur more often than is socially optimal. 

These externalities are the motivation for 
federal policy aimed at protecting wetlands, 
as embodied in the Clean Water Act. 
Nonetheless, the federal government has en- 
gaged in major public investment activities, 
in the form of U.S. Army Corps of Engi- 
neers (Corps) and U.S. Soil Conservation 
Service (SCS) flood-control and drainage 
projects, which appear to make agriculture 
more attractive and thereby encourage wet- 
land depletion. The significance of this effect 
is disputed by the agencies which construct 
and maintain these projects; they attribute 
the extensive conversion exclusively to rising 
agricultural prices (U.S. Army Corps of En- 
gineers, 1981). Our approach allows us to 
sort out the effects of federal projects and 
other changing economic forces. As we will 
see, these public investments appear to have 
been a substantial factor causing conversion 
of wetlands to agriculture.” 


Theoretical models of wetland conversion are found 
in John Brown (1972) and Leonard Shabman (1980). 
Our empirical results should be compared with Randall 
Kramer and Leonard Shabman 1986, and U.S. Depart- 
ment of the Interior 1988. 
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I. The Privately Optimal Land-Use Decision 
and Its Implications for the Behavior 
of Heterogeneous Aggregates 


The first step is the construction of a 
dynamic optimization model of forestry and 
agricultural production at the individual, 
landowner level. The solution of this model 
yields necessary conditions for conversion of 
individual forest parcels to agricultural pro- 
duction and for abandonment of parcels of 
cropland. An explicit model of the hetero- 
geneity of land allows for the aggregation of 
the respective necessary conditions. to the 
county level, so that an econometrically es- 
timable model can be specified. 


A. A Dynamic Optimization Model of 
Forestry and Agricultural Production 


Landowners observe a variety of eco- 
nomic, hydrologic, and climatic factors rele- 
vant to decisions regarding the use of their 
lands for forestry or agricultural production. 
Current and past values of variables presum- 
ably constitute the basis for expectations 
about future values of variables. In particu- 
lar, landowners observe agricultural prices 
and production costs, typical agricultural 
yields for their area, typical timber returns, 
and the suitability of individual land parcels 
for agriculture. A prime factor determining 
such suitability of land (in the geographic 
area of this study) is its wetness, that is, the 
degree of (natural and artificial) protection 
from flooding and poor drainage. 

A landowner continually faces a decision 
of whether to keep land in its current state, 
to convert forested land to agricultural pro- 
duction,’ or to abandon cropland and allow 


>The potential sale of a parcel is economically irrele- 
vant, since a new owner faces the same conversion/ 
abandonment decision. The option of residential or 
commercial development is empirically insignificant. Fi- 
nally, since land prices presumably reflect the present 
value of net revenues under optimal use, land-price 
data, if available, would provide an‘alternative means of 
examining the phenomena modeled here. See Richard 
Arnott and Frank Lewis (1979). 
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it to return to forest. A risk-neutral* land- 
owner faced with the decision of how to 
utilize his land, given the alternatives of 
forestry and agriculture, may be expected to 
seek to maximize the present discounted 
value of the stream of expected future re- 
turns to his land:? 


(1) (noni ax f f [Antin 4G] - PM 
7 Ci Bij + fuSi 
Wit8ije T Dji;] e 't' dt 
subject to: 
(2) S, Pipe B 
(3) 0x gi, S Es 
(4) 0x > Vije S Ujj, 


where i indexes counties, j indexes individ- 
ual land parcels, and ¢ indexes time; upper- 
case letters represent stocks or present val- 
ues; and lowercase letters represent flows. 
The variables are 


A 7 discounted present value of the infinite 
stream of typical expected agricultural 
revenues per acre in the county; 


^Evidence on the risk aversion of farmers is not 
consistent (Douglas Young 1979; Bruce Gardner and 
Jean-Paul Chavas 1979; Rulon Pope 1981). Provision 
for risk-averse behavior in the objective functional would 
lead to the inclusion in derived necessary conditions of 
second-order moments of stochastic variables. Due to 
lack of sufficient data, however, only expected values 
are used in the empirical analysis: we assume risk 
neutrality and independence of relevant factors. Be- 
cause flood protection projects may reduce the variance 
of returns (in addition to increasing average returns), 
the assumption of risk neutrality may lead to underesti- 
mation of the impacts of federal projects. 

5Note that the term in the objective function which 
represents the (discounted present value of) expect- 
ed future net revenue from agricultural production. 
Aid; jp — AC, is the price of farmland in a competitive 
market. f 

Though a discrete-time formulation would be more 
realistic, the continuous-time approach is simpler and 
easier to interpret. The econometric specification im- 
plied by the discrete-time formulation is identical. 
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=parcel-specific index of feasibility of 
agricultural production, including ef- 
fects of soil quality and soil moisture: 
g=acres of land converted from forested 
to agricultural use; 
v=acres of cropland: abandoned (gradu- 
ally returned to forested condition); 
AC=expected costs of agricultural produc- 
tion per acre, expressed as the dis- 
counted present value of an infinite 
future stream; 

C= average cost of conversion per acre (in- 

dexed by weather conditions); z 
f-expected annual net income from 
forestry per acre; 

S= stock (acres) of forests; 

r -- real interest rate; 

W= windfall of net revenue per acre from a 
one-time clearcut of forest (in the pro- 
cess of conversion); 

D=expected present discounted value of 
loss of income due to the gradual re- 
growth of forest (harvesting does not 

' occur until the year t+ R, where R is 
the exogenously - determined rotation 
length);* 

g= maximum feasible rate of conversion, 
defined such that 


(5) [eel dr= Sin | 


"Precipitation and consequent soil moisture are later 
allowed to influence conversion costs; the conversion 
cost term in equation (1) is then replaced by C; 
exp( a; PHDI,,}+g,,, where a, is an estimated parame- 
ter and PHDI ,, is the Palmer Hydrological Drought 
Index. 

- "The inclusion of this term allows for a category of 
land which is neither productive farm nor mature forest, 
‘but evolving "bush." The expected present discounted 
value of loss of income due to gradual regrowth of 
forest, D,, is: . 


t+R pa EN —R 
D= f °F hye 06070) dr= Rp {12e}, 


where F, is the present discounted value of an infinite 
future stream of net forest income, i.e. .F,-r=f,. If 
R=0, D, = 0 (if regrowth is instantaneous, there is no 
loss of revenue due to harvest delay); and if R — oo, 
Di, = F, (if the regrowth period is infinitely long, there 
is a complete loss of all forest revenue.) 
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for arbitrarily small interval, A, over 
which Bije is constant; 

D= = maximum feasible rate of abandon- 
ment, defined such that 


A 
(6) L [5] dr — Tin — Sije 


= AG 


ijt 


for arbitrarilv small interval, A, over 
which v,;, is constant. 
AG=stock (acres) of agricultural land; and 
T=total acreage of parcel in the flood 
plain available for conversion.? 


Note that only the control variables, the 
state variables, and the quality index, 4, are 
specific to the individual land parcel. All of 
the revenue and cost variables are measured 
at the county level. This is a consequence of 
the data, and is indicative of the information 
available to landowners. Aggregation of 
first-order conditions for individual land- 
owners to the county level will yield relation- 
ships among county-level variables and the 
distribution of q. The parameters of these 
relationships and of the underlying distribu- 
tion can then be estimated econometrically. 

The solution to the optimization problem 
is provided in a longer version of this paper 
(Stavins and Jaffe, 1988). To’ characterize 
that solution, the following notation is con- 
venient: ` 


(7) Xi jt =Á 


(8) Y, 


ijt 


it’ dig ^ ACi — Cia 7 FN,, 


= Fi, E Ait’ dij + AC, 


where FN, is net forestry revenue, Fp — W,,; 


uo 


and F, is delayed net forest revenue, 


it Dy. 


?Some land in the thirty-six counties was withdrawn 
from availability to the private. market during the study 
period as a result of designation of protected status by 
Federal and state authorities, including the U.S. Fish 
and Wildlife Service, the U.S. Forest Service, and state 
fish and game agencies. 


` when abandonment occurs, vf, = A 
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'The solution to the maximization problem 
implies that conversion or abandonment will 
occur under the following conditions: 


(9) Conversion occurs if X,,, > 0 


and parcel is forested, 


(10) Abandonment occurs if Y, , 


>0 


and parcel is cropland. 


Letting the time interval A in equations 
(5) and (6) be equal to unity (a single time 


period), the continuous-time model yields 


the following result for a discrete-time situa- 
tion: when conversion occurs, g*, ='S;,,; and 
ijt For 
each homogeneous parcel j, it is always 
optimal either to convert the entire parcel 
from forested condition to agricultural use 
(only, of course, if it is in a forested state), to 
abandon the entire parcel (only if it is agri- 
cultural cropland), or to do nothing. 


B.. Modeling the Unobserved 
Heterogeneity of Land 


Equations (9) and (10) imply that all land 
in a county of a given quality will be in the 
same use in the steady state. We do not, 
however, observe counties as all forest or all 
farmland. Partly, this may reflect deviations 
from the steady state (on which more below), 
but to a great extent it reflects heterogeneity 
of land within a county. We can characterize 
this heterogeneity in terms of a probability 
density function /;( 4,;,), as pictured in Fig- 
ure 1. 

The function pictured has a point mass of 
probability at q — 0, corresponding to land 
of hydrologic condition which renders agri- 
culture completely infeasible. The rest of the 
land in a county could be farmed, with agri- 
cultural yields being an increasing function 
of q, since q reflects primarily variations in 
frequency of flooding, drainage, and soil 
conditions. Thus it depends on the natural 
lay of the land, the type of soil, and impor- 
tantly, the existence of man-made flood- 


' control and drainage projects. 
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FIGURE 1. THE EFFECT OF FEDERAL FLOOD 
CONTROL AND DRAINAGE PROJECTS ON THE 
DISTRIBUTION OF UNOBSERVED LAND QUALITY 


The two panels of Figure 1 demonstrate 
the effects of public projects. Conceptually, 
there are three such effects, but two are 
indistinguishable empirically. First, flood- 
control and drainage projects may render 
feasible for agriculture land that was previ- 
ously infeasible; this is reflected as a transfer 
of probability mass from the point at zero to 
the rest of the distribution. Second, by im- 
proving drainage or reducing flooding, pro- 
jects may shift the quality distribution for 
previously feasible land to the right. 

Finally, projects may also change the dis- 
tribution of q for agriculturally feasible land 
because the land which was infeasible that is 
rendered feasible may have an underlying or 
potential quality distribution that is different 
from the distribution for already feasible 
land. This effect could go in either direction. 
On the one hand, it could be that infeasible 
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land, because it is low-lying and poorly 
drained, is likely to remain low quality even 
if given enough flood protection to be tech- 
nically feasible. Alternatively, it could be 
that the soil on such land is of high quality 
and that, with flood protection, it is very 
good agricultural land. In the former hypo- 
thetical, projects would shift the quality dis- 
tribution for feasible land to the left; in the 
latter to the right. Note that if any of the 
infeasible land is potentially of high quality, 
and if this is known in advance, then the 
government could increase the tendency for 
projects to be quality-improving by choosing 
preferentially to give flood protection to 
(potentially) high quality land. Thus, both 
the second and third effects are likely to 
result in an improvement in observed quality 
of feasible land, but these effects cannot be 
distinguished in the data. 
We parameterize this model of the quality 
distribution as follows:!? 


(11) log( aij) ~ N(u, o?) 
with probability d; 

4i je = 0 with probability (1—d,,) 
where d, is the probability that agricultural 
production i is feasible." The first effect iden- 
tified above is incorporated by allowing d,, 
to be a function of the extent of federal 
projects: 

1 


1 
em) 


1+ 
(13)  c(z) - DRY,* B: PROJ, 


(12) d;,— 





19 We focus on the lognormal distribution because we 
believe that its general shape is appropriate for a distri- 
bution over quality of land. We experimented with 
other functional forms, and we discuss some of these 
results below. 

In the current application, individual county means 
and variances are not estimated. Note, however, that 
separate i, and o; parameters are identified and could 
therefore, in principle, be estimated, given sufficient 
data. 
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where DRY, is a measure of the percentage 
of county i which is naturally protected from 
periodic flooding, PROJ, is an index of the 
share of county i at time ¢ which has been 
artificially protected from flooding by Corps 
and SCS projects, and f4 is a parameter 
which indicates the impact of artificial flood 
protection relative to the impact of natural 
flood protection.” 

The two effects of projects on the quality 
distribution for feasible land are incorpo- 
rated by allowing the parameters of the log- 
normal distribution to depend on the project 
index as well: 


(14) log( qj.) ~ N [n1 &PROJ), 
[o- (1 + &PROJ,)]'] with prob. d; 
qi jı = 0 with probability (1— d;,) 


The effects of federal projects on land-use 
decisions are thus captured through the em- 
ployment of three project-impact parameters 


: — f, Bo, and Bs. 


C. Aggregation of Necessary Conditions 
for Forested Wetland Conversion 


Having posited the basic nature of the 
heterogeneity of land, the distributional 
model can now be used to aggregate the 
individual-landowner necessary conditions 
previously developed.? Equation (9), above, 
indicates that there is an incentive to convert 
forested wetlands to agricultural cropland if 


X; j, > 0. Hence, there is a threshold value of 


Gi jt» denoted qz, above which the incentive 


12 The logistic specification is used to constrain d; to - 
values between zero and unity, because the empirical 
measures of Corps and SCS project impact areas and 
natural flood protection are only indexes of protection. 

“One alternative research strategy would be to col- 
lect data on individual land parcels and estimate a 
model such as that developed by Rust (1987). In the 
present context, however, it would be prohibitively ex- 
pensive to develop this data over an area large enough 
to identify the effects of interest. 
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for conversion manifests itself: 


Ca + FN, + AC; 
(15) gi- | it it | 


A; 


it 


If conversion cost is allowed to be hetero- - 


geneous across land parcels (within counties) 
and flood-control projects are believed to 
affect conversion costs as well as agricultural 
feasibility (yields), then the conversion cost 
term in equation (1) is replaced by the term 
0,'Q;4 C8, Where a, is a parameter which 
captures the relative effect of heterogeneity 
on conversion costs, compared with the ef- 
fect on agricultural yields. Next, allowing for 
the parametric effect of weather on conver- 
sion costs, q;; becomes 


(16) a 


l FN, t AC, g 
Air m a,C,,exp( a,PHDI,,} : 


In either case, there is an incentive to 
convert parcel j (in county i at time f) from 
a forested condition to.agricultural cropland 
if qip > qù. Therefore, the privately optimal 
(the desired or target) stock of converted 
land, expressed as a fraction of all land 
available for conversion, is 


o» [SE-- RT] 


-4 [LAC a 


where 7,{-} is the lognormal density func- 


tion. Therefore, 


* 


a [| -«-n- nte. 


where F[-] is the lognormal cumulative dis- 
tribution function, and 


* 


- a» [7] -«. 


-[1-F[[log[ az] - »]17o]]. 
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- where F[-] is the cumulative, standard nor- 


mal distribution function. 
There is an analogous equation for aban- 
donment, which gives the target stock of 


- forested land as a fraction of the total 


* 


Q9) || = 4, IFlogfar] - n1/e]] 
+ [l= d], 


where q} is the threshold value of 4;;, below 
which the incentive for abandonment man- 
ifests itself (see equation (8), above, and 
Table 1). 

These relationships are shown graphically 
in Figure 1. The area to the right of q* 
should be converted to farmland (if forested); 
the area to the left of q? should be aban- 
doned (if farmed). The area between q? and 
q* is worth farming if previously converted, 
but not worth converting if currently for- 
ested. Changes in the economic climate af- 
fect the position of the thresholds. For ex- 
ample, an increase in expected agricultural 
revenue will shift both the conversion 
threshold, q*, and the abandonment thresh- 
old, q", to the left, thereby encouraging con- 
version of forested land to agricultural use. 


m. Econometric Analysis 
A. Specification Issues 


Two specification issues must be ad- 
dressed before the model embodied in equa- 
tions (19) and (20) can be estimated. These 
are: (1) the possibility that adjustment to- 
ward optimal land use is not instantaneous; 
and (2) combining the conversion and aban- 
donment models into one estimating equa- 
tion. 

All of the analysis to this point has as- 
sumed that conversion to optimal land use 
(conditional on current prices) occurs instan- 
taneously. There are several reasons why this 
may not be true.” Although we estimate the 


l^These include: forest age distribution, liquidity 
constraints, uncertainty about the permanence of price 
movements, and decision-making inertia. 


D RETO 
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TABLE 1 —ECONOMETRIC MODEL OF FORESTED WETLAND CONVERSION 
AND AGRICULTURAL CROPLAND ABANDONMENT 


FORCH,, = FORCHE- Dé + FORCH; 


a Data, + dip 


_ FORCH§(-1) = x [ du- [1-F[ [log[ gf] — (L+ &; PROJ,)] 


/o(l+ Pho] -1] 
FORCHE = v4, [F[[los[az] -u (1+ B PRO )] — . 


/0 Q+ BPRON)]]+ 11- 43- [5]... 


T 
1 
e" 


FN, + AC 


a-| 


“frictionless” model implied by equations 
(19) and (20), we also want to consider the 
possibility of partial adjustment in each ob- 
servation period toward the optimal land use 
pattern. In the case of conversion, we have’ 


e» EPEL 


=y Riiel 


where y, is the rate of partial adjustment, 
[AG/T]* is given by equation (19), and ej 
is an error term composed of a county- 
specific (time-invariant) component, À;, and 
a component, $5, which has mean zero, so 
that ef,=A,+ 45. In abandonment situa- 
tions, we have 


e [3-3]... 


«dL - 1 


where v, is the rate of partial adjustment for 
abandonment, [S/T]* is given by equation 


+ E 





| , where a(z) = DRY, + By PROJ, 


; Fi, * AC, 
ea ERE tore gl =| ————. |. 
Aj, — aC, exp {a PHDI,) i An 


(20), and ef is an error term composed of a 
county-specific component, A,, and a com- 
ponent, $ip which has mean zero. Since 
county-level stocks are aggregates of individ- 
ual decisions, these adjustment parameters 
represent the probability that a landowner 

not in equilibrium in a given time period will 
switch to the optimal hag use within the 
initial period. 

To combine tition: Q1) and (22) into 
one relationship, we define the net change in 
the forested fraction of the county between 


I51t might seem that a superior approach would be to 


_ incorporate adjustment costs or lags into the original 


optimization problem, but this cannot be done in a way 
which yields necessary conditions which can be aggre- 
gated across heterogeneous parcels to the county level. 
Any such mechanism must depend on deviations of 
individual parcels from optimality. Estimating a model 
with adjustment costs requires observing the relation- 
ship between the magnitude of deviations from equilib- 
rium and the rate of movement. Since we do not ob- 


‘serve individual parcels, this cannot be done, so any 


adjustment mechanism built into the individual model 
could not be estimated with county data. One could 
specify a version of equation (1) with adjustment costs 
at the county level, but that would be equivalent to a 
representative-firm assumption. Thus, a fully dynamic 
optimal model can only be implemented with individual 
data. 


VOL. 80 NO. 3 


periods t-1 and ż as 
AG AG 
e» [Fl] 
T ja | T d 


[S]. [S] =(-1-ronce 
T i,Qt—1 T it . i 
Under the assumptions of the model, con- 
version and abandonment will never occur 
simultaneously in the same county, so we 
can write 


(24) FORCH, 


ae [T É 
NERA REM. 


* 
eon], 
Tod Tres 


where Dé and D? are dummy variables!$ for 
the conversion and abandonment regimes; 
[AG/T]* and [S/T * are the corresponding 
target stocks; and e, is a composite error 
term, defined by ej, — ef, + ef =A, + $$, + 4$, 
=A, +o, 

The county-specific components of the er- 
ror term, A,, are treated as fixed-effect pa- 
rameters and the ¢,, are assumed to be inde- 
pendently distributed across i and t, but not 
necessarily homoscedastic.!" Thus, equation 
(24) is a single-equation, fixed-effects model, 
the parameters of which can be estimated by 
nonlinear least squares, with county dummy 
variables employed to eliminate any bias due 
to the county fixed effect (Table 1). 





léThe dummies are endogenous variables. We first 
estimate separately the conversion and abandonment 
equations, and thus predict values for conversion and 
abandonment. The two equations are then combined as 
in equation (24) with dummies constructed on the basis 
of the predicted conversion or abandonment. 
Heteroscedasticity-consistent standard errors were 
calculated according to Halbert White (1980), and are 
reported in the table of econometric results. The possi- 
bility of serial correlation in ¢,, was explored. Neither J. 
Durbin’s (1970) test nor that suggested by T. S. Breusch 
(1978) and L. G. Godfrey (1978) indicated significant 
serial correlation. 
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B. Parameter Estimation of 
Alternative Specifications 


Using data for 36 counties in Arkansas, 
Louisiana, and Mississippi, during the pe- 
riod - 1935-1984,!8 the parameters of the 
model embodied in equations (24), (Table 1) 
were estimated econometrically. Panel data 
were incorporated into the estimation pro- 
cess by stacking the data for 36 counties for 
each of ten (five-year!?) time periods, for a 
total of 360 observations. 

The results of six versions of the model 
are presented in Table 2. The overall results 
lend support to the basic validity of the 
model. Estimated parameters are all of the 
expected sign, and nearly all estimates are 
significant at the 90 percent, 95 percent, or 
99 percent level. Also, both parameter and 
standard error estimates are quite robust with 
respect to modifications of the specifica- 
tion. Thus, the basic structural model of 
changes in forested acreage being a function 
of expectations regarding relative economic 
returns from agriculture and forestry is 
strongly supported.” In addition, the 
fixed-effects approach is clearly superior to a 
totally pooled model, as indicated by the 
appropriate likelihood ratio tests.?! 

Column 1 of Table 2 restricts flood-con- 
trol and drainage projects to affecting only 
agricultural feasibility, while columns 2 and 
3 also allow for effects on quality. The esti- 
mated partial adjustment coefficients on con- 
version, y,, and abandonment, y,, indicate 
that about 60 percent of the targeted de- 


18The nature and sources of data employed are briefly 
described in the Appendix, which also provides basic 
statistics for all variables. 

Limitations on the availability of data on the de- 
pendent variable (forested acreage) necessitated the use 
of a quinquennial as opposed to an annual model. 

The possibility exists of including in the model a 
measure of individuals’ expectations regarding future 
construction of flood-control and drainage projects, 
proxied by project authorizations. Given relatively con- 
stant real conversion costs, however, no incentive exists 
for landowners to convert their parcels prior to project 
construction (and consequent flood protection). 

For example, in the L1 specification, the likelihood 
ratio (LR) statistic is 69.9; the appropriate x? critical 
value is 58.6 at the 99 percent confidence level. 
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TABLE 2—-ECONOMETRIC ESTIMATION RESULTS— LOGNORMAL, NORMAL, 
AND UNIFORM DISTRIBUTIONS OF LAND QUALITY 


; Alternative Specifications? : 
Parameter LI L2 L3 L4 N1 U1 


Yq Abandonment 0.37618 0.32360 036717 - 041883 0.18288 
Partial Adjustment (0.190)? (0.177) — (0.184) (0.190) — (0.075) 
x, Conversion 0.44875 0.69352 0.64826 - 0.62814 0.29872 


Partial Adjustment — (0.142) (0.156) (0.154) (0.150) (0.114) 

p MeanofUnobserved 0.74095 0.83464 1.11650 1.41950 2.26650 . 
Quality Distribution — (0.368) (0.290) (0.364) (0354) (0419) . 

c Standard Deviation 0.38182 0.44438 0.43848 0.56324 043538  .- 
of Quality Distrib. (0.087) (0.069) (0.067) (0.021) (0.067) 


€ Upper Limit of ; 4.64270 
Quality Distribution : (2.173) 

6 Range of Unobserved - - - - - 1.34980 
Quality Distribution i (0.855) 

Bı Project Impact on 9.20170 8.83060 8.93700 8.37430 8.69140 8.94940 
Agric. Feasibility (3.216). (2.309) (2.465) (1.768) (2.394) (3.705) 

P Project Impact on - 1.07240 0.77193 0.36821 0.24691 - 
Heterogeneity Mean (1.467) (0.774) (0.449) (0.317) 

Ps Project Impact on - 0.53757 0.42799 0.36451 0.39361 - 
Heterogeneity S.D. (0.229) (0.183) (0.133) (0.176) 

o, Relative Conversion 1.58160 1.02070 - - - - 
Cost Impact ` (0.923) (1169) 

a, Weather Impact on . - - 1.59720 1.59410 1.41720 1.58600 
Conversion Cost (0.304) (0.296) (0.193) (0.302) 
Goodness-of-Fit* 0.6719 0.6743 0.6747 0.6681 0.6738 0.6742 


Log Likelihood Value 786.07 790.62 791.70 78823 791.57 787.89 
Degrees of Freedom 318 316 316 318 . 316 318 


*All versions also contain 36 county dummies. L1, L2, L3, and L4 (frictionless 
model) employ lognormal distributions of land quality; N1 emplovs a normal distribu- 
tion; and U1 employs a uniform distribution. ot er 

>Robust standard error estimates appear below parameter estimates. 

°This dynamic goodness-of-fit statistic is equal to one minus Theil’s U-statistic, 
based on comparing predicted and. actual net rates of conversion. 


crease in forested acreage (increase in agri- 
cultural cropland) and about 36 percent of 
the targeted abandonment occur in the ini- 
tial five-year period. 

The distribution of heterogeneity is non- 
degenerate: the standard deviation, o, of the 
lognormal distribution of.agricultural quality 
is significant quantitatively and statistically. 
Likewise, two of the three parameters cap- 
turing the impact of federal projects on con- 
version and abandonment are significant: di- 
rect. impact on agricultural feasibility, B; 
and impact on the standard deviation of 
agricultural quality, 8,. The impact on the 
mean of agricultural quality, B,, is positive 


but not significant.” The average direct im- 
pact of artificial flood protection on agricul- 
tural feasibility, Bi, is about nine times that 
of “natural flood protection.” 


Note that the model is, parameterized such that 
projects affect the mean and variance of the log of q. 
The expectation of q itself is 


_exp{ p(1+ B PROJ) «05e? (1 - 8), 


so both £, and f, indicate increasing agricultural qual- 
ity due to projects. 


VOL. 80 NO. 3 


It was not possible, due to limitations of 


the data, to estimate the equation with both 
a parameter for the effect of conversion costs 
relative to other benefits and costs, a,, and à 
parameter for the effect of weather on con- 


version costs, .a,, although the parameters of: 
such a specification are theoretically identi- . 


fied. The specification with a, (L2) appears 
inferior to the one with a, (L3), and the 
estimate of o, is not significantly different 
from 1.0. The impact of weather on conver- 
sion costs, a», is very significant. 

Columns L4, N1, and U1 explore the 
sensitivity of the results to dynamic and 
distributional assumptions. L4 assumes in- 
stantaneous rather than partial adjustment 
to the optimal state. This constraint is re- 
jected by the data,” but most parameters 
remain qualitatively similar. The dynamic 


‘goodness-of-fit, calculated as suggested by . 
Henri Theil (1961), shows only slight decline 


‘compared to the: partial adjustment case 
(L3).% Assuming a normal (N1) or uniform 
(U1) distribution of unobserved land quality 
also yields results that are qualitatively simi- 
lar, with slight decrease of dynamic good- 
ness-of-fit. 

Although the — in Table 2 exhibit the 
significance of prices, costs, and government 

.. projects in conversion and abandonment de- 
cisions, these results say little about the rela- 
tive importance of these. influences. Due to 
the nonlinear, dynamic form of the model, 


Individually, the Wald statistics for constraining 
the model to instantaneous adjustment are 5.2 for con- 
version and 11.8 for abandonment, compared with a 99 
percent critical (x?) value of 6.6; the joint test yields a 
statistic of 12.1; compared with a 99 percent critical 
value of 9.2. ° 

A frequently used measure : of dynamic performance 
is the root-mean-squared (RMS) error, but this measure 
suffers from the limitation that its magnitude is not 
standardized. An alternative is Theil’s inequality coef- 
ficient (Theil, 1961). "The numerator of this statistic is 
the RMS error, and the scaling of its denominator 
ensures that values fall within the bounds of zero and 
unity, where zero indicates a perfect dynamic fit. In 
keeping with the ordering of most goodness-of-fit mea- 
sures, the final comparative statistic shown in Table 2 is 
equal to one minus the Theil coefficient, so that a 
perfect fit is evidenced by a value of 1.0. 
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the importance of the various factors can be 


. “discerned. only through a series of dynamic 


simulations. 
IV. Dynamic Simulation Results 


To -provide a benchmark for comparison, 
the extent of conversion or abandonment is 
simulated using the econometrically esti- 
mated parameters and the actual, historical 
values of all variables (“factual simulation”). 
Then, in a series of “counterfactual simula- 
tions," the extent of conversion or abandon- 
ment is simulated using various assumed 
counterfactual values for certain exogenous 
variables, while maintaining all other vari- 
ables at their actual levels. Finally, the simu- 
lated changes in forested wetland acreage in 


` each counterfactual simulation are compared 
.to the factual simulation changes. Any dif- . 


ference represents an estimate of the land-use 
change that can be attributed to a change in 
an exogenous variable from the counterfac- 
tual value of the variable.to its actual histori- 


. cal pattern of values. 


The simulation results utilizing model L3 
are summarized in Table 3. Column 1 shows 
the total (net). conversion of forested wet- 
lands to farmland through 1984. In the fac- 
tual simulation, this is 3.68 million acres. 
(The true historical conversion was 3.64 mil- 
lion.) Because of the partial adjustment 
mechanism, conversion will continue into the 
future, even if all exogenous variables re- 


main unchanged. To capture this effect, col- 


umn 2 extends the simulations through 1999, 
keeping all (factual) variables at their 1984 
values. Thus, we predict that a total of 3.83 
million would be converted by the end of the 
century if 1984 conditions were to prevail. 
The target stock based on 1984 values corre- 
sponds to net conversion of 3.84 million 
acres, so the 1999 simulations come very 
close to the steady state. 

The second row in the table shows that 
simulated wetland depletion through 1999 if 
no federal projects had been built is about 
32 percent less than factually simulated de- 
pletion. For comparison, the third row shows 
simulated depletion if flood protection pro- 


vided by natural topography and the Missis- 
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TABLE 3——SIMULATED CHANGE IN STOCK OF FORESTED WETLANDS UNDER ALTERNATIVE SCENARIOS 
LOGNORMAL DISTRIBUTION. OF LAND QUALITY 


Change* in Forested 





65-Year Period 








Wetland Acreage ` 1935-1999 
50-Year 65-Year Percentage of Share of Depletion 
Period Period Factually Simulated Due to This 
Simulation 1935-84 1935-99 Depletion "Factor? in percent 
(1) (2) 8) , (4) 
(1) Factual —3.677 — 3.834 . (100) (0.0) 
(2) No Federal Flood-Control 
" or Drainage Projects — 2.527 — 2.612 68.1 : 31.9 
(3) No Flood Protection ] 
from Natural Topography ` ; 
and Mainline Levees —2.831 — 2.984 78.8 ‘ 21.2 
(4) Conversion Costs NC 
Set to Zero —4.354 —4.526 1-8.0 ~18.0 
(5) Net Forestry Revenue 
Set.to Zero —4.259 -4419 > 115.3 —15.3 
(6) Agricultural Prices 
.Kept at 1934 Levels —3.641 — 3.818 996 | 0.4 


* Millions of acres, based upon parameter estimates from specification L3, reported in column 3 of Table 2. 
Difference between counterfactual simulation and factual simulation, divided by factual simulation. 


sippi miainline levee system? were elimi- 
nated. This has less effect than elimination 
of federal projects, reducing depletion by 21 
percent. Rows 4 and 5 show that conversion 
costs and forestry revenues were significant 
forces restraining conversion. Simulated con- 
version is 18 percent more if conversion is 
costless, and 15 percent more if forestiy 
yielded no net revenue. 

Finally, the last row of Table 3 TAPA 
the hypothesis (maintained by the Corps of 


25« Effect of natural topography” refers to DRY; ( —1 
— FLRISK,), a measure of natural flood-proneness of 
counties (prior to construction of flood-control and 
drainage projects). The “mainline levee system" (MLS) 
along the Mississippi River, which was in place virtually 
from the beginning of the study period, may also have 
had an impact on wetland depletion. Data on the area 
protected by the MLS are not available. The protected 
'area is approximately proportional to the area affected 
by the great flood of 1927, but flood data are highly 
correlated with natural topography, and so multi- 
collinearity prohibits estimation of the effects of natural 
topography and the MLS in the same equation. With 
the omission of the MLS proxy variable, the FLRISK 
variable accounts for both phenomena. Thus, the re- 
ported impact of Federal projects on conversion refers 
exclusively to interior levee development and underesti- 
mates the impact of all projects. 


Engineers and o:hers) that rising agricultural 
prices drove we:land depletion, by siraulat- 
ing conversion if farm prices had held at 
their 1934 levels (in real terms). Net deple- 
tion is about 1 percent less than when factu- 
ally simulated. Thus, there is no evidence 
that rising agricultural prices were a signifi- 
cant factor driving conversion. Even wiih the 
depressed farm prices of 1934, economic in- 
centives favored conversion of many acres. 
The simulations in Table. 3 were also car- 
red out with the frictionless, normal, and 
uniform models (L4, N1, and U1 in Table 
2). Although the changes attributed to spe- 
cific factors vary somewhat, the ranking of 
factors by importance is the same in all 
versions. Two differences, however, are worth 
noting. The frictionless model, by assump- 
tion, does not predict continued conversion 
after 1984 based on 1984 conditions. In fact, 
it predicts slightly less net conversion by 
1984 (3.586 million acres). As noted above, 
the optimal stock corresponds to net canver- 
sion of 3.84 million acres, so the partial 
adjustment model run, out to 1999 would 
appear to be a better indicator of the ulti- 
mate conversion. At the other extreme; the 
uniform model predicts net conversion of 
4.065 million acres by 1999, more than twice 
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as much conversion between 1984 and 1999 
as is simulated using the preferred, lognor- 
mal model. Not surprisingly, the impact of 
distributional assumptions becomes greater 
as we move into the tails of the distributions. 


V. Conclusions and Policy Implications 


The statistical analysis leads to several 
conclusions. First, landowners responded 
to economic incentives in their land-use 
decisions. Second, construction of federal 
flood-control and drainage projects caused a 
higher rate of conversion of forested wet- 
lands to croplands than would have occurred 
in the absence of projects. Third, federal 
projects had this impact because they made 
agriculture feasible on land where it had 
previously been infeasible, and because, on 
average, they improved the quality of feasi- 
ble land. Fourth, adjustment. of land use to 
economic conditions was relatively gradual. 
On average, about 65 percent of forested 
acres which “should” have been converted 
to agriculture were converted in initial 
five-year periods; and about 38 percent of 
agricultural land which “should” have been 
abandoned were abandoned over initial 
five-year periods. 

Simulations with estimated parameters 
show the quantitative importance of these 
effects. If there had been no federal flood- 
control or drainage projects constructed in 
the 36-county area after 1934, approximately 
1.15 million fewer acres of forested wetlands 
would have been converted, 31 percent of 
total depletion. Long-term (steady state) de- 
pletion due to federal projects (constructed 
through the year 1984) is estimated to 
amount to more than 1.23 million acres, 
about 32 percent of estimated long-term de- 
pletion. Since the total acreage protected by 
projects was about 5.3 million acres, the 
results imply an average “propensity to con- 
vert” protected acres of about 0.22. 

Of the factors considered in the economet- 
ric model, flood protection and drainage 
provision afforded by federal projects had 
the largest impact on net changes in forested 
acreage. The joint effect of natural topogra- 
phy and the mainline levee system was of 
secondary importance; and net forestry rev- 
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enues and conversion costs exerted substan- 
tial restraints on wetland clearing. 

In terms of public policy, the evidence 
highlights a striking inconsistency in the fed- 
eral government’s approach to wetlands. In 
articulated policies, laws, and regulations, 
the government recognizes the large positive 
externalities associated with wetlands; the 


.Bush Administration has endorsed a “zero 


net-loss of wetlands" policy. But public in- 
vestments in wetlands (in the form of 
flood-control and drainage projects) create 
major incentives to convert these areas to 
alternative uses. Clearly, the overall justifi- 
cation for these federal programs ought to 
be reexamined, and stringent tests of public 
need should be applied to both public and 
private actions which have direct or indirect 
effects on wetland resources. 

The conclusion that major public infra- 
structure investments affect private land-use 
decisions (often thereby generating negative 
externalities) may not be a surprise to many 
readers of this journal, but the analysis de- 
scribed here provides evidence which con- 
trasts sharply with the accepted wisdom 
among policymakers. It is hoped that the 
quantification of these effects will give these 
realities a more prominent role in policy 
debates. As wetlands, tropical rain forests, 
barrier islands, and other sensitive environ- 
mental areas become more scarce, their 
marginal social value rises. If induced land- 
use changes are not considered, we will en- 
gage in more and more public investment 
programs whose net social benefits are nega- 
tive. 


DATA APPENDIX 


The data used in this study were collected as part of 
a previous research effort, sponsored by the U.S. De- 
partment of the Interior and carried out on behalf of 
the Environmental Defense Fund. For complete cita- 
tions of sources of data and more information regarding 


the construction of requisite variable series, see Stavins 


1986 and 1988. Also, a more comprehensive data ap- 
pendix is available on request from the authors of the 
present paper. l 
Land-Use Patterns: The U.S. Forest Service periodi- 
cally measures land use at sample sites, using a sam- 
pling procedure based upon aerial photographs. Sample 
locations are classified on a forest/non-forest basis, and 
these 0-1 observations are then converted into esti- 
mates of acreage of county land which is forested, S;,. 
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TABLE A1 —SUMMARY STATISTICS FOR MAJOR VARIABLES? 


1939 and 1984 

1939 1984 
Variable? 
Name Minimum Mean Maximum Minimum Mean Maximum 
a 64.00 147.12 221.83 201.29 246.70 318.21 
ac 62.41 155.38 197.51 117.92 180.97 295.46 
ACCU 0.000 0.002 0.057 0.000 0.271 0.963 
C 4.77 4.77 4.77 22.76 22.76 22.76 
DRY 0.142 0.529 0.823 ` 0.142 0.529 0.823 
fn —0.03 0.59 2.03 3.56 11.81 27.38 
f 1.50 4.00 8.57 5.32 14.80 25.92 
FORCH —0.072  —0.020 0.080 —0.144  —0.023 0.016 
PHDI —2450  —1.054 — 0.020 —1.050 0.738 1.690 
S 44.7 165.8 367.8 11.5 73.2 277.2 
SCCU 0.000 0.000 0.000 0.000 0.100 0.864 
T 129.9 360.4 590.7 110.9 354.6 590.7 


,All monetary figures are real 1984 dollars. 


* Variables are defined as follows: 


a-- annual agricultural revenue (dollars per acra); 
ac= annual agricultural costs of production (dollars per acre); 
ACCU = share of county protected by Army Corps of Engineers projects; 


C cost of conversion (dollars per acre); 


DRY - share of county naturally protected from flooding; 
fn= annuity of net forestry revenue minus windfall from clearcut; 
f= annuity of delayed net forestry revenue (dollars per acre): 
FORCH - change in forestland as a share of total county over 5 years; 


PHDI = Palmer hydrological drought index; 
S stock of forestland (thousand acres); 


SCCU = share of county protected by USDA SCS projects; and 
T= total county area (available for conversion, thousand acres. 


Total county land available for conversion (not con- 
. served by government) is represented by 7;. The net 
change in forest land as a share of all available county 
land is FORCH,. 

Agricultural Revenue: The average (gross) real agri- 
cultural revenue per acre, a;,, is a weighted average for 
four crops (soybeans, cotton, rice, and corn), based 
upon agricultural prices, production levels, yields, and 
acreages. Data on crop acreages, production levels, and 
prices come from the U.S. Census of Agriculture plus 
State publications. A weighted average of production 
costs, ac,,, was developed from state documents; costs 
considered were case expenses, which include variable 
plus fixed expenses (general farm overhead, taxes and 
insurance, and interest), but not capital replacement nor 
allocated returns to owned inputs. 

Forestry Revenue: Annual forestry net revenue per 
acre, fn,,, consists of two components: the difference 
between the (annualized) revenue stream generated ‘by 
periodic harvesting of limber and the (annualized) one- 
time revenue received from a clearcut of the forest prior 
to conversion. Thus, real forestry net revenue per acre is 
a weighted average of, annual revenues from sawlogs 
and pulpwood minus the annuity of a windfall which is 
gained from a clearcut of timber if conversion is carried 
out. If farmland is abandoned and allowed to return to 
its forested state, there is a delay equal to the rotation 


length before harvests can commence. The annuity of 
delayed net forestry revenue is fy (see equation (8) in 
text). 

Cost of Conversion: The time-series, C,, is the average 
cost of conversion of wetlands to cropland. Because 
geographic variance in the cost of conversion is largely a 
function of soil maisture, a panel of conversion cost 
estimates were developed by allowing for the interaction 
of C, and PHDI,,, as described above. 

Artificial Flood Protection and Drainage Provision: 
Projects of the U.S. Army Corps of Engineers and the 
Soil Conservation Service were considered. In both cases, 
the primary measure of project impact was the “pro- 
tected acreage” of projects, hydrologically defined as 
the area which experiences some reduction in the extent 
and frequency of flooding as a result of project con- 
struction. The respective Corps and SCS variables are 
ACCU; and SCCU,, the sum of which is PROJ,- 

Natural Flood ana Drainage Conditions: A measure of 
the average natural probability of flooding of sample 
counties, FLRISK;, was developed from the National 
Resources Inventory, conducted by the Soil Conserva- 
tion Service. The relevant variable for the analysis is the 
quantity, DRY, =1— FLRISK;. 

Weather Conditions: The Palmer Hydrological 
Drought Index, estimated by the National Climatic 
Data Center, is related to precipitation, runoff, evapo- 
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transpiration, recharge, and soil water loss. Monthly 
drought index data were aggregated into " quinquennial 
averages by county, PHDI,,. 
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Utility Functions That Depend on Health Status: 
Estimates and Economic Implications 


By W. Kip VISCUSI AND WILLIAM N. EVANS* 


Taylor's series and logarithmic estimates of health state-dependent utility func- 
tions both imply that job injuries reduce one's utility and marginal utility of 
income, thus rejecting the monetary loss equivalent formulation. Injury valuations 
have unitary income elasticity, and the valuation of non-incremental risk changes 
and effects of base risks follow economic predictions. (JEL 851,026,913) 


In the basic von Neumann-Morgenstern 
framework, individual utility depends on a 
single attribute—one’s wealth. In some con- 
texts, the character of the lottery payoffs 
may be so sweeping that it transforms the 
utility function. Consider, for example, a 
utility function for a risk-averse individual 
so that utility increases with wealth, but at a 
diminishing rate. If one were to treat death 
as being tantamount to a drop in income, 
then one would obtain the unreasonable re- 
sult that death boosts the marginal utility of 
income. This implausible result highlights 
the fallacy of treating death and other severe 
health effects as monetary equivalents. 

Robert Eisner and Robert H. Strotz (1961) 
first noted this class of difficulties in their 
analysis of flight insurance. They suggested 
that a bequest function is a more appropri- 
ate formulation of the utility function after 
one's death. Modification of the standard 
utility theory approach to recognize the com- 
plications posed by other forms of state de- 
pendence has not posed any insurmountable 
difficulties, as the theory. of state-dependent 
utility is now well developed.! 


*George G. Allen Professor of Economics, Depart- 
ment of Economics, Duke University, Durham, NC, 
27706, and Assistant Professor of Economics, Depart- 
ment of’ Economics, University of Maryland, College 
Park, MD, 20742, respectively. The authors would like 
to thank Gregory M. Duncan, two anonymous referees, 
and seminar participants at various universities for 
helpful comments. 

1See, for example, Arrow (1964), Jack Z. Hirschleifer 
(1970), Ralph L. Keeney and Howard Raiffa (1976), 
and Edi Karni (1985). ` 
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Economists have applied the state-depen- 
dent approach to a diverse set of economic 
problems involving irreplaceable effects, 
product safety, and accidents.? By far the 
most widespread use of this formulation is 
with respect to state-dependent variations 
with individual health status. Richard J. 
Zeckhauser (1970, 1973) and Kenneth J. 
Arrow (1974) developed analyses of health 
care and health insurance decisions in which 
the utility functions for good and ill health 
may assume quite different shapes. These 
formulations led to an overhaul of the eco- 
nomic analysis of the optimal structure of 
health insurance. 

In particular, let there be n discrete states 
of the world indexed by j—1,...,n. Each 
state has an associated health level, health ,, 
and income level Y,. One can then write 
individual expected utility EU as the sum of 
the utilities in each health state weighted by 
the probability of s, that that health state, 
occurs, or 


EU= Y. sU(health,,¥,), 


jel 


as in Charles E. Phelps (1973) and Arrow 


?See Zeckhauser (1970, 1973), Arrow (1974), Philip J. 
Cook and Daniel A. Graham (1977), Spence (1977), 
Viscusi (1978, 1979), Joseph Pliskin, Donald S. Shepard, 
and Milton C. Weinstein (1980), Daniel A. Graham 
(1981), George W. Torrance (1986), Steven M. Shavell 
(1987), Viscusi and Michael J. Moore (1987), and Victor 
R. Fuchs and Richard J. Zeckhauser (1987). 
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(1974). Ideally, one would like to obtain a 
continuous measure of health capital to ana- 
lyze explicitly how individual health status 
alters the structure of utility functions and to 
explore the economic implications of health. 
Since our empirical analysis includes only 
two health states—good health and a post- 
injury state—a continuous measure of health 
cannot be constructed. Instead, we subsume 


the role of health into a state-dependent - 


utility function as in Arrow (1974), or 


EU- Ù sUj(Y,). 


jel 


More specifically, let the notation U(Y) de- 
note the utility function in good health and 


V(Y) denote the utility function in the post- - 


injury state. This differing notation denotes 
that there is a health state-dependent com- 
ponent of a stable preference relationship 
that we cannot estimate explicitly. 

Two assumptions are pivotal. First, for 
any given level of income, one’s overall level 
of utility is greater when in good health than 
in ill health, or U(Y) > V(Y). This assump- 
tion is not controversial; nor does it distin- 
guish the health state approach from earlier 
models that treated adverse health effects as 
being equivalent to financial losses. Second, 
to make any judgments about the extent of 
the optimal insurance, one must make an 
assessment with respect to the influence of 
one’s state on the marginal utility of income, 
for any income level. Optimal insurance cov- 
erage when there is actuarially fair insurance 
available will equate the marginal utility of 
income in each health state (for example, see 
Zeckhauser (1970, 1973), Arrow (1974), A. 
Michael Spence (1977), W. Kip Viscusi 
(1979)).? If ill health does not alter the 
marginal utility of income, for any given 

‘income level, then full insurance is optimal. 
If ill health lowers (raises) the marginal util- 
ity of income for any given income level, less 
: (more) than full income insurance is desir- 
able. Thus, the relative magnitudes of U'(Y) 


"Although the assumption of actuarial fairness is 
clearly unrealistic (ie., it assumes the insurance pro- 
vides a free service with no administrative costs), it is a 
frequent reference point in theoretical analyses. 
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and V'(Y) are key empirical parameters. The 
assumptions one makes about the shape of 
the vtility function govern the fundamental 
aspects of all of the economic results derived 
with such models. 

. In the extreme case of one's death, it is 
not controversial to assume that one's 
marginal utility declines after the adverse 
health effect. For other health outcomes, the 
justification for assuming a drop in marginal 
utility is less clearcut. There 1s no theoretical 
basis for determining the shape of the utility 
function in these instances. 

In Section I we describe the set of data 
used to estimate state-dependent utility 
functions. We will use two empirical ap- ' 
proaches. First, Section II imposes no func- 
tional form restrictions on the utility func- 
tion other than a Taylor's series expansion. 
This unrestricted approach provides tests of 
the two key assumptions of the state-depen- 
dent approach— whether utility is greater in 
good health or ill health and whether the 
marginal utility of income is boosted by or 
reduced by adverse health outcomes. In Sec- 
tion III we impose a specific functional form 
on the utility function (a logarithmic utility 
function) and then estimate the utility func- 
tion in each of the two health states. Section 
IV uses these results to address a variety of 
key, but previously unresolved issues, includ- 
ing: the income elasticity of the implicit value 
of ax injury, the valuation of non-incremen- 
tal changes in risk, changes in risk-dollar 
tradeoffs with a change in the base level of 
risk, and the optimal rate of replacement of 
worker earnings through workers' compensa- 
tion insurance. Many of these findings are of 
substantial, independent economic interest. 


I. Sample Description 


Although there is a considerable literature 
on wage-risk tradeoffs, estimating individual 
utility functions is not feasible with standard 
sets of survey data. Figure 1 makes the source 
of the difficulty apparent. Let ABC be the 
frontier of offered wage-risk combinations 
available in the market. The worker selects 
the optimal job B from this frontier, where 
his locus of constant expected utility EU is 
tangent to the wage of opportunities fron- 
tier. 


Y 
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Income 
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P, P, 
Job Risk 


FIGURE 1. THE MARKET OFFER CURVE AND THE WORKER'S EXPECTED UTILITY 
Locus 


Hedonic wage studies of compensating 
differentials for job risks involve estimation 
of the average rate of tradeoff for the equi- 
librium set of expected utility-wage offer 
tangencies observed in the market. The lin- 
ear wage equation imposes a constant risk- 
dollar tradeoff, and a semilogarithmic for- 
mulation makes the tradeoff risk-dependent. 
Market data can only provide evidence re- 
garding the slope of the observed tangencies 
with the frontier 4BC. One cannot make 
any inferences regarding the shape of the 
individual worker utility functions except 
with respect to the rate of tradeoff at tan- 
gency with the opportunities locus. 

We will follow an alternative approach of 
augmenting market data with reservation 
wage data obtained in response to different 
risk levels. In particular, we utilize the 1982 
chemical worker survey by W. Kip Viscusi 
and Charles J. O'Connor (1984).* That anal- 
ysis was primarily concerned with the eco- 
nomic implications of chemical labeling, 


*More specifically, we will utilize the subsample of 
workers analyzed in Section III of Viscusi and O'Con- 
nor (1984). These workers experienced an increase in 
their job risk. Workers who were randomly assigned to 
the risk decrease experimental cell were not asked a 
reservation wage question. 


whereas this paper is concerned with the 
utilization of the wage and risk information 
to estimate worker utility functions. 

Table 1 summarizes the characteristics of 
the chemical worker sample. The personal 
characteristic variables included information 
on the workers age (AGE) sex (MALE 
dummy variable-d.v.), marital status (MAR- 
RIED d.v.), number of children (KIDS), race 
(BLACK d.v.), years of experience at the 
firm (TENURE), and education (EDUC). 

The survey also elicited the worker's pay 
for the period that was most meaningful for 
that particular occupational class. To pro- 
mote comparability, these statistics have been 
converted to weekly, after-tax earnings, us- 
ing average federal and state tax rates for the 
worker's income class and family status (i.e., 
marital status and number of dependents).? 
The workers in the sample averaged about 
$18,000 per year (1982 dollars) in after-tax 
income. i 

The key job attribute is the worker’s per- 
ceived probability of an accident on his job, 
which is denoted by p, i=1,2, where the 


The tax adjustments were made using information 
provided in The Commerce Clearing House, State Tax 
Handbook, and U.S. Master Tax Guide. 
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TABLE 1—SAMPLE CHARACTERISTICS: MEANS AND STANDARD DEVIATIONS 


Variable 

AGE (in Years) 

MALE (0~1 Sex Dummy Variable d.v.) 
KIDS (Number of Children) 

BLACK (0—1 Race d.v.) 

TENURE (Years of Experience at Firm) 
EDUC (Years of Education) 

p; (Prior Probability of Accident) 

p» (Posterior Probability of Accident) 

Y (Weekly before Tax Earnings) 

8 (Percent Wage Differential) | 


r, (Fraction Earnings Replacement by 
Workers' Compensation on Job 1) 
r, (Fraction Earnings Replacement by 
Workers' Compensation on Job 2) 
t, (Average Tax Rate on Job 1) ` 


t; (Average Tax Rate of Job 2) 


Sample Size 


subscript 1 pertains to the pre-labeling situa- 
tion and the subscript 2, pertains to the 
post-labeling situation. Workers assessed this 
probability using a linear risk scale on which 
there was indication of the average level of 
the risk for the entire private sector. Workers 
marked on the scale their assessed job risk 
with respect to this standardized injury scale, 
thus providing a risk metric scaled in terms 
of an annual job risk, that is, the assessed 
annual probabilities of injury are in the in- 
terval [0, 1]. 

The risk metric is equivalent to the U.S. 
Bureau of Labor Statistics (BLS) private sec- 
tor injury and illness rate. The risk assess- 
ments were 10 percent greater than the pri- 
vate sector average risk for that period and 
were 46 percent larger than the average 


chemical industry accident rate. Since BLS | 


accident statistics do not capture the longer 
term health risks in the chemical industry, 
this pattern is quite reasonable. 


. Means 
(Std. Deviation) 


38.3 
(11.8) 
0.56 
(0.50) 
1.30 
(1.53) 
0.06 
(0.23) 
7.77 
(6.97) 
14.3 
(3.37) 
0.084 
(0.055) 
0.249 
(0.091) 
392.13 
(161.52) 
0.173 
(0.150) 
0.637 
(0.077) 
0.615 
(0.093) 
0.124 
(0.047) 
0.141 
(0.052) 
249 


The workers were randomly assigned to 
one of four different chemical labeling 
groups—asbestos, TNT, sodium bicarbon- 
ate, and chloroacetophenone (an industrial 
chemical that causes tearing). The workers 
were told that the chemical would replace 
the chemical with which they currently 
worked. Thus, there would be a change in 
the chemical used rather than a change in 
the labeling of the chemical with which the 
individual currently worked. Respondents 
then assessed the posterior risk, p,, which is 
roughly triple the prior risk, Only workers 
who reported an increased risk assessment 
are included in the sample analyzed in this 
paper, since it was only for this group that 
reservation wage information was obtained. 
Almost all workers who reported a risk de- 
crease were shown a label for sodium bicar- 
bonate, which was the experimental .treat- 
ment that corresponded to elimination of the 
chemical hazards. 
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The risk scale established a standardized 
reference lottery that the worker could use in 
assessing the risk equivalent of his job before 
and after seeing a warning label. Ideally, 
both the severity and duration of the refer- 
ence injury should be the same, where the 
scale is used to establish differences in prob- 
abilities. Although this was the intent of the 
survey design, the significant differences in 
injury severity across the label treatment 
groups may have affected worker responses. 
In particular, the empirical properties of the 
responses by workers in the asbestos and 
TNT label groups are similar, but the behav- 
ior implied by the chloroacetophenone group 
is somewhat different. Since chloroacetophe- 
none leads to only temporary eye irritation, 
and the other two chemicals pose risks of 
death, this difference is consistent with the 
character of the injuries. We will provide 
empirical estimates for each labeling sub- 
sample as well as for the full sample to 
explore differences across chemicals. 

For workers who assessed their job risk as 
being greater, the survey ascertained the per- 
centage wage increases 6 needed to compen- 
sate the worker for the increased risk. The 
average ô value was just under 20 percent. 
Since the respondents were told that the 
results would be used for a doctoral disserta- 
tion at an institution in a different state and 
would not be disclosed to their employers, 
there was no apparent incentive for them to 
overstate their reservation wage. Moreover, 
the implicit value of a statistical injury re- 
ported in Viscusi and O’Connor (1984) and 
in Section IV of this paper are not exces- 
sively large and are in line with the litera- 
ture. Section III examines the effect on the 
estimates of potential response biases. 

In terms of Figure 1, the survey first ascer- 
tained the information associated with point 
B—the base risk p, and the associated 


©The chemical worker sample yields rates of tradeoff 
that imply a value of $10,000-$20,000 per statistical 
injury reduced. If there were upward response bias, 
these estimates should exceed comparable values ob- 
tained using market data and hedonic wage equations. 
However, the estimates in the literature tend to be 
somewhat greater, as they cluster in the $20,000—$30,000 
range. See the survey by Viscusi (1986) and the.recent 
estimates by Viscusi and Moore (1987). 
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weekly earnings Y. It then altered the risk to 
a level assessed by the worker as being pz, 
with an associated weekly earnings of Y(1 + 
ô), which is point D. Thus, the survey in- 
cludes information with respect to two 
points, B and D, on a constant expected 
utility locus EU where the expected utility 
EU, on the initial job equals the expected 
utility EU, after the wage and risk increase. 
The starting point and the post-labeling point 
D differ across workers so that in effect we 
observe 249 different pairs of points along 
249 differeni utility functions. In contrast, 
the most that can be accomplished using 
observed market wage-risk data for this sam- 
ple is to estimate ABC using the 249 points 
B. Since the survey generates only one equa- . 
tion, EU, = EU,, we are not able to identify 
the shape of both U and V. However, we are 
able to generate relationships between the 
two utility functions, such as differences and 
ratios. 

The final variable needed to complete the 
formulation of the payoffs in each state is 
the level of workers’ compensation benefits 
after an injury. Since these benefits are not 
taxed, for comparability the income in the 
healthy state is in after-tax terms. The earn- 
ings replacement rate variable is based on 
the state benefit formulas for temporary to- 
tal disability and the characteristics of the 
individual respondent. The benefit calcula- 
tion takes into account the worker's income, 
benefit ceilings and floors, and the depen- 
dence of benefits on family characteristics." 
The average replacement rate at the initial 
job (7,) and for the experimental job (7,) are 
both about two-thirds. 

Let U denote the utility of wealth in good 
health, and let V denote the utility of money 
after a job injury. Then a wage increase that 
equates the expected utility that the worker 
obtains from his initial job and the trans- 
formed job satisfies. 


(1) (1- p))U(Y(1-4)) + pV (Yn) 
= (1- p)U(Y(1- 8)(1- 15)) 
+ p,V(¥(1+8)n). 
"The reasonableness of this approach to capturing 


empirically the role of workers’ compensation is dis- 
cussed in Viscusi and Moore (1987). 
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The worker reports his base earnings Y, his 
required wage increase 6, and his prior and 
posterior risk assessments, p, and p}. Infor- 
mation regarding the worker’s income level 
is used to construct the tax variables, r, and 
t,, and the workers’ compensation replace- 
ment rates, r, and r,. The formulation in 
equation (1) takes into account the favorable 
tax treatment of workers’ compensation ben- 
efits, as taxes only affect earnings in the 
good health state. 

The principal empirical test of whether the 
state-dependent approach is valid is whether 


(2) U(Z) >V(Z) 
and 
(3) U(Z) »V'(Z), 


for identical income levels Z in each state. 
Inequality (2) will be satisfied by both a 
health state and a monetary loss equivalent 
model since in each case ill health lowers 
welfare. The distinctive condition is defined 
by inequality (3). Under the state-dependent 
approach, inequality (3) could be in either 
direction, although the sign in inequality (3) 
is the more frequent assumption. With a 
monetary loss equivalent approach, an injury 
will boost the marginal utility of income for 
the usual risk-averse preferences, leading to 
a reversal of inequality (3). If inequality (3) 
is satisfied, we can reject the monetary 
equivalent model and the class of health in 
state models that do not alter the utility 
function in the manner indicated by inequal- 
ities (2) and (3). 


TI. Estimates with Unrestricted 
£ Functional Forms 


Ideally, one would like to estimate equa- 
tion (1) without imposing any restrictions on 
the shapes of U and V. However, with infor- 
mation on two particular points along the 
constant expected utility locus, one cannot 
estimate two different nonlinear functions 
with available data. 

Two approaches are feasible. First, one 
can estimate specific features of the U and V 
functions without imposing functional form 
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restrictions on their shape, as in this section. 
Second, one czn impose constraints on the 
shapes that U and V can take, as in Section 
IV. These two different estimation ap- 
proaches provide a robustness check on the 
results. 


A. Firsi-Order Taylor's Series 

The procedure that we adopt in this sec- 
tion is to construct a first-order Taylor's 
series approximation of utility functions in 
each health state. The second-order Taylor's 
series terms, which we will explore in Section 
IIB, were not statistically significant. 

For each of the utility functions, we will 
use the same level of income as the point of 
expansion, where this level is Y, the weekly 
before-tax income. From the definition of 
the Taylor's series, we generate the following 
approximations to utility: 

(4) UYO- =U) . 
*(Y(1-4)-Y)U'(Y), 

(4b) V(Yn) xV(Y) - (Yn, - Y)V'(Y), 

(4) U(¥(1+8)(1—14)) =U(Y) 

| F(Y(1-8)(1—-5) - Y) U' (Y), 

and 

(ad) V(Y(148)rn) -V(Y) 
*(Y(14-8)r, - Y)V' (Y). 


After substituting the values of (4a)-(4d) 
into equation (1), we obtain 


(5 (»-p)(U(Y)-V(Y)) 
= ((1— p3)t - (1 p) (15 + 855 —8)) 
xYU'(Y) t(p,(1— n5) 


- pY(1-n- r8))YV( Y). 


All of the variables in equation (5) are known 
except for those parameters involving the 
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utility functions for each state. It is these 
terms that will be estimated. 


Let 
(6a) Bi- U(Y) - V(Y), 
(6) — &-U(), 
and 


(6) ^ B&e-v(Q). 


The dependent variable will be the per- 
centage wage compensation 8 that the worker 
requires to face the increased risk. This em- 
pirical structure follows that of the question- 
naire, as the survey asked workers how much 
additional compensation they required to 
work with the new chemical. Although work- 
ers could respond either in absolute or per- 
centage terms, in each case it was the wage 
premium that was elicited. We can consider 
ô as the dependent variable since its value is 
conditioned on knowing all other variables 
—namely Y, p,, and pp. 

Inserting the values for £, B, and f, 
from equation (6) into equation (5) and solv- 
ing for the endogenous value 8 yields 


(7) 8- | (H,B, + HB) Y - (p3— p) By 
(0 72)(65-71)8, - pz 8) Y 
+e, 


where 


(8a) A,=(1-p,)4-(1- p3)t;. 


and 
(80 H,=p,(1- n) - pí(1- n). 


Introducing subscripts to denote the ith 
individual, we have thus hypothesized an 
empirical relationship of the form 


(9) 8,- f(Xi, B) + 


where f(-) is a nonlinear function capturing 
the bracketed term on the right-hand side of 
equation (7), X, is a (k X1) vector of vari- 
ables unique to the respondent (t;, r,, Y;, etc.), 


B is a (pX1) vector of parameters to be 
estimated, and e, is an iid. error term. We 
can obtain an estimate of f via nonlinear 
least squares. The nonlinear least squares 
estimator? B is consistent so long as the 
error terms are independent and identically 
distributed, with mean zero and finite vari- 
ance o?— assumptions that will be tested 
below. 

Given the structure of equation (7) and 
the nature of the data, it is possible to esti- 
mate only two of the three parameters. With 
no loss of generality, set the coefficient 


(10) B, - U'(Y) =1. 


The two tests that will be possible with the 
model are whether utility is greater in the 
healthy state, or 


Q1 - &-U(Y)-V(Y) »0, 


and whether ill health lowers the marginal 
utility of income, or 


(12) Bj - V (Y) «1. 


A test of the financial loss model of adverse 
health effects would be to test. the joint re- 
striction f, > 0 and £,>1. Thus, the distin- 
guishing test of the health state approach, as 
compared with the financial loss model, is 
inequality (12). 

Table 2 presents the nonlinear least 
squares estimate of equation (7), where f, 
has been constrained to equal 1. The first 
equation in Table 2 presents the estimates 
for the full sample, and the next three equa- 
tions report estimates for each label subsam- 
ple. The utility function parameters can be 
viewed as averages across the sample. The 
final equation allows each parameter to be a 
linear combination of the major human capi- 
tal variables, providing evidence on hetero- 
geneity of preferences. In this case we report 
both the individual parameter estimates as 
well as the estimated B, and £, values eval- 
uated at the sample mean. 


*A. Robert Gallant (1975) describes the nonlinear 
estimation procedure and its properties. 
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TABLE 2— NONLINEAR LEAST-SQUARES ESTIMATES OF FIRST-ORDER 
TAYLOR’S SERIES EXPANSION 


Coefficient Estimate, (Asymptotic Standard Error) and 
[Heteroscedastic Consistent Standard Error] 


Full 


Parameter Sample 


Bio (Intercept) - 
By, (EDUC) 
B1; (TENURE) 


Ay 167.7 
(11.81) 
[12.90] 


1941 
(17.07) 
[17.36] 
P3o (Intercept) 


By, (EDUC) 
By, (TENURE) 


B, ^ 67D 
(0.134) 
[0.163] 


0.818 
(0.194) 
[0.153] 


R? 0.505 0.438 
nR? Test 1.469 1.123 
Observations 249 78 


TNT Asbestos 


165.5 
(15.63) 
[17.07] 


Full 
Chloroacetophenone Sample 

225.3 

(70.92) 

[64.25] 

—5.30 
(5.28) 
[4.72] 

1.22 
(1.81) 
[1.98], 

158.9 

(17.80) 

[19.17] 

0.775 
(0.826) 
[0.783] 

— 0.023 
(0.043) 
0.044] 

— 0.032 
(0.018) 
[0.020] 

0.859* 
(0.228) ` 
[0.225] 


—38&.74 
(60.31) 
[105.3] 


0.415 
(0.187) 
[0.173] 


2.522 
(0.561) 
[1.310] : 


0.480 
2.166 


0.456 
20.19 


0.513 
11.20 


87 84 249 


p Evaluated at sample averages for EDUC and TENURE. 
PThis statistic for the first 4 columns i is asymptotically distributed x? with 3 degrees 
of freedom. The critical value for x? (3 d.f.) at the 95 percent conten level is 7.81. 
The statistic in the fifth column is asymptotically distributed as x? (18 d.f.) with a 95 


percent confidence level of 28.87. . 


"The estimates of P, and B, are extremely 
precise and in the expected direction for all 
but the chloroacetophenone results. The co- 
efficient £}, which represents the difference 
between the utility when healthy and when 
injured, has the expected positive sign, with 
a coefficient that is over 10 times larger than 
its standard error for the first three sets 
of results. The f, coefficients for these 
first three columns also are not significant- 
ly different from each other. Since von Neu- 
mann-Morgenstern utility functions are de- 
fined only up to a positive linear transforma- 
tion, it is only the sign of f, rather than its 


magnitude that is of consequence. Individu- 
als prefer the good health state, as predicted. 
The coefficient of B, is also positive and 
passes tests of statistical significance at very 
demanding levels since the asymptotic t-ratio 
is almost. 6. The point estimate of B, Zor the 
full sample is 0.773, which implies that the 
marginal utility of income in the ill health 
state is about three-fourths that óf the good 
health state. The confidence intervals of f, 
for TNT and asbestos overlap the confidence 
intervals of the full sample estimate o? fA. 
; The most pertinent’ statistical test is not 
whether f, is significantly different from zero 
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but whether B, is significantly below 1, the 
marginal utility of income when healthy. One 
can reject the hypothesis that B, equals 1 at 
the 5 percent confidence level. The marginal 
utility of income is significantly lower in ill 
health than when in good health for the first 
three columns of estimates. 

This result provides the distinguishing test 
between the financial loss and health state 
models of injuries. For all results except 
chloroacetophenone, there is evidence that 
the injury lowers welfare, so that B, will be 
positive. If this lowering takes the form of 
being tantamount to being a drop in income, 
then 8, will exceed 1. With the health state 
model, the injury alters the ‘shape of the 
utility function, with the most common as- 
sumption being that an injury reduces the 
marginal utility of income. The estimate of 
P4 is in line with the health state model, and 
it suggests that treatment of health effects as 
being equivalent to monetary losses is inap- 
propriate. 

The one divergent set of results is for 
chloroacetophenone. The sign of f, is nega- 
tive, which is the opposite of the expected 
relationship in inequality (11), but this co- 
efficient is not statistically significant at the 5 
percent level. The raised marginal utility of 
income after an accident ( B, = 2.522) is con- 
sistent with the financial loss equivalent of 
the model rather than the health state ap- 
proach. This result is not implausible since 
this chemical imposes no permanent health 
impairment. It is an eye irritant that causes 
tearing but does not inhibit one's ability to 
derive utility from additional expenditures. 
In Section III we will impose more structure 
on the estimation, which reduces but does 
not completely eliminate the differing per- 
formance of the chloroacetophenone group. 

To explore possible heterogeneity in the 
parameter estimates, in the final equation in 
Table 2 the parameters vary across individu- 
als according to the linear equation, 


B; = Bio + By EDUC + B,TENURE, 
for i=1 and 3, 


where worker education (EDUC) and job 
experience (TENURE) are good measures of 
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lifetime wealth.? Thus, for both f, and f, 
this equation includes estimates of this pa- 
rameter and its interaction with education 
and job tenure. If better educated and expe- 
rienced workers suffer a greater (lower) drop 
in utility after an injury, then the sign of the 
interaction with f, will be positive (nega- 
tive). Similarly, a larger (smaller) drop in 
marginal utility after an accident will lead f 
to have a negative (positive) sign. 

Evaluating the sum of the parameter esti- 
mates at the sample averages, we find $; = 
158.9 and £, = 0.859. These results are within 
one standard error of the full sample esti- 
mates in column 1 of Table 2, even when the 
adjusted standard errors are used.!? Because 
of the large variances for the variables in the 
final equation, the estimated $, is less than 
one standard deviation away from the criti- 
cal value of 1 that is pertinent for the testing 
of the hypothesis given in inequality (12) 
above. None of the personal characteristic 
variables is statistically significant at the 5 
percent level (two-tailed test). The most pre- 
cisely estimated interaction is the f, interac- 
tion with tenure (significant at the 5 percent 
level, one-tailed test), which suggests that 
more experienced workers will suffer a 
greater drop in marginal utility after an acci- 
dent. This result is expected since the greater 
family responsibilities and more limited mo- 
bility of more senior workers creates a de- 
mand for greater insurance coverage after an 
accident, which is the substantive implica- 
tion of a lower value of f,. 

One extreme hypothesis that can be tested 
using the results in Table 2 is whether in- 
juries have no effect whatsoever on either the 
level of utility or the marginal utility of 
income. This hypothesis can be rejected at 
even very demanding confidence levels in 
every case shown in Table 2. 

Two statistical issues must be addressed 
before turning to alternative specifications of 


?More detailed sets of interaction created conver- 
gence problems and could not be estimated. 

The adjusted standard errors are developed using 
the procedure in White and Domowitz (1984). This 
procedure is designed to adjust for the influence of 
heteroscedasticity. 
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the utility function. First, the iid. assump- 
tion of the model may not be satisfied for 
this cross-sectional data base, as the error 
term may be heteroscedastic. Table 2 in- 
cludes both the conventional standard error 
and the heteroscedastically consistent stan- 
dard error for nonlinear models." The sec- 
ond set of standard errors has very similar 
indications for statistical significance so that 
any heteroscedasticity that is present does 
not appear to be consequential. 

Second, it is possible to test the i.i.d. as- 
sumption explicitly and to test the correct- 
ness of the model specification. Based on the 
Halbert L. White and Ian Domowitz (1984) 
test summarized in the Appendix, one can- 
not reject at the 95 percent confidence level 
the assumption that the errors are ho- 
moscedastic. Furthermore, one cannot reject 
the assumption that the first-order Taylor's 
series model for the full sample is a correct 
specification up to an independent additive 
error term. l 


B. Second-Order Taylor’s Series 


To test the robustness of the previous 
model and to explore the potential role of 
second-order terms, we also estimated a 
model based on a second-order Taylor’s se- 
ries. The second-order expansion will allow 
us to estimate U"[-] and V"[-], which can be 
used to calculate measures of risk aversion. 
The second-order expansion is substantially 
more difficult to solve algebraically because 
the expansion of U(Y(1--65)(1—:,)) and 
V(Y(1- 8)r,) about Y generate a quadratic 
expression in 6, our variable of interest. The 
second-order model can be constructed as 
follows. Denote the quadratic expression in 
6 as 


A6? + Bà - C «0, 
and define B,, 8, and f, as before. Let 
By — U"(Y) and B, — V" (Y). 


Some straightforward (but lengthy) alge- 
bra generates the following values for the 


Urpbid. 
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quadratic coefficients: 
EU, = B, + (1- p) ( - B,¥t, 0.5( Yn)? Ba } 
+p { Y 31) 8*05[Y(n—0)]^ 85]. 
A-0.5(1- p) ([Ya — 5)] 85 
+ pi Yn.) Bas. 
B= (1- p) Y(1- tz) (By - Yt Boa) 


+ pjYr,( B, ¥( ry ~ 1) B5), 


and 
C= By +(1— p2) { - Y; +0.5( Yt) Boo } 


+p,{ Y(,—)8,0.5(Y( 7-1)) B3] 


= EU,. 


The solution to the quadratic suggests two 
possible roots. In preliminary analysis with 
this expansion, numeric calculations indicate 
that only one cf the roots can predict posi- 
tive values for 5, and therefore, the implicit 
equation we choose to estimate is of the 
form: 


A fra +e, 


ô= |- B«(B?-44C) 
As is the case with the first-order series, both 
B, and B, (the marginal utility terms) are 
not identified and so without loss of general- 
ity, we set B,=1 and test whether 6, <1. 
Consumer theory suggests that both second- 
order terms should be negative, but there is 
no theoretical basis for predicting which term 
should be larger in absolute value. 

In the first column of Table 3, we present 
Taylor’s series results for a model where 
both of the second-order terms are allowed 
to vary. The value of fj, drops substantially 
from the 0.77 estimated in the first-order 
case. However, we accept the hypothesis that 
the coefficient is significantly below the criti- 
cal value of 1 at about the same confidence 
level as in Table 3, which is the main hy- 
pothesis of interest. The magnitude and the 
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TABLE 3— NONLINEAR LEAST SQUARES ESTIMATES OF SECOND-ORDER TAYLOR'S 
SERIES EXPANSION 


Coefficient Estimate, (Asymptotic Standard Error) and 
[Heteroscedastic Consistent Standard Error] 


Parameter Full Sample Full Sample £5; = B4; 
B 1842 170.3 
(13.47) (11.79) 
[15.63] [12.33] 
B 0.261 0.701 
(0.316) (0.188) 
[0.350] [0.189] 
Bs 21E-3 —-74E-4 
(24E- 3) (1.5E—3) 
[2.4E—3] f11E-3] 
By -1.8E-3 -"74E—-4 
(1.6E — 3)  (L5E-3) 
[1.7E- 3] [1.5E — 3] 
R? 0.509 0.506 
nR? Test 33.86* 13.89" 


: "This statistic is asymptotically distributed x? with 10 degrees of freedom. The critical 
value for x? (10 d.f.) at the 95 percent confidence level is 18.31. 
>This statistic is asymptotically distributed x? with 6 degrees of freedom, The critical 
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value for x? (6 d.f.) at the 95 percent confidence level is 12.59. 


significance of 8, is quite similar to the 
first-order result. The coefficients for the sec- 
ond-order terms are not estimated with a 
great deal of precision, and we cannot reject 
the joint hypothesis that both terms equal 
zero. 

Given the lack of precision in the second- 
order terms, we estimated a second model 
reported in the final column of Table 3, 
where the second-order terms are restricted 
to be equal. By restricting £5, = f, the esti- 
mates for the second-order terms are both 
negative but insignificant, and the estimates 
for B, and f, are quite similar to the first- 
order series results. Although the estimate 
for B, is quite different in both columns in 
Table 3, we cannot reject the hypothesis that 
the results in both columns are equal.!* This 
is not surprising given the closeness of the 
estimates for B, in both columns, the impre- 


12 The test statistic is asymptotically an F with 1 and 
245 degrees of freedom. The test statistic was 1.81, 
which is below the 95 percent critical value of 3.84. 


cision in both second-order terms, and the 
large variance for B, in column 1.” 

Although the second-order results are con- 
sistent with the theoretical predictions, the 
failure of any second-order terms to be sta- 
tistically significant suggests that the earlier 
first-order results represent a reasonable ap- 
proximation. The specification test results 
reinforce this conclusion. 


HI. Estimates with Logarithmic 
Utility Functions 


To obtain estimates of the entire utility 
function shape, as opposed to simply the 


Notice also that the nR? test indicates possible 
model misspecification. The large variances for the sec- 
ond-order terms and the rejection of the nR? test are 
not surprising given that the survey asked for only one 
response. The survey was originally designed to elicit 
the response AY for a given Ap. Without placing more 
structure on the utility function, it may be difficult to 
determine second moments of utility since respondents 
were not asked additional questions to indicate changes 
in the rate of tradeoff between p and Y. 
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utility differences and the relative marginal 


utility of the two states, one must impose 
additional structure on the utility functions. 
The particular functional form we have se- 
lected is the Cobb-Douglas parameteriza- 
tion, where U(Y) is of the form Y". Al- 
though this functional form does not have 
the same flexibility as the approximation in- 
volving the Taylor's series expansion, it is 
not extremely restrictive.!^ Upon taking log- 
arithms of the within-state utility, we obtain 
a logarithmic utility function where U(Y) 
equals u[log(Y )]. 

In the case of this model, the logarithmic 
formulation implies that 


(13) U(Y) — u[log( Y)], 
and 
(14) V(Y) 7 v[log(Y)], 


where u is a multiplicative parameter for the 
healthy state utility function and v is the 
parameter for the unhealthy state. If à? v, 
then the utility and the marginal utility of 
income are greater when the worker is in the 
good health state, which is the standard as- 
sumption in the literature. 

The logarithmic utility function is fre- 
quently used in finance contexts and in 
empirical applications analyzing von Neu- 
mann-Morgenstern "utility functions. This 
function embodies decreasing risk aversion, 
which is a common empirical phenomenon.!$ 

The major drawback of the logarithmic 
approach is that. the utility and marginal 
utility will each be governed by a single 
parameter. This formulation in effect links 
the tests of whether there is a utility drop in 
the ill health state (i.e, v « u) and whether 
there is a marginal utility decline (i.e, v/u 
«1), so that it does not provide an uncon- 


I^Melvyn A. Fuss, Daniel McFadden, and Yair 
Mundlak (1978) discuss its use in production contexts. 

15 Examples of the use of logarithmic utility functions 
in the finance literature abound. See Albert L. Kraus 
and Robert H. Litzenberger (1975), Mark E. Rubinstein 
(1977), and Paul A. Samuelson (1969). 

16See Keeney and Raiffa (1976). 
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strained test of the model.. However, the 
overall test of behavior was the subject of 
the Taylor’s series test, and one can view the 
logarithmic utility function as imposing more 
specific functional structure on the relation- 
ships that were shown to hold in Section II. 
The purpose of the additional structure is to 
obtain estimates that will be used in greater 
detail to examine attitudes toward risk. Al- 
though other functional forms for utility 
functions have appeared in the literature, 
these could not be used because of both the 
nature of the data and the iterative search 
procedure that was used.!" 

There are several types of checks on the 
realism of the model. Many of these checks 
are based on comparison with the Taylor's 
series estimates. First, does utility drop in 
the ill health state? The magnitude of any 
such drop is irrelevant since von Neumann- 
Morgenstern utility functions are unaffected 
by a positive linear transformation. Second, 
does the point estimate of the ratio of the 
marginal utility in ill health relative to good 


We also attempted to estimate equation (1) with 
two other specifications for utility: the constant risk 
aversion (CRA) utility function and the constant rela- 
tive risk aversion (CRRA) functions. However, these 
models are not icentified given the formulation of the 
problem, as presented in equation (1). The CRA utility 
function is typically denoted as U  — exp[— rY ], where 
r « 0. given the CRA specification, we are unable to 
obtain a closed-form solution for 6. Instead, we at- 
tempted to estimate the implicit equation EU, — EU, = s, 
where c is an iid. error with mean zero. Since there is 
no appropriate normalization of the parameters, we 
must estimate two variables, 7j in a healthy state, and 
r, in an unhealthy state. Given the properties of the 
CRA function, the sum of squared errors is minimized 
where 7, =n = 0, which forces utility to equal ane in all 
periods. Subsequently, the parameter values generate 
the equality 


EU, -EU = [(1- p)+ p]- [(1- 2) + 4] =0. 


The CRRA utility function is defined to be U= 
YO -9/(1— p), where p 21. As in the previous exam- 
ple, we are unable to obtain a closed form solution for 6 
so we must estimate the implicit equation EU, ~ EU, = e. 
Algorithmically, the sum of squared errors can be mini- 
mized by choosing extremely large values for both p, 
and p,. This has the property of forcing utility in all 
periods to machine zero and therefore the difference, 
EU, — EU,, is also zero. 


s 
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health differ statistically in the Taylor's se- 
ries and logarithmic utility function cases? 
Third, do the models provide similar esti- 
mates of [U(Y) - V(Y)]/V'(Y), which is a 
utility difference statistic that is unaffected 
by positive linear transformation of the util- 
ity function? Finally, we will provide a 
White-Domowitz (1984) test of whether the 
model is correct up to an additive indepen- 
dent error, thus providing a formal specifi- 
cation test. 

The requirement given by equation (5) 
above can be written as 


(15) (-p)w(Iog[ Y - 4)]) 
+ py (log( Yr,) 
-(1- p3) u(log[ ¥(1+8)(1-1,)]} 
+ pw {log[Y(1+8)r]}, 


which equates the expected utility of the 


initial job and the transformed job. 

Even in conjunction with the imposed 
functional form, it will not be possible to 
estimate both parameters, u and v. As a 
result, we will estimate their ratio a given by 


(16) 


As in the Taylor’s series case, the depen- 
dent variable is ô, the percentage wage in- 
crease that the respondent requires to face 
an increased risk. Using the normalization of 
equation (16), we solve for 6 to yield. 


K,~K, 
Us cwm eas: z] 


a=u/v. 


—I+e, 
where - 
K,= (1- p,)alog[ Y(1- 4)] 
+ pylog[ Yn], 
and 


K,=(1— p;) alog[ Y(1—t,)] 
+ p,log[ Yr, ]. 
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We will estimate equation (17) using nonlin- 
ear least squares, again assuming the error 
term is ii.d. with mean zero and finite vari- : 
ance. 

The principal hypothesis is that 


(18) 


or, for any given level of income, both the 
level of utility and the marginal utility of 
income are higher in the good health state. 
Two cases will be considered. First, we 
will estimate the homogeneous preference 
model in which all utility functions are iden- 
tical (i.e., a= æg), thus providing an average 
value for a across the sample. Second, we 
then permit a to vary with personal charac- 
teristic variables X,. Doing so leads to the 
heterogeneous preference assumption that 


a1, 


(19) a= a+ B,EDUC + B,TENURE, 


where the personal characteristic variables 
are education and job experience. 

The estimate of the homogeneous prefer- 
ence model appears as equation (1) in Table 
4. The estimate of ay is clearly statistically 
different from zero (asymptotic t —134), but 
the more relevant issue is whether a differs 
from 1.0. The estimate for the full sample 
that ag equals 1.077 lies 9.6 standard devia- 
tions above 1.0, so one can reject the hy- 
pothesis that u = v, indicating that both the 
utility level and the marginal utility are 
greater in the good health state. 

The.a, values for each of the chemical 
subsamples are also above 1.0. Both the TNT 
and asbestos a, values are not significantly 
different from each other or the full sample 
results. However, one can reject the hypothe- 
sis that all of the a coefficients are identi- 
cal. Nevertheless, the magnitude of the ay 
value for chloroacetophenone is not greatly 
different in magnitude (for example, its a 
confidence interval overlaps with that for 
asbestos), and the overall structure of the 
utility function implied by the results is very 
similar to that for the other chemical sub- 
samples. The additional structure imposed 
by the logarithmic utility function may have 
muted some of the differences across label- 
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TABLE 4—NONLINEAR LEAST-SQUARES ESTIMATES OF 
LOGARITHMIC UTILITY MODEL 








< Coefficient Estimate, (Standard Error), and 
[Heterscedastic Consistent Standard Error] 
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; Model 1 Model 2 
Full Chloroaceto- Full 
Parameter Sample TNT Asbestos phenone Sample 
Intercept 1.294 
(0.033) 
[0.033) 
EDUC — 0.013 
(0.002) 
[0.002] 
TENURE — 0.004 
(0.001) 
[0.001] 
&o 1.077 1.094 1.065 1.043 1.082* 
(0.008) (0.013) (0.012) (0.018) (0.016) 
f0.009] [0.014] [0.015] [0.022] [0.016] 
R? 0.363 0.147 0.208 0.436 0.466 
nR? Test of : 
Specification 12.005 1.63? 6.55? 103 . 3413* 
Observation 249 78 87 £4 249 





„Evaluated at sample averages for EDUC and TENURE. 
>This test statistic is asymptotically distributed x? with 1 degree of freedom. The 
critical value for x (1 d.£) at the 95 percent confidence level is 3.84. 
°This statistic is asymptotically distributed x? with 6 degrees of freedom. The 
critical value for x? (6 d.f.) at the 95 percent confidence level is 12.59. 


ing groups that were apparent in the Taylor's 
series results.  . 

The relative discrepancy | between the 
marginal utility in the two health states is 
narrower for the logarithmic model than the 
first-order Taylor's series expansion. The ra- 
tio of marginal utilities, V'(Y)/U'(Y), is 
given by 1/a for the logarithmic model and 
by B, in the case of the Taylor's series model. 
The estimates are 0.93 and 0.78, respectively. 
In each case, the accident lowers the utility 
and marginal utility of income, but the Tay- 
lor's series estimates imply a greater relative 
gap in the marginal utilities. The 95 percent 
confidence intervals for V'(Y )/U'(Y) for the 
two different estimation approaches overlap 
(full sample results). 

Another comparison that is meaningful, 
given possible differences in the utility met- 
ric, is the ratio of the utility difference to the 


marginal utility of income when injured. One 
establishes a comparable metric for utility 
differences by dividing by a marginal utility 
term. The value of [U(Y)— V(Y)]/V'(Y) is 
216 for the full sample Taylor's series results 
and is 179 for the full sample logarithmic 
results—a difference of under 20 percent. 

As in the case of the Taylor's series re- 
sults, the adjusted standard errors are similar 
to those that have not been adjusted for 
heteroscedasticity. In addition, for Model 2 
(but not Model 1), one cannot reject the 
hypothesis that the specification is correct up 
to an additive independent error (see Ap- 
pendix). This result is not inconsistent with a 
similar finding for the first-order Taylor's 
series model since one can view the flexible 
form of the Taylor's series model as provid- 
ing an approximation to the logarithmic for- 
mulation. 
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TABLE 5— SIMULATION RESULTS, SENSITIVITY OF ESTIMATES 
TO OVERESTIMATE OF $ 





i . Coefficient Estimates and Standard Errors 
Percent Reduction in ô `: 





Taylor's Series Model Logarithmic Model 
in percent ` Bı B R? |" a R? 

5 158.2 0.740 0.513 1.069 0.367 
(11.25) (0.125) (0.007) 

10. : 147.9 0.707 0.522 1.061 0.377 
(10.67) — (0.116) (0.007) 

25 1182 0.608 0.544 1.036 0.377 
(8.89) (0.091) (0.006) 


The imposition of additional structure with 
the logarithmic model has also facilitated 
the estimation of variations in individual 
preferences. Recall from equation (19) that 
positive coefficients imply that the particular 
demographic group has a higher utility of 
income value and higher marginal utility of 
income when in good health relative to ill 
health. Both education and tenure have neg- 
ative signs, implying that there is less of a 
drop in the marginal utility after an accident 
for these workers. : 

A possible bias in the model may arise if 
workers responded strategically to the sur- 
vey, for example, by exaggerating claims of 
the required wage increase 6. To check our 
results against a possible bias in 6, we reesti- 
mate the Taylor's series and the logarithmic 
: model after systematically depressing the re- 
sponse variable ô. This simple test illustrates 
whether our results are sensitive to exagger- 
ated claims of 6. Table 5 reports the test 
results from reducing the values of 6 by 5, 
10, and 25 percent. 

The sensitivity analysis for the logarithmic 
model indicates that the parameter a is sen- 
sitive to a possible bias in the response vari- 
able 6. However, even a 25 percent overesti- 
mate in ô does not alter the basic conclusion 
that a is significantly greater than 1. Like- 
wise, the general character of the results for 
the Taylor's series case are not altered as ô is 
decreased. As the percentage reduction in 6 
is increased, the primary parameter of inter- 
est, £4, actually declines in value, indicating 
that the choice between the health state and 


the monetary loss model is not biased by a 
strategic response for 6. 


IV. Economic Implications 


Without knowledge of the shape of indi- 
vidual preferences, the domain of economic 
inquiry is largely limited to a single issue— 
the local rate of tradeoff between risk and 
money. Using the estimates of the logarith- 
mic utility function, we will extend the do- 
main of inquiry to assess how risk-money 
tradeoffs vary with the base level of risk, the 
extent of the risk change, and individual 
income. We will also estimate the optimal 
workers compensation replacement rate. 
Knowledge of the utility function enables us 
to address a variety of concerns that have 
been central to the risk bearing field but 
which have never been addressed empiri- 
cally. 


A. Variation in the Implicit Value of 
Statistical Injury with the Base Risk 


The most useful means for expressing the 
risk-money tradeoff is in terms of the dollar 
compensation required per unit of risk. This 
rate of ‘tradeoff can be calculated for 
marginal changes of risk as it represents the 
value of dY/dp for a given value of ex- 
pected utility.!® At the mean risk level for the 


J8Using the formula in Viscusi (1979, p. 12), the 
expected utility formulation from the left side of equa- 
tion (15), and assuming that an individual works 50 
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TABLE 6 — EFFECT OF THE BASE RISK 
LEVEL ON THE IMPLICIT VALUE OF AN INJURY 


Implicit Dollar Value of Injury 
Base Risk Level Logarithmic — Taylor's Series 
0.0 13,262 11,313 
0.1 13,357 11,569 
0.2 13,454 11,838 
0.3 13,553 12,119 
0.4 13,653 12,414 
0.5 13,754 12,724 
0.6 13,857 13,050 
0.7 13,961 13,392 
0.8 14,067 13,753 
0.9 14,174 14,134 
1.0 14,284 14,537 
0.085 13,343 11,530 
(Sample Mean) 


sample of approximately 0.085, the logarith- 
mic utility function estimates yield a value of 
an injury of $13,343 (1982 dollars). As the 
value of injury figures reported in Table 6 
indicate, there is not substantial variation in 
the implicit value of an injury with the base 
risk level, as the range is from $13,262 to 
$14,284. 

The Taylor's series results can be used to 
generate similar estimates of the implicit 
value of injury, which is dY/dp, holding 
EU, constant.? The implicit value of injury 


weeks per year, the implicit value of a statistical injury 
Z in the logarithmic utility function case is given by 


[ein(Y(1- £?)) -1n vr] 


Z-50Y 
(1- p)a* p 
The statistic can be written as 
oY Uu(Y(i—-t))- V(r) 








êp  (1- p)U'(Y(1- 0))* pV (Yr) 
The explicit characterization of Z in the Taylor's series 
model is generated by using a first-order series expan- 


sion to approximate U[Y(1— t)] and V[¥r] about Y, - 


and a first-order series to approximate U'[Y(1— 1)] and 
V'IYr] about Y. Defining the parameters Bj B Bs 
Bog, and f, as before, and assuming the worker is 
employed for 50 weeks, we can write Z as 


"EE B, — Yik — Y(r - 1) 
(1— p)f + phs- (1— p)Ytfi, + pY(r -1)Ba- 





The calculation uses the Taylor's series results from the 
second column of Table 3 and the mean risk level for p. 


JUNE 1990 


for the unconstrained ‘second-order Taylor's . 


series estimates is $11,530, which is some- ` ` 


what below the logarithmic estimate. As the 
results in the final column of Table 6 indi- 
cate, the variation in the implicit value of an 
injury with the risk level follows the same 


. general pattern as in the logarithmic case, 


but is somewhat greater. Using the estimates 
in which we constrain the statistically in- 
significant second-order terms to equal zero 
(see final column in Table 3), the implicit 
value of an injury rises to $12,057. This 
estimate is closer to the logarithmic utility 
function result. 

The variation of the injury value with the 
risk level is of independent economic inter- 
est. Several economic models predict that the 
valuation of a risk change should be an 
increasing function of the base risk level. 
The source of this effect is the lower oppor-. 
tunity cost of resources with high risk levels. 
At high levels of risk, the probability of 
spending the money when in good health is 
less. Since the utility and marginal utility of 
money is less when one is injured than when 
one is healthy, for any given income level, 
the additional expected utility produced by 
wage compensction is reduced by increases 
in the base risk. 

The types of variations that are predicted 
theoretically are borne out by the results in 
Table 6 for both the logarithmic and Taylor's 
series cases. The additional compensation 
required to accept an increase in risk is 
greater for high basé risks. The Table 6 
results also ind:cate a change in the tradeoff 
with the base risk. These patterns follow 
economic predictions, as 0Z/àp » 0 and 
à?z/üp? p*>0. 


B. Income Elasticity of the Value 
of an Injury 


On a theoretical basis, the value of an 
injury should increase with individual in- 
come and wealth, and available labor market 


20 For discussion of this and related issues, see 
Viscusi (1979) and Weinstein, Shepard, and Pliskin 
(1980). 
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data are consistent with this relationship (see 
Viscusi (1978, 1979)). However, the extent of 
the observed income effects have not been 
large since existing data sets are not well- 
suited to disentangling the role of compen- 
sating differentials for risk and income ef- 
fects that govern job choice. 

` At the mean value of an injury for the 
sample, the income elasticity of the value of 
an injury in the logarithmic case is 1.0995.” 
Thus, the value of an injury is roughly pro- 
portional to one's base income level. Using 
Taylor's series results from the final column 
in Table 3, the parameter estimates suggest 
that the elasticity in the Taylor's series case 
is approximately 0.67.7 In the health insur- 
ance context, estimated income elasticities 
are generally lower— typically 0.5 or less.” 
One might expect the health insurance in- 
come elasticity to be below the elasticity of 
the value of an injury since demand will be 
muted to the extent that additional health 
expenditures have a diminishing, probabilis- 
tic effect on one's well-being. 


? For the logarithmic case, the income elasticity e of 
the value of an injury is given by 

Y aZ i a-1 
= — — =] + — 

Z ðY eln(Y(1- ()) -In(Yr) ' 


€ 


which is obtained by using the value of Z from fn. 18. 

We can calculate e for the second-order Taylor's 
series model by differentiating Z with respect to Y and 
multiplying by Y/Z. Assuming U "(.)V(-)—0, 
the value for e can be written as 


pa A — Bs — tYBo, - Y(r -1)85 
B, — (YR, - Y(r - 1), 


Y 
~ (=p) B+ phs- (1- p) YBa + Y(r 1) By 





Given the imprecision with which ‘the second-order 
terms are estimated, we use the results from column 2 of 
Table 3 where fj, is restricted to equal fj, to calculate 
the value for Z. Income elasticities cannot be derived 
using the first-order results. 

23The income elasticity for health insurance esti- 
mates and the underlying theory are discussed in Joseph 
P. Phelps (1973), Charles E. Newhouse and Phelps 
(1976), and Phelps (1987). 
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The estimates of the income elasticity of 
injuries indicate that the value placed on 
individual health is not a constant, but ex- 
hibits substantial heterogeneity (see Viscusi, 
1979, 1983). Knowledge of the income elas- 
ticity of the value of statistical injuries is 
likely to be particularly useful in the valua- 
tion of government programs with long-term 
effects since the growth in income over time 
will boost the value of the risks reduc- 
ed, offsetting much of the influence of dis- 
counting. 


C. The Value of Non-Incremental 
Risk Changes 


In some cases the risk change that must be 
valued does not involve a small incremental 
change in the probability. Although medical 
contexts create the greatest opportunities for 
quantum changes in tbe risk level, changes in 
large individual risks resulting from govern- 
ment regulation (for example, seatbelt use 
requirements) pose similar problems. 

From an economic standpoint, individuals 
should exhibit a diminishing marginal valua- 
tion of risk reduction and an increasing 
marginal acceptance price for risk increases. 
Market risk data do not enable one to ad- 
dress these issues since the observed risk 
changes tend to be small.” 

Knowledge of utility functions enables one 
to make such assessments, as Table 7 sum- 
marizes the value of non-incremental risk 
changes from the starting point of the mean 
injury risk of 0.085. The purchase of a risk 
reduction of — 0.085 is tantamount to com- 
plete elimination of the risk. Using the loga- 
rithmic estimates, there is an associated value 
per unit risk reduction of $12,865 for such a 
complete elimination of the risk. Similarly, 
there is a $8,989 value for the first-order 
Taylor’s series estimates. At the opposite 


2 The predicted pattern of behavior has been borne 
out in -an experimental consumer context by W. Kip 
Viscusi, Wesley A. Magat, and Joel C. Huber (1987). In 
their study, the marginal valuations of successive risk 
reductions -were elicited directly, whereas here we will 
estimate the value of non-incremental changes using the 
estimated utility function. 
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TABLE 7— DEPENDENCE OF THE IMPLICIT VALUE OF AN INJURY ON THE EXTENT 


OF THE RISK CHANGE 
Implicit Dollar Value of an Injury 

Risk Increment 
from a Sample Mean (0.085) Logarithmic Taylor's Series 
—0.085 12,865 . 8989 
— 0.050 13,059 9,131 
—0.010 13,286 9,299 
0.010 13,401 9,386 
-- 0.050 13,637 9,563 
-- 0.250 14,918 10,566 
+0.500 16,792 12,157 
+0.750 19,041 14314 - 
-- 0.915 20,777 


extreme, workers would require a risk-dollar 
tradeoff of $20,777 (logarithmic) or $16,213 
(Taylor's series) to incur an increase in the 
injury probability from 0.085 to 1.0, or a risk 
increase of -- 0.915. 

These results suggest how non-incremental 
risk changes differ in value from estimates 
based on small marginal changes in the risk. 
The value of an injury of $13,343 (logarith- 
mic) for incremental changes at: the sample 
mean is only 4 percent larger than the injury 
value associated with complete. elimination 
of the risk since the initial injury probability 
is close to zero. The injury value associated 
with risk increases to a risk of 1.0 is 56 
percent greater (logarithmic) than the value 
at the mean since the risk change is quite 
substantial. In addition, the change in the 
value of an injury increases at an increasing 
rate as the risk level rises. For example, the 
implicit value of an injury (logarithmic) rises 
at 1.67 times the rate over the interval 
(+ 0.750, + 0.915), as compared with the in- 
terval that it did over (0.0, + 0.250). A similar 
pattern is observed in the Taylor's series 
case. Individuals demand increasingly large 
prices per unit risk for successive risk in- 
creases and are willing to pay successively 
smaller amounts for additional risk de- 
creases, as predicted.” 


?5See Viscusi, Magat, and Huber (1987) for a deriva- 
tion and review of the antecedents in the literature. 
Milton C. Weinstein, Donald S. Shepard, and Joseph 
Pliskin (1980) present a related discussion for fatality 
risks. 


16,213 


D. The Optimal Workers’ Compensation 
Earnings Replacement Rate 


At present, the workers’ compensation 
earnings replacement rate is based on an 
algorithm that typically set the benefit equal 
to two-thirds of the worker’s gross wage rate, 
subject to ceriain minimum benefit levels, 
maximum benefit levels, and benefit dura- 
tion amounts. Since the marginal utility of 
income is reduced by an injury, as our re- 
sults for both the Taylor’s expansion and 
logarithmic model indicate, less than full 
earnings replacement is desirable. 

How much earnings replacement is de- 
sirable cannot: be determined based on 
available labor market data.?$ Using worker 
utility functions, a precise assessment is pos- 
sible. In particular, suppose that workers 
must purchase workers’ compensation 
through an insurance market. If the risk of 
injury is p, then the price of actuarially fair 
insurance is p/(1— p) and the cost of buy- 


©The most that can be done is to assess the wage 
offset that workers are willing to accept in return for 
workers’ compensation benefits and compare this offset 
with what would b2 observed if insurance were optimal. 
The estimates in Viscusi and Moore (1987) imply that 
the levels of benefits were suboptimal in the 1970s, but 
the extent of the suboptimality could not be deter- 
mined. Estimates “or the 1980s in Moore and Viscusi 
(forthcoming) indicate that substantial increases in the 
benefit levels since the 1970s have led to a situation in 
which current replacement rates are close to the optimal 
level. 
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ing insurance in the healthy state that yields 
' payoff rY when injured is prY/(1— p). If 
there is some insurance loading factor h(h >= 
1) to cover administrative costs and a return 
to the insurance industry, then the cost be- 
comes AprY/(1— p). 

The task of ascertaining the optimal insur- 
ance policy in the logarithmic utility func- 
tion case is to 


Max V =(1—p)aén{ ¥[1—(hpr/(1-p))] 


(1—1)} + pén(Yr). 


After substituting for the appropriate nu- 
merical values and taking the partial deriva- 
tive with respect to r, one obtains the result 
that 


hr = 0.85. 


If workers’ compensation were provided on 
an actuarially fair basis, the optimal replace- 
ment rate would be 85 percent. Less than 
full earnings replacement is desirable since 
the marginal utility of income is lower in the 
ill health state. Taking into account the role 
of taxes, an earnings replacement of 0.85 of 
gross earnings does replace most of the 
worker's after tax income. 

Under the current workers’ compensation 
system, administrative costs are nontrivial, 
so that after the insurance loading costs are 
taken into account workers receive 80¢ for 
each dollar contributed, or for each dollar of 
benefits they pay. $1.25 in premiums (i.e., 
h —1.25) After taking these costs into ac- 
count, the optimal replacement rate is 0.68. 

The current workers compensation for- 
mulas that provide for two-thirds wage re- 
placement are close to optimal, given the 
role of administrative costs. The role of ben- 
efit caps and other provisions, however, re- 
duces the effective replacement rate to only 
0.64, which is slightly below this amount. In 
addition, if our reference point for benefit 
- provision is what would be optimal if there 
were actuarially fair insurance available, then 
there is a much more substantial divergence 
from the optimal amount. 
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V. Conclusion 


Analyses of risky decisions using market- 
based data are by necessity restricted to uti- 
lizing the information generated by the ob- 
served local tradeoff revealed in the market. 
Although this literature has yielded many 
profitable insights, the domain of inquiry has 
been substantially limited. 

In this paper we explored the implications 
of knowing two wage-risk combinations 
along the individual's indifference map. This 
information was developed based on a sur- 
vey of worker responses to the risks in- 
dicated by hazard warnings. The overall 
objective was to assess individuals’ utility 
functions for good health and ill health, 
which will convey much more information 
about the character of individual preferences 
than the local tradeoff. 

The two approaches that were used—a 
Taylor's series expansion with respect to a 
general functional form and a logarithmic 
utility function—each yielded similar re- 
sults. Since being injured will clearly reduce 
the level of utility, the main question of 
interest is how the marginal utility of income 
is affected by an injury. In each case, the 
marginal utility of a given level of income 
was greater when healthy than when injured. 

This result has fundamental implications 
for the optimal level of insurance since it 
implies that less than full insurance of in- 
come losses is optimal. This type of result 
has played a major role in the health eco- 
nomics and social insurance literature, but 
except in the case of death, the empirical 
foundation for making this determination 
has been lacking. 

Even more striking is that the estimates of 
the logarithmic utility function enable us to 
ascertain not only whether less than full 
insurance is optimal but also what the opti- 
mal level of insurance is. In particular, we: 
showed that the optimal earnings replace- 
ment rate for workers’ compensation is 85 
percent if insurance is provided on an actu- 
arially fair basis and 68 percent if insurance 
is provided at the current degree of insur- 
ance loading. In each case, current benefit 
levels are slightly suboptimal, as has been 
shown using a different methodology by W. 
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Kip Viscusi and Michael J. Moore (1987), 
but in this case we can ascertain the extent 
of the divergence from optimality as well. 

Knowledge of the utility function shape 
enables us to address a variety of other is- 
sues that have long been the subject of theo- 
retical inquiry and empirical speculation. 
Perhaps the most ‘striking result is the in- 
come elasticity of thé value of an injury, 
which was found to range from 0.67 (Taylor’s 
series) to roughly 1.0 (logarithmic case). This 
result enables one to make precise distinc- 
tions with respect to the heterogeneity in the 
value of risk-dollar tradeoffs across income 
groups. The greatest policy importance of 
this result is with respect to deferred risks 
since it indicates that the injury value figures 
used for deferred risk reductions should take 
into account the income growth of those 
affected by the regulations, leading to an 
adjustment that will serve to mute much of 
the role of discounting. These results may 
prove to be particularly useful in assessing 
the value of risk reduction to future genera- 
tions. 

There were two other types of concerns 
for which we obtained new results because 
of our focus on risk-dollar tradeoffs in more 
than a local region. As predicted by several 
theoretical analyses, increases in the base 
risk reduced the implicit value of an injury. 
The empirical sensitivity of the results to the 
base risk level was not, however, great. 

Of much greater consequence was the 
change in the implicit value of an injury with 
the magnitude of any non-incremental risk 
change. Implicit values of an injury associ- 
ated with the purchase of risk reductions 
diminished at an increasing rate as the ex- 
tent of the risk reduction increased, and the 
implicit values associated with compensation 
for a risk increase rose at an increasing rate 
as the extent of the non-incremental risk 
change increased. As with the earlier results, 
these patterns are consistent with a rational 
economic choice model and lie outside the 
scope of concerns that can be addressed 
using market data. 

Analysis of the survey experiment on 
worker responses to changes in their job risk 
. has greatly expanded the range of risk-dollar 

tradeoffs that can be addressed empirically. 
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The most reassuring aspect of the findings is 
that even the more refined predictions of the 
expected utility model with health state- 
dependent utility functions are borne out. 
Knowledge of the utility function shape also 
enables us to address for the first time many 
issues that have played a central role with 
respect of the economic performance and 
optimal government policies in coniexts in- 
volving risk. 


APPENDIX 
SPECIFICATION TESTS 


In this appendix we will summarize the results of the 
White-Domowitz (1984) specification tests. These tests 
have two objectives. First, they provide a formal test of 
the homoscedasticity assumption. Second, they provide, 
a test of whether the model specification is correct up to 
an additive error term. 

Consider first the logarithmic case. To perform the 
test, we must first define some terms. Le: @=6— 
f(x B), and let g; (B) be the jth element of the 
gradient 0f ( X;,/),/6B, where the gradient is evaluated 
at the estimated parameter vector, f. Define the vector 
$; to be formed by all nonredundant cross products of 
the gradient, g;-(B)g4(B), for i, j€ (1,2,..., p). By 
definition, $; has a maximum length of p( p  1)/2. Let 
n equal the number of observations. The test statistic is 
generated from the regression of the square of the 
predicted residual on the vector 4; and a constant, 


& — yw, 


where y isa p(p t 1)/2X1 vector of parameters to be 
estimated. The test statistic is formed by multiplying the 
number of observations times the (constant adjusted) 
R? of the above regression, The statistic is dis-ributed as 
a x? with p( p +2)/2 degrees of freedom. If nR? is less 
than the critical value of the x? distribution, one ac- 
cepts the null hypothesis of no heteroscedasticity. 

The nR? test is also a test of the model specification. 
Rejection of the null hypothesis can be due either to 
heteroscedasticity or model misspecification. If one ac- 
cepts the null hypothesis, then the model is correct up 
to an independent additive error term. In the first-order 
Taylor's series case, the nR? statistic of 1.469 for the 
full sample is well below the critical value of 7.81 at the 
95 percent confidence level. The TNT and asbestos 
subsamples also have nR? values below the critical 
level, but the chloroacetophenone sample does not. Ex- 
cept for the chloroacetophenone subsample results, het- 
eroscedasticity is not a problem, and there is also no 
evidence of statistically significant misspecification of 
the model. 

The results foz the logarithmic case, which are sum- 
marized at the bottom of Table 4, are similar. Consider 
the full sample results. The nR? statistics are 12.00 for 
Model 1 and 12.82 for Model 2, which is above the 
critical 95 percent confidence levels of 3.84 for Model 1 
but below the critical level of 23.69 for Model 2. Simi- 
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larly, the Model 1 results for asbestos are below the ` 


critical statistic for TNT and asbestos is not. Thus, one 
cannot reject either (i) the assumption of homoscedastic 
errors or (ii) the assumption that the model specification 
is correct up to an additive error term for Model 2 or 
for two subsample estimates of Model 1 (TNT and 
chloroacetophenone). - 

Although we cannot reject the hypothesis that both 
models are correctly specified, this result is not contra- 
dictory since we can consider the Taylor’s series. as 
simply approximating the logarithmic function. Addi- 


tional evidence for this conclusion is in the closeness of. 


‘the two estimates B, and 1/a. 
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Comparing Information in Forecasts from 
Econometric Models 


By Ray C. FAIR AND ROBERT J. SHILLER* 


The information contained in one model’s forecast compared to that in another 
can be assessed from a regression of actual values on predicted values from the 
two models. We do this for forecasts of real GNP growth rates for different pairs 
of models. The models include a structural model (the Fair (1976) model), various 
versions of the vector autoregressive (VAR) model, and various versions of a 


model we call the 


“autoregressive components" (AC) model. Our procedure 


requires that forecasts make no use of future information, and we have been 
careful to try to insure this, including using the version of the Fair model that 
existed in 1976, the beginning of our test period. (JEL 132) : 


Many econometric models are used to 
forecast economic activity. These models 
differ in structure and in the data used, ànd 
so their forecasts are not perfectly correlated 
with each other. How should we interpret 
the differences in forecasts? Does each model 
have a strength of its own, so that each 
forecast represents useful information unique 
to it, or does one model dominate in the 
sense of incorporating all the information in 
the other models plus some? 

Structural econometric models often make 
use of large information sets in forecasting a 
given variable. The information set used in a 
large-scale macroeconometric model is typi- 
cally so large that the number of predeter- 
mined variables exceeds the number of ob- 
servations available for estimating the model. 
Estimation can proceed effectively only be- 
cause of the large number of a priori restric- 
tions imposed on the model, restrictions that 
do not work out to be simple exclusion re- 
strictions on the reduced form equation for 
the variable forecasted. 

Vector autoregressive (VAR) models are 
typically much smaller than structural mod- 


. *Cowles Foundation, Box 2125, Yale Station, New 
Haven, CT 06520. We are indebted to the referees for 
many helpful comments, in particular for pointing out 
some of the special cases discussed at the end of Sec- 
tion I. This research was supported by the National 
Science Foundation under grant nos. SES-8520075 and 
SES-8717080. . 


els and in this sense use less information. 
The above question with respect to VAR 
models versus structural models is thus 
whether the information not contained in 
VAR models (but contained in structural 
models) is useful for forecasting purposes. In 
other words, are the a priori restrictions of 
large-scale models useful in producing de- 
rived reduced forms that depend on so much 
information, or is most of the information 
extraneous? 

One cannot answer this question by doing 


' conventional tests of the restrictions in a 
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structural model. These restrictions might be 
wrong in important ways and yet the model 
contain useful information. Even ignoring 
this point, however, one cannot perform such 
tests with most large-scale models because, 
as noted above, there are not enough obser- 
vations to estimate unrestricted reduced 
forms. : 

We will examine the question whether one 
model’s forecast of real GNP carries differ- 
ent information from another's by regressing 
the actual change in real GNP on. the fore- 
casted changes from the two models. This 
procedure, which is discussed in the next 
section, is related to the literature on encom- 
passing tests? and the literature on the opti- 


1See, for example, Russell Davidson and James G. 
MacKinnon (1981), David F. Hendry and Jean-Francois 
Richard (1982), Yock Y. Chong and David F. Hendry 
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mal combination of forecasts.2 From our 
point of view, this procedure has two advan- 
tages over the standard procedure of com- 
puting root mean squared errors (RMSEs) to 
compare alternative forecasts. First, if the 
RMSEs are close for two forecasts, little can 
be concluded about the relative merits of the 
two. With our procedure, as will be seen, one 
can sometimes discriminate more. Second, 
even if one RMSE is much smaller than the 
other, it may still be that the forecast with 
the higher RMSE contains information not 
in the other forecast. There is no way to test 
for this using the RMSE framework. | 

It should be stressed that our procedure 
does not allow us to discover whether all the 
variables in a-model contribute useful infor- 
mation for forecasting. If, say, our regression 
results reveal that a large model contains all 
the information in smaller models plus some, 
it may be that the good results for the large 
. model are due to a small subset of it. We can 
only say that the large model contains all the 
information in the smaller models that it has 
been tested against, not that it contains no 
extraneous variables. 

We compare the forecasts from the struc- 
tural model in Ray C. Fair (1976) with those 
of various atheoretical models. The atheoret- 
ical models include various versions of the 
VAR model and various versions of a model 
we call the “autoregressive components" 
(AC) model. The AC model divides GNP 
into components and estimates an autore- 
gressive equation for each component. Like 
the Fair model, the AC model uses a fairly 
large amount of information on the compo- 
nents of GNP and yet it has a simple struc- 
ture. Like parts of the Fair model, it uses 
lagged values of the components in the de- 
termination of the current values. The vari- 
ous versions of the AC model differ in their 
degree of disaggregation of the components, 


(1986), and Grayham Mizon and Jean-Francois Richard 


(1986). See also Charles R. Nelson (1972) and J. Phillip 
Cooper and Charles R. Nelson (1975) for an early use 
of encompassing-like tests. 

?See, for example, Clive W. J. Granger and Paul 
Newbold (1986). 
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which allows us to examine how disaggrega- : 
tion affects the forecasts. 

The AC model is of use to compare to the 
VAR model as well as to a structural model 
like the Fair model. It will be of interest to 
see within the class of atheoretical models 
how it compares to the VAR model. If, for 
example, it is found to contain information 
not in the VAR model, this indicates that the 
VAR model is missing useful information in 
the components. The AC model can be 
looked upon as an alternative to the VAR 
model when simple atheoretical models are 
desired. 

Our procedure requirés that forecasts be 
based only on information available prior to 
the forecast period. We call forecasts that 
meet this requirement "quasi ex ante" fore- 
casts. To guard against future information 
creeping into the specification of the model 
and thus the forecasts, we chose the version 
of the Fair model that existed in 1976 and 
based all the tests on the period since 1976. 
We also converted the model to a model 
with no exogenous variables by adding an 
autoregressive equation for each exogenous 
variable in the model: Finally, we estimated 
the model (including the exogenous-variable 
equations) thrcugh period ¢ —1 for each new 
beginning period ¢ of the forecast. Using 
these “rolling” estimation forecasts is impor- 
tant because in doing so we are producing 
the actual forecasts that one could make 
with the model as time progresses. 

We followed the same rolling estimation 
procedure for the. VAR. and AC models. 
These models have no exogenous variables, 
and so no adjustment was needed here. The 
Robert B. Litterman prior (1979) that we use 
for one of the versions of the VAR model 
was published near the beginning of the test 
period. Thé AC model is new, but it has 
such a simple structure that it is unlikely to 
be accused of having had future information 
used for its specification. 

The quasi ex ante forecasts that we gener- 
ate may have different properties from fore- 
casts made with a model estimated with fu- . 
ture data. If the model is misspecified (for 
example, parameters change through time), 
then the rolling estimation forecasts (where : 
estimated parameters vary through time) may 
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carry rather different information from fore- 
casts estimated over the entire sample.? Also, 
some models may use up more degrees of 
freedom in estimation than others, and with 
varied estimation procedures it is often very 
difficult to take formal account of the num- 


ber of degrees of freedom used up. In the. 


extreme case where there were so many pa- 
rameters in a model that the degrees of 
freedom were completely used up when it 
was estimated (an obviously over parameter- 
ized model), it would be the case that the 
forecast value equals the actual value and 
there would be a spurious perfect correspon- 
dence between the variable forecasted and 
the forecast. One can guard against this de- 
grees of freedom problem by requiring that 
no forecasts be within-sample forecasts.* 


I. The Procedure 


Let ,:,¥;, denote a forecast of Y, (in our 
application, log real gross national product 
at timet) made from model 1 using informa- 
tion available at time ? —5 and using the 
model's estimation procedure and forecast- 
ing method each period. Let , ,Y,, denote 
the same thing for model 2. (In the notation 
above, these two forecasts are 
ante" forecasts. The parameter s is the 
length ahead of the forecast, s > 0. Note that 
the estimation procedure used to estimate a 
model and the models forecasting method 
are considered by us as part of the model; 
we take no account of these procedures here. 


?Even if the model is not misspecified, estimated 
parameters will change through time due to sampling 
error. If our purpose were to evaluate the forecasting 
ability of the true model (i.e, the model with the true 
coefficients), we would face a generated regressor prob- 
lem. However, we are interested here in the perfor- 
mance of the model and its associated estimation pro- 
cedure. If one were interested in adjusting for generated 
regressors, the correction discussed in Kevin M. Mur- 
phy and Robert H. Topel (1985) could not be directly 
applied here because the covariance matrix of the coef- 
ficient estimates used to generate the forecasts changes 
through time because of the use of the rolling regres- 
sions. Murphy and Topel require a single covariance 
matrix. U 

^Nelson (1972) and Cooper and Nelson (1975) do 
not require the forecasts to be based only on informa- 
tion through the previous period. 


“quasi ex: 
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We consider the followin regression equa- 
tion: 


(1) Y, => Y., e t B(, Y, E Y 


a Cae b: E Ys] Ta. 


If neither model 1 nor model 2 contains any 
information useful for s-period-ahead fore- 
casting of .Y,, then the estimates of B and y 
should both be zero. In this case the estimate 
of the constant term a would be the average 
s-period-change in Y. If both models contain 
independent information? for s-period-ahead 
forecasting, then f and y should both be 
nonzero. If both models contain informia- 
tion, but the information in, say, model 2 is 
completely contained in model 1 and model 
1 contains further relevant information as 
well, then B but not y should be nonzero. (If 
both models contain the same information, 
then the forecasts are perfectly correlated, 
and f and y are not separately identified.) 
The procedure we are proposing in this 
paper is to estimate equation (1) for different 
models' forecasts and test the hypothesis H, 
that B — 0 and the hypothesis H, that y = 0. 
H, is the hypothesis that model 1's forecasts 
contain no information relevant to forecast- 
ing s periods ahead not in the constant term 
and in model 2, and H, is the hypothesis 
that model 2's forecasts contain no informa- 
tion not in the constant term and in model 1. 
As we noted above, our procedure bears 
some relation to encompassing tests, but our 
setup and interests are somewhat different. 
For example, it does not make sense in our 
case to constrain f and y to sum to one, as. 
is usually the case for encompassing tests. If 
in our case both models' forecasts are just 
noise, the estimates of both 8 and y should 
be zero. Also, say that the true process gen- 
erating Y, is Y, equal to X,+ Z,, where X, 
and Z, are independently distributed. Say 
that model 1 specifies that Y, is a function of 


51f both models contain “independent information" 
in our terminology, their forecasts will not be perfectly 
correlated. This can arise either because the models use 
different data or because they use the same data but 


impose different. restrictions on the reduced form. 
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X, only and that model 2 specifies that Y, is 
a function of Z, only. Both forecasts should 
thus have coefficients of one in equation (1), 
and so in this case f and y would sum to 
two. It also does not make sense in our setup 
to constrain the constant term a to be zero. 
If, for example, both models' forecasts were 
noise and we estimated equation (1) without 
a constant term, then the estimates of 8 and 
y would: not generally be zero when the 
mean of the dependent variable is nonzero. 
It is also not sensible in our case to as- 
sume that u, is identically distributed. It 
seems quite likely that u, is heteroskedastic. 
If, for example, a = 0, B —1, and y — 0, u, is 
simply the forecast error from model 1, and 
in general forecast errors are heteroskedastic. 
Also, we will be considering four-period- 
ahead forecasts in addition to one-period- 
ahead forecasts, and this introduces a third- 
order moving average process to the error 
term in equation (1). We correct for both 
heteroskedasticity and the moving average 
process in the estimation of the standard 
errors of the coefficient estimates. We use 
the procedure given by Lars Peter Hansen 
(1982), Robert E. Cumby, John Huizinga, 
and Maurice Obstfeld (1983), and Halbert 
White and Ian Domowitz (1984) for the esti- 
mate of the asymptotic covariance matrix of 
the estimate 0 of the parameter vector 0 = 
(a B y) in (1). Define X as the T X3 matrix 
of variables, T7. row ¢ is X,—[1, (, ,Y, 
Y.) G-,12,7 Y,.,)) and let &,—Y,— 
Y. — X,6. We cud the covariance ma- 
trix of 6, V(Ó), as: 


(2) rw(6)-(xx)'s(xx) 
where 


à s—1 ; 
(3 S=Q)+ Y (2,97), 
ja 


T 
(4) Qi 2 (uu; ,) X/X,_;, 


t=j+1 


®The error term in equation (1) could, of course, be 
serially correlated even for the one-period-ahead fore- 
casts. Such serial correlation does not appear to be a 
problem with any of the models we study here, however, 
and we have assumed it to be zero. 
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where Ê is. the ordinary least squares esti- 
mate of @ and s is the forecast horizon. 
When s equals 1 the second term on the 


` right hand side of (3) is zero, and the covari- 


ance matrix is simply Halbert White's (1980) 
correction for heteroskedasticity. 

Note that as an alternative to equation (1) 
we could have regressed the level (rather 
than the change) of GNP on the forecasted 
levels and a constant. GNP has a very strong 
low frequency component. It may be an 
integrated process, and any sensible forecast 
of GNP will be cointegrated with GNP it- 
self. The sum of £ and y will thus be con- 
strained in effect to one, and in the levels 
regression we would be estimating in effect 
one less parameter. If GNP is an integrated 
process, running the levels regression with an 
additional independent variable Y, , 
(thereby estimating f and y without con- 
straining their sum to one) is essentially 
equivalent to our differenced regression (1). 
: It should finally be noted that there are 
cases in which an optimal forecast does not 
tend to be singled out as best in regressions 
of the form (1), even with many observa- 
tions. Say the truth is Y, - Y, = aX, + e, 
Say that model 1 does rolling regressions of 
Y,— Y, on X, , and uses these regressions 
to forecast. Say that model 2 always takes 
the forecast to be bX, ,, where b is some 
number other than a, so that model 2 re- 
mains forever an incorrect model. In equa- 
tion (1) regressions the two forecasts tend to. 
be increasingly collinear as time goes on; 
essentially they are collinear -after the first 
part of the sample. Thus, the estimates of B 
and y.tend to be erratic. Adding a large 
number of observations does not cause the 
regressions to single out the first model; it 
only has the effect of enforcing that È + 


(76)/a =1. 
Il. Quasi Ex Ante Forecasts 


As noted above, we want forecasts from 
models that are based only on information 
through the period prior to the beginning of 
the forecast period (through period t — s for 
a forecast for period z). There are four ways 
in which future information can creep into a 
current forecast. The first is if actual values 
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of the exogenous variables for periods after 
t — s are used in the forecast. The second is 
if the coefficients of the model have been 
estimated over a sample period that includes 
observations beyond t—s. The third is if 
information beyond t—s has been used in 
the specification of the model even though 
for purposes of the tests the model is only 
estimated through period t — s. The fourth is 
if information beyond period t — s has been 
used in the revisions of the data for periods 
t — s and back, such as revised seasonal fac- 
tors and revised benchmark figures. `. 

The VAR, AC, and AR models discussed 
below have no exogenous variables, and .so 
there are no exogenous-variable problems 
for these models. The way we have handled 
the problem for the Fair model is to add 
autoregressivé equations for the exogenous 
variables to the model. For each exogenous 
‘variable in the model an eighth-order autore- 


gressive equation (with a constant term and . 
time trend included) has been postulated. 


When these equations are added to the 
model, the model effectively has no exoge- 
nous variables in it. This method of dealing 
with exogenous variables in structural mod- 
els was advocated by: Cooper and Nelson 
(1975) and Stephen K. McNees (1981). 
McNees, however, noted that the method 
handicaps the model: “It is easy to think of 
exogenous variables (policy variables) whose 
future values, can be anticipated or con- 
trolled with complete certainty even if the 


: historical values can be represented by co-. 


variance stationary processes; to do so intro- 


duces superfluous errors into the model solu- . 


tion” (McNees, 1981, p. 404). 
For the coefficient-estimate problem, we 
use rolling estimations for all the models. 


For the forecast for period t, we estimate the © 


model through period t — s; for the forecast 
for period t+1, we estimate the model 
through period t—s+1;. and.so on. By 
*model" in this case we mean the model 
inclusive of any exogenous-variable equa- 
tions. The beginning observation is held fixed 
for all the regressions; the sample expands 


~ . by. one observation each time a time period 


elapses. - 
The third mobiem — ihe possibility of us- 
ing information beyond period t— s in the 
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specification of the model—is more difficult 
to handle. Models are typically changed 
through time, and model builders seldom go 
back to or are interested.in “old” versions. 
We have, however, attempted to account for 
this problem in this paper regarding the Fair 
model. We consider the version of the Fair 
model that existed as of the second quarter 


` of 1976. 


We have done nothing about the data-re- 
vision problem in this paper. The data that 
have been used are the latest revised data. It 
would be extremely difficult to try to purge 


- these data of the possible use of future infor- 


mation, and we have not tried. Note that it 
is not enough simply to use data.that existed 


at any point in time (say period t — s) be- 


cause data on the s-period-ahead value 
(period ./) are needed. to estimate equation 
(1). We would have to try to construct data 
for period t that are consistent with the old 
data for EnS t—s. 


m. aes 
A. The Fair Model (FAIR) -- 


- The first version of the Fair model was 
presented in Fair (1976) along with the esti- 
mation method and method of forecasting 
with the model. This version was based on 
data through 1975 I. One important addition 


_ that was made to the model from this ver- 


sion was the inclusion of an interest rate 
reaction function in the model. This work is 
described in Ray C. Fair (1978), which is 
based on data through 1976 II. The version 
of the model ‘in Fair (1976) consists of 26 
structural stochastic equations, and with the 
addition of the interest rate reaction func- 
tion, there are 27 stochastic equations. There 
are 106 exogenous variables, and for each of 
these variables an eighth-order autoregres- 
sive equátion with a constant and time trend 
was added to the model. This gave a model 
of 133 stochastic equations, and this i is the 
version that was used., 

For the rolling estimations, the first esti- 
mation period ended in 1976 II, which is the 
first quarter in which the model could defi- 


' nitely be said to exist. This allowed the model 


to be estimated 40 times (through 1986 I). 
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Because the version of the Fair model 
used here existed as of 1976 II, because it 
has in effect no exogenous variables, and 
because it is estimated via rolling estimation, 
the forecasts from it can be said to be fore- 
casts that are truly based only on informa- 
tion through the period prior to the first 
period of the forecast (except for the data 
revision problem). This may be the first 
time that a large model this old has been 
tested. . 


B. The VAR Models (VAR4, VARAPI, 
VAR4P2, VAR4P3, VARI, 
and VAR2) 


We consider six VAR models in this pa- 
per. The first, VAR4, estimated by ordinary 
least squares, is the same as the model used 
in Christopher A. Sims (1980) except that we 
have added the three-month Treasury bill 
rate to the model. There are seven variables 
in the model: real GNP, the GNP deflator, 
the unemployment rate, the nominal wage 
rate, the price of imports, the money supply, 
and the bill rate. All but the unemployment 
rate and the bill rate are in logs. Each equa- 
tion consists of each variable lagged one 
through four times, a constant, and a time 
trend, for a total of 30 coefficients to esti- 
mate. 

The next three VAR models—VAR4P1, 
VAR4P2, and VAR4P3—have Bayesian pri- 


"This statement needs to be qualified slightly. Al- 
though the structural stochastic equations used for the 
Fair model are exactly as in Fair (1976) and (1978)— 
same left-hand side and right-hand side variables and 
same functional forms—the data revisions in the Na- 
tional Income Accounts since 1976 have required slight 
modifications to some of the identities in the model. 
Also, the identities in Fair (1976) for the government 
sector are for the total government sector, whereas in 
the version used here there are separate identities for 
the federal government sector and the state and local 
government sector. This disaggregation of the govern- 
ment sector does not affect anything except that it 
means that there are more exogenous variables (and 
thus more’ exogenous-variable equations) in the version 
used here than there were in Fair (1976). No structural 
stochastic equations need to be modified because of this 
disaggregation because the government variables do not 
appear as explanatory variables in these equations. The 
government variables appear only in the identities. 
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ors imposed on the coefficients of VAR4. We 
impose the Litterman prior that the variables 
follow. univariate random walks. The stan- 
dard deviations of the prior take the form 


(5) S(i, j,k) =yg(k) (4 J)(sj/s;), 


where i indexes the left-hand side variable, j 
indexes the right-hand side variables, and k 
indexes the lag. s; is the standard error of 
the unrestricted equation for variable i. 
VARAPI imposes parameter values that im- 
ply fairly loose priors. They are: 1) f(i, j) «1 
for all i and j, 2) g(k) —1 for all k, and 3) 
y 502. VAR4P2 imposes parameter values 
that imply much tighter priors: 1) f(i. /) «1 
for i= Jj, f(i, j) * 0.5 for i* j, 2) g(k)= 
k^, and 3) y 2 0.1. VAR4P3 is the same as 
VARAP2 except that f(i, j) 0.2 for i j, 
which implies even tighter priors than for 
VAR4P2. The parameter values for VAR4P2 
are those imposed by Litterman (1979, 
p. 49). 
The fifth VAR model, VAR2, uses only 
the first two lags of each variable, for a total 
of 16 coefficients in each equation. The sixth 
model, VARI, uses only each variable lagged 
once, for a total of 9 coefficients. No priors 
were imposed on VAR2 and VARI; they 
were estimated by ordinary least squares. 
Each VAR model was estimated 4C times 
using the same sample periods as were used 
for the Fair model. Each model was then 
used to make 40 forecasts of real GNP. 


C. The AC Models (AC-6, AC-13, 
AC-17, AC-48, AC-E6, AC-E13, 
AC-EI7, and AC-E48) ` 


Time-series models like VAR models typi- 
cally ignore the components of GNP. For 
example, the VAR models used in this paper 
contain no components. The current model 
used by Christopher A. Sims (serial) for 
forecasting includes only the component 
nonresidential fixed investment. Including 
manv components in a VAR model rapidly 
uses up degrees of freedom, and this is un- 
doubtedly one of the main reasons the com- 
ponents are seldom used. A possible alterna- 
tive to the VAR approach, but one that also 
does not use much economic theory, is to 
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model each of the components of real GNP 
by a simple autoregressive equation (but not 
real GNP itself) and then determine real 
GNP as the sum of the components, (i.e., by 
the GNP identity). 

The AC model may be regarded as an 
extreme caricature of large-scale macro- 
econometric models. Like the latter, lagged 
values of components are used to predict 
components, and the predicted components 
are added up to predict GNP. The AC model, 
however, treats all components in a simple 
and symmetrical way and carries the number 
of lags further than is often the case with 
large-scale models. Note also that the AC 
model could be adapted to forecast variables 
other than GNP. For any variable that is 
determined by an identity, autoregressive 
equations could be used to forecast the 
right-hand side variables and then the iden- 
tity used to forecast the variable itself. Also, 
the AC model for real GNP could be added 
to equations explaining other variables to 
create a more complete macroeconometric 
model. 

We have considered eight AC models in 
this paper, all estimated by ordinary least 
squares. The models are first distinguished 
by whether they include 6, 13, 17, or 48 
components. Increasing the disaggregation 
of the components allows one to examine 
how much additional useful information for 
forecasting purposes is contained in the more 
disaggregated components. Each equation 
for a component contains the first eight 
lagged values of the component, a constant, 
and a time trend. None of the AC equations 
is in log form. For the AC-6, AC-13, AC-17, 
and AC-48 models, these are all the variables 
included in the equations. For the AC-E6, 
AC-E13, AC-E17, and AC-E48 models (E 
for "extended"), the first four lagged values 
of real GNP are added to each equation. 
This allows for an impact of aggregate eco- 
nomic activity on each component and uses 
up only four degrees of freedom per equa- 
tion. The components used for each model 
are listed in the Appendix. The 17 compo- 


nent models (AC-17 and AC-E17) use the ` 


same components as does the Fair model. 
The same sample periods and procedures 
were used for the AC models as were used 
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for the Fair and VAR models except the for 
the 6, 13, and 48 component models the 
beginning quarter for the estimation periods 
was 1961 I rather than 1954 I. 

The AC models are of interest in two 
respects. First, if the Fair model turns out to 
dominate the VAR models (which it does), it 
is of interest to know if this is due simply to 
the fact that the Fair model is dealing with 
the lagged components of GNP. If it is as 
simple as this, then the AC models might do 
even better, and this can be tested. Second, 
the AC models are to some extent competi- 
tors of the VAR models within the class of 
atheoretical models, at least regarding the 
predictions of GNP. Both models are based 
on very little economic theory. It is thus of 
interest to see if one type of model domi- 
nates the other. 


D. The Autoregressive Models 
(AR4 and AR8) 


AR4 and ARS are simple benchmark 
models, estimated by ordinary least squares. 
For AR4 real GNP was regressed on its first 
four lagged values, a constant, and a time 
trend. For AR8 real GNP was regressed on 
its first eight lagged values, a constant, and a 
time trend. The same sample periods were 
used here as were used for the Fair and VAR 
models. 


IV. The Results 


The results comparing’ the Fair model to 
the other models are presented in Tables 1 
and 2. The sample period used for the one- 
quarter-ahead results is 1976 111—1986 II, for 
a total of 40 observations. The sample pe- 
riod for the four-quarter-ahead results is 1972 


This choice was dictated by the available data. 
Fortunately, the results do not appear to be sensitive to 
the use of the later beginning quarter. For the 17 
component model the estimation periods could begin in 
1954 I, and so two versions of AC-17 and AC-E17 were 
estimated, one for the periods beginning in 1954 I and 
one for the periods beginning in 1961 I. The two ver- 
sions gave very similar results. 
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TABLE 1— BIas AND RMSE For EACH MODEL’s FORECAST 


One-Quarter-Ahead Forecasts Four-Quarter-Ahead Forecasts 
Sample Period =1976 III-1986 II Sample Period = 1977 11-1986 II 
const SE DW RMSE const ‘SE DW RMSE 
FAIR 0.0034 0.00883 2.04 0.00945 0.0087 0.0146 1.04 0.0170 
(2.43) (3.60) 
VARI (—0.0014 0.01179 1.47 0.01188 -0.0127 0.0263 0.56 0.0292 
(0.78) (2.93) 
VAR2 —0.0021 0.01090 1.36 0.01111 .—0.0157 0.0285 0.40 0.0326 
(1.23) (3.33) 
VAR4 0.0012 0.01145 1.22 0.01152 -0.0011 0.0291 0.53 0.0291 
(0.68) (0.24) 
VAR4P1 0.0004 0.01105 1.41 0.01106 —0.0028 0.0291 0.43 0.0292 
(0.24) . — (0.59) 
VAR4P2 —0.0017 0.01004 1.68 0.01018 -—0.0105 0.0220 0:52 0.0244 
(1.07) (2.90) 
VAR4P3 —0.0010 0.00976 1.75 0.00981 -—0.0063 0.0231 0.43 0.0239 
(0.66) (1.65) 
AC-6 0.0016 0.01070 1.64 0.01081 0.0052 0.0251 0.42 0.0256 
(0.93) (1.26) 
AC-13 0.0007 0.01071 1.49 0.01073 0.0007 0.9259 0.38 0.0259 
(0.39) (038) . 
AC-17 0.0018 0.01080 1.56 0.01095 0.0031 0.0264 0.38 0.0266 
(1.06) (0.72) 
AC-48 0.0008 0.01141 118 0.01144 0.0019 0.0281 031 0.0282 
: (0.46) (0.41) 
^ AC-E6 0.0020 0.01047 1.96 0.01066 0.0068 6.0245 0.54 0.0254 
(1.23) (1.69) 
AC-E13 0.0005 0.00957 1.93 0.00958 0.0001 0.0222 0.52 0.0222 
(0.34) (0.02) 
AC-E17 0.0017 0.00936 2.25 0.00952 0.0035 0.0228 0.62 0.0231 
(1.18) (0.95) 
AC-E48 0.0009 0.00993 1.84 0.00997 0.0037 0.0266 0.50 0.0268 
(0.60) (0.84) 
AR4 0.0026 0.01023 2.02 0.01055 0.0134 0.0277 0.48 0.0308 
(1.60) (2.94) 
AR8 0.0028 0.01067 1.98 0.01102 0.0147 0.0286 0.52 0.0321 
(1.63) (3.13) 


Notes: const is the estimate of the constant term in a regression of the forecast error on 
the constant term. The forecast error is ,..,7,, — Y, for the one-quarter-ahead results and 
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t-statistics in absolute value are in parentheses. 
SE is the estimated standard error of the regression not adjusting for degrees of 


freedom. 


Y, for the four-quarter-ahead results. Y is the log of real GNP. 
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RMSE is the root mean squared error of the forecast. 


11-1986 II, for a total of 37 observations. 
Remember that each observation for a 
model’s forecast is based on a different set of 
coefficient estimates of the model—the 
rolling estimation. Remember also that for 
the Fair model all exogenous variable values 
are generated from the autoregressive equa- 
tions; no actual values are used. Finally, 
remember that in Table 2 the estimated stan- 
dard errors of the coefficient estimates are 
corrected for heteroskedasticity and (for the 


four-quarter-ahead results) for the. moving 
average process of the error term.? 


?In two cases for the four-quarter-ahead results the 
matrix V(8) was singular or nearly singular. Ia these 
two cases we assumed a second-order MA process for 
the error term instead of a third order, which solved the 
problem. Had this been a more widespread problem, we 
would have used one cf the estimators in Donalé W. K. 
Andrews (1987), but this seemed unnecessary given only 
two failures. The two failures are FAIR versus VAR4 
and FAIR versus VAR4P1. 
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TABLE 2— FAIR MODEL VERSUS THE OTHERS: ESTIMATES OF EQUATION (1) 
One-Quarter-Ahead Forecasts Four-Quarters-Ahead Forecasts 
Dependent Variable is Y, — Y, , Dependent Variable is Y, — Y,.., 
Sample Period —1976 111-1986 II Sample Period = 1977 11-1986 II 
Other FAIR OTHER FAIR OTHER 
Model const pifi- Y-i 1Y- SE DW const ,4fJ4,-Y 4,4 1,-Y 4 SE DW 
VARI — 0.0034 1.05 —0.10 0.00915 2.02 —0.0135 1.06 020 0.0134 1.66 
(1.20) (4.51) (0.38) (2.28) (6.78) (3.09) 
VAR2 — 0.0029 0.89 0.14 0.00912 2.02 -— 0.0135 1.08 0.18 0.0135 1.59 
(1.00) (3.47) (0.69) (2.08) (6.30) (2.66) 
VAR4 —0.0031 0.86 0.14 0.00908 1.93 — 0.0164 1.07 0.20 0.0130 1.62 
] (1.06) (3.53) (0.89) (2.72) (6.53) (3.31) 
VAR4P1 —0.0033 0.91 0.11 0.00914 2.02 — 0.0159 1.08 0.19 0.0132 1.64 
(1.07) (3.83) (0.70) (2.55) (6.32) (2.79) 
VARAP2  —0.0031 0.89 0.17 0.00913 2.06 -— 0.0145 1.04 0.29 0.0133 1.61 
(1.05) (3.56) (0.64) (2.47) (6.20) (2.76) 
VAR4P3 = — 0.0041 0.87 0.37 0.00910 2.08 — 0.0209 1.07 0.49 0.0135 1.57 
45 . (Q3) (0.89) (3.69) (6.70) (2.24) 
AC-6 — 0.0041 0.94 0.16 0.00913 2.10 — 0.0201 1.13 0.22 0.0140 1.50 
(1.11) (3.98) (0.67) (3.09) (6.89) (1.43) 
AC-13 — 0.0034 0.99 0.02 0.00918 2.04 - 0.0193 1.18 0.16 0.0144 1.45 
(0.92) (3.93) (0.05) . (2.39) (7.34) (0.58) 
AC27 -—0.0031 1.00 — 0.04 0.00917 2.03 — 0.0164 1.22 — 0.00 0.0145 1.42 
(0.84) (4.07). (0.14) (1.86) (8.11) (0.00) ; 
AC-48 — 0.0039 0.97 0.10 0.00915 2.04 —0.0191 1.19 0.14 0.0143 1.48 
(1.15) (4.00) (0.50) (2.85) (7.63) (0.85) 
C-E6 — 0.0042 0.91 0.20 0.00910 2.14 —0.0195 1.15 0.17 0.0142 1.51 
(1.23) (3.69) (0.80) (2.80) (7.05) (1.15) 
AC-E13 — 0.0043 0.81 0.40 0.00899 2.16 —0.0171 147 0.10 0.0145 1.41 
(L35). — (281) (1.05) (2.47) (6.30) (0.41) 
AC-E17 — 0.0045 0.76 0.42 0.00894 2.31 — 0.0168 1.20 0.04 0.0145 1.42 
(1.41) (2.79) (1.50) ^ (2.50) (6.19) (0.16) 
AC-E48 — 0.0047 0.86 0.36 0.00894 2.21 -— 0.0168 1.21 0.03 0.0145 1.43 
(1.42) (3.64) (1.65) (2.62) (7.16) (0.19) 
ARA — 0.0084 0.96 0.59 0.00885 2.47 — 0.0168 1.22 0.01 0.0145 1.42 
(2.01) (4.51) (1.87) (1.12) (9.24) (0.03) 
AR8 — 0.0067 0.98 0.38 0.00901 2.32 —0.0144 1.22 —0.05 0.0145 1.41 
(1.72): (4.48) (1.33) (1.70) (9.54) (0.27) 








Notes: Y = log of real GNP. 
t-statistics in absolute value are in parentheses. 
See text for discussion of estimation methods. 


The bias and RMSE for each forecast are 
presented in Table 1.1? The bias is estimated 
by regressing the forecast error (predicted 
change minus actual change) on a constant. 
If the constant term is zero in this regression 
then the standard error (SE) of the regres- 
sion is the same as the RMSE; otherwise the 
RMSE is larger. The errors are roughly in 


or reference purposes, the predicted values and 
errors for three models—Fair, VAR4P3, and AC- 
E17— are presented in Table A in the appendix. - 





percentage points (0.01 is a 1 percent error) 
because real GNP is in logs. For the one- 
quarter-ahead results the Fair model has the 
largest bias, but even this bias is only 0.34 
percent. It also has the smallest RMSE, al- 
though a number of RMSEs are quite close. 
The second best model in terms of RMSE is 
AC-E17. The best VAR model is VAR4P3. 
For the four-quarter-ahead results a number 
of the estimated biases are significant. The 
Fair model has by far the smallest RMSE. 
The best VAR model is again VAR4P3, and 
the best AC model is now AC-E13. AC-E13 
and AC-E17 are better than VAR4P3. 
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Consider now the results in Table 2. The 
coefficient estimate for the Fair model fore- 
cast is always significant at the 5 percent 
level for both the one-quarter-ahead and 
four-quarter-ahead results. None of the co- 
efficient estimates for the other models’ fore- 
casts is significant at this level for the one- 
quarter-ahead results. The regression gives a 
` fairly large weight to the AR4 and AC-E17 
forecasts, although these are not quite statis- 
tically significant. For the four-quarter-ahead 
results the only significant estimates (aside 
from those for the Fair model) are for the 
VAR models. The results across the different 
VAR models are fairly close, with perhaps 
VAR4 performing the best. 

The results in Table 2 are thus rather 
striking. They provide strong support for thé 
hypothesis that the Fair model carries useful 
. information not in the other models. The 

'significance of the VAR forecasts for the 
four-quarter-ahead results ‘indicates that 
some information is in the VAR forecasts 
that the Fair model is not using for the 
four-quarter-ahead forecasts, but this is the 
only significantly negative aspect of the re- 
sults for the Fair model. It perhaps indicates 
some dynamic misspecification for the Fair 
model. 

Comparing Tables 1 and 2 shows one of 
the advantages of our procedure over the 
RMSE procedure. For the one-quarter-ahead 
results in Table 1 the RMSEs are all fairly 
close, and even though the RMSE is smallest 
for the Fair model, it is only slightly smaller. 
One might conclude from this table that the 
models are all about the same. The results in 
Table 2, on the other band, show that the 
Fair model dominates the others by a fairly 
large amount. Conversely, the four-quarter- 
ahead results in Table 1 show the Fair model 
dominating the others by a large amount, 
but the results in Table 2 show that the VAR. 
forecasts contain information not in the Fair 
forecasts. One would not have known this 
from Table 1. 

Table 3 goes on to compare the VAR 
models with the AC and AR models. It is 
hard from Table 2 to pick out which AC and 
VAR model performs the best because the 
models are so dominated by the Fair model, 
but Table 3 provides more ability to discrim- 
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inate. Regarding the AC models, the best 
results in terms of significant coefficient esti- 
mates are obtained for the 13 and 17 compo- 
nent versions and for the extended (AC-E) 
versions. In other words, going from 6 to 13 
or 17 components does help, but going be- 
yond this does not seem to add further use- 
ful informaticn, and adding the lagged val- 
ues of real GNP to the equations seems to 
add useful information. 

The best performing AC model in Table 3 
is probably AC-E17, and so consider the 
comparisons cf AC-E17 with the VAR mod- 
els. AC-E17 performs better than any VAR 
model for the one-quarter-ahead results. No 
VAR model coefficient estimate is significant 
for these comparisons, although some are 
close to being significant. For the four- 
quarter-ahead results the VAR models per- 
form about as well as,does AC-E17. The best 
fit is for AC-E17 versus VAR4P2, where the | 
t-statistic for VAR4P2 is 3.73 and the t-sta- 
tistic for AC-E17 is 3.41. In other words, 
both AC-E17 and the VAR models appear 
to contain independent information useful 
for forecasting four quarters ahead. Compar- 
ing across the VAR models, VAR4P2 and 
VAR4P3 are probably the best, although the 
results are quite close across the models. The 
results in Table 3 also show that the VAR 
models dominate the AR models. Clearly the 
VAR models contain information not in the 
AR models, but not vice versa. 

The results in Table 3 thus indicate that 
AC models like AC-E17 contain useful fore- 
casting information not contained in even . 
the best VAR model. In other words, there 
appears to be useful forecasting information 
in the components of GNP that is not cap- 
tured in the VAR models, and so within the 
class of fairly atheoretical models, AC mod- 
els appear to be useful alternatives to the 
VAR models. This conclusion is strength- 
ened by the fact that for the one-quarter- 
ahead forecasts the VAR models appear to 
contain little information not already in AC- 
E17. 

Regarding the AC models, note from Table 
2 that the AC forecasts are not statistically 
significant at the 5 percent level when com- 
pared with the Fair forecasts. Although the 
AC-E17 forecast gets a weight of 0.42 for the 


TABLE 3—VAR MoDzLs VERSUS AC AND AR MODELS: ESTIMATES OF EQUATION (1) 
C = CONSTANT, V = VAR MODEL, Á = AC or AR MODEL 


ONE-QUARTER-AHEAD FORECASTS DEPENDENT VARIABLE IS Y, ~ Y, ., SAMPLE PERIOD = 1976 111-1986 II 











VARI VAR2 VAR4 VAR4P1 VAR4P2 
c V^. A c V A c V A c V A Cc. V A 





AC-6 

AC-13 
AC-17 
AC-48 
AC-E6 
AC-EL 


AC-El 


^ VAR4P3 
C V- A 





0.0029 0.14. 0.37 0.0034 0.39 0.17 0.0019 0.36 0.23 0.0022 0.36 0.23 -0.0025 0.56 0.16 
(0.98) (0.59) (1.94) (1. o. Ec (0.74) (0.62) (2.05) (0.97) (0.70) (2.21) (0.99) (0.83) (2.24) (0.71) 
{0.01057} 01014] [0.01003] {0.01020} [0.01011] 


0.0035 0.21 0.28 0.0039 0.43 0.10 0.0023 0.38 0.18 0.0023 0.40 0.21 0.0028 0.60 0.11 
(0.95) (0.94) (0:74) (1.12) (2.39) (0.25) (0.66) (2.13) (0.44) (0.64) .(2.37) (0.51) (0.78) (2.38) (0.29) 
[0.01068] [0.01018] [0.01008] [0.01024] [0.01013] 


0.0026 0.25 0.32 0.0027 0.43 023 0.0014 0.39 0.25 0.0011 0.42 031 0.0016 0.61 0.24 
(0.67) (1.05) (1.05) (0.74) (2.49) (0.75) (0.39) Q 3n (0.82) (0.29) (2.60) (1.01) (0.41) (2.50) (0.80) 
[0.01065] {0.01013} [0.01005] [0.01019] (0.01008] 


0.0039 0.23 0.20 0.0040 0.43 0.10 0.0026 0.39 0.13 0.0027 0.40 0.14 0.0025 0.60 0.16 
(1.48) (1.00) (1.09) (1.69) (2.38) (0.50) (1. m on (0.69) (1.06) Q 43) (0.73) (0.92) a (0.81) 
[0.01067] [0.01017] 01007] [0.01024] (0.0: 


0.0019 0.14 0.48 0.0022 0.36 0.32 0.0012 033 0.33 0.0014 0.32 0.34 0.0014 0.50 0:31 
(0.57) (0.55) (1.89) (0.69) a. 91) (1.31) (0.39) a. m (1.27) (0.45) a. 81) (1.29) (0.44) a. 96) (.27) 
[0.01037] [0.01000] [0.00! [0.01009] [0.00997] 


3 0.0000 0.13 0.82 0.0003 0.31 0.69 —0.0007 0.30 0.70 —0.0005 028 0.71 —0.0003 0.42 0.67 
(0.03) (0.58) (2.54) ma: ES m (212) (021) a. 3D (1.99) (0.14) (1.86) (2.06) (0.07) (1.85) (2.03) 
[0.00988] (0.00946] [0.00963] [0.00958] 


7 —0.0008 0.09 0.83 —0.0005 0.27 0.70 —0.0015 0.28 0.71 —0.0011 0.24 0.72 —0,0009 0.36 0.96 
(0.22) (0.40) (3.26) (0. - et (2.89) em. Eo m (2.66) (0.35) a 6n (2.73) (027) (1.54) (2.74) 
[0.00967] 944] [0.00949] [0.00946] 


AC-E48 0.0011 0.14 0.64 0.0013 0.32 0.51 0.0003 0.31 0.51 0.0006 0.29 0.52 0.0004 0.47 0.51 


AC-6 


AC-13 


AC-17 


AC-48 


AC-E6 


AC-El 


AC-El 


AC-EA 


(0.35) (0.63) (3.29) (0.50) Ms 21 (2.63) (0.11) (1.82) (2.36) (0.21) (1.83) (2.51) (0.15) (2.05) (2.75) 
[0.01009] {0.00978} [0.00967] [0.00985] . [0.00971] 


—0.0003 0.21 0.63 0.0004 0.39 0.48 —0.0021 0.38 0.62 —0.0005 0.36 0.49 0.0004 0.54 0.39 
(0.07) (0.88) (1.65) (0.11) (2.17) (1.31) (0.52) (2.44) (1.69) (0.11): (2.17) (1.26) (0.10) (2.03) (1.00) 
[0.01044] [0.01001] {0.00978} {0.01010] [0.01003] 


0.0021 0.23 0.36 0.0025 0.42 0.24 0.0003 0.39 0.35 0.0017 0.40 0.23 0.0023 0.59 0.15 
(0.50) (0.94) (0.96) (0.63) (2.31) (0.67) (0.08) (2.36) (0.97) (0.43) (2.28) (0.60) (0.53) (3.10) (0.55) 
[0.01063] [0.01013] [0.00999] [0.01023] [0.01012] 


0.0124 0.45 0.25 0.0127 0.37 031 0.0073 0.36 0.32 0.0082 0.36 030 0.0095 0.69 0.19 
027 (1. 88) (0.59) (1.28) (1.61) (0.74) (0.82) (1.91) (0.79) (0.91) (1.71) (0.73) (1.05) (1.87) (0.43) 
[0.0230] [0.0236] [0.0236] [0.0237] 10.0222] 


0.0142 0.51 0.19 0.0164 044 0.19 0.0100 0.43 0.21 0.0107 0.43 021 0.0110 0.75 043 
(1.04) (2.34) (0.32) (1.13) (2.00) (0.30) (0.79) (2.32) (0.33) (0.82) (2.15) (0.32) (0.86) (2.27) (0.21) 
[0.0232] 10.0240] [0.0240] [0.0240] [0.0223] 


0.0049 0.50 0.49 0.0079 0.44 0.47 0.0020 0.42 0.46 0.0024 0.43 0.47 0.0016 0.72 0.45 
(0.42) (3.66) (1.18) (0.64) (2.93) (1.04) E" Eo s (1.03) (0.20) (3.38) (1.04) (0.14) (3.35) (1.06) 
[0.0225] [0.0235] 35] [0.0235] [0.0218] 


0.0199 0.56 ~0.04 0.0234 0.52 —0.09 0.0163 0.51 —0.10 0.0170 0.51 —0.09 0.0155 0.82 —0.07 
(1.77) (2.48) (0.09) (1.93) (2.18) (0.18) (1. 5 E9 (0.20) (1.59) Q 31) (0.18) (1.49) (2.40) (0.16) 
[0.0233] [0.0219] [0.0241] [0.0241] ; [0.0224] 


0.0087 0.42 0.36 0.0092 0.35 0.40 0.0044 0.34 0.40 0.0051 0.34 0.40 0.0064 0.64 0.30 
(0.78) (2.53) (1.19) (0.79) (2.14) (1.40) (0.39) (2.49) (1.37) (0.45) a 2 (1.33) (0.59) (2.38). (0.93) 
[0.0226] [0.0232] [0.0233] [0.0: [0.0219] 


3 0.0003 0.39 0.78 0.0016 0.33 0.81 —0.0027 0.31 0.80 —0.0025 0.32 0.81 ~0.0020 0.57 0.73 
(0.03) (4.02) (3.17) OP e 69) (2.92) (0.22) (3.35)- (2.77) (0.21) (3.25) (2.86) (0.19) (3.80) (3.19) 
[0.0199] 0.0207] 10.0209] [0.0207] [0.0195] 


7 —0.0015 0.42 0.74 —0.0003 0.36 0.76 —0.0049 0.34 0.76 —0.0044 0.34 0.76 —0.0036 0.60 0.69 
(014) (3.97) (3.52) ©. P. A n (3.15) (0.42) (3.43) (2.95) (0.39) (3.31) (3.00) (0.37) (3.73) (3.41) 
[0.0201] [0.0210] (0.0210). —— [0.0197] 


8 0.0111 0.47: 0.29 0.0131 0.40 0.30 0.0081 0.39 0.28 0.0083 0.39 029 0.0078 0.69 0.25 
(1.24) (3.35) (120) (L. uu e 66) (1.19) (0.87) (3.05) (1.06) (0.91) (2.90) (1.08) (0.87) (2.99) (0.98) 
. * {0.0226} 34] 10.0236] {0.0235} [0.0219] 


—0.0004 0.94 021 
(0.10) (2.39) (0.93) 
[0.01008] 


—0.0004 1.01 0.19 
(010) (2.52) (0.49) 
[0.01011] 


—0.0018 1.05 0.30, 
(0.38) (2.66) (0.99) 
[0.01005] 


~ 0.0008 1.04 0.22 
(0.22) (2.68) (1.14) 
[0.01004] 


~0.0011 0.87 0.33 
(0.27) (2.19) (1.39) 
(0.00995] 


—0.0023 0.73 0.68 
(0.57 (2.05) (2.10) 
[0.00957] 


—0.0026 0.62 0.69 


(0.67) (1.74) (2.87) 
[0.00945] 


—0.00019 0.82 0.52 
(0.50) (2.27) (2.88) 
[0.00969] 


0.0014 0.92 0.31 
(0.32) (2.03) (0.76) 
. [0.01008] 


0.0002 1.05 0.06 


(0.06) (2.29) (0.15) 
[0.01014] 


—0.0052 1.08 0.34 
(0.49) (1.57) (0.86) 
[0.0230] 


—0.0059 117 02 
(0.50) (1.90) (0.55) 
10.0234] 


—0.0155 122 0.59 
(1.22) - (2.87) (1.39) 
[0.0226] 


—0.0013 1.28 0.07 


(011) (197) (0.15) 
[0.0237] 


—0.0075 0.59 0.42 
(0.61) (1.96) (1.41) 
10.0226] 


—0.00145 0,96 0.82 
(114) (3.12) (3.70) 
[0.0199] 


~0.0171 1.01 0/77 
(1.47) (320) (3.81) 
[0.0202] 


—0.00604 112 0.34 
(0.55) (2.48) (1.34) 
[0.0227] 





AR4 0.0075 0.54 0.29 0.0069 0.48 0.36 —0.0017 0.46 0.41 0.0018 0.47 0.34 0.0106. 0.78 0.09 ~0.0044 131 0.11 
(0.36) (3.94) (0.58) (0. x ac 32) (0.68) (0. m it 15) (0.74) qe E 77) (0.59) (0.48) (3.70) (0.16) (0.19) (2.71) (0.16) 
[0.0232] 240] 240] 40] [0.0224] [0.0237] 
ARS 0,0212 0.55 —0:06 0.0162 0.49 0.12 0.0075 0.47 0.17 0.0124 0.47 0.07 0.0218 0.80 —0.20 0.0086 1.36 —023 
(113) (408) (0.14) (0: ug Es (0.32) (0.41) (4.13) (0.42) (0.65) {3.78} (0.16) (1.23) (3.97) (0.49) (0.46) ^ (2.89) (0.47) 
[0.0233] 0.0241] - [0.0241] [0.0241] [0.0223] [0.0236] 
Notes: Y = log of real GNP. 


t-statistics in absolute value are in parentheses. 
Estimated standard errors of the regressions are in brackets. 
Sec text for discussion of estimation methods. 


Va Yu 7 Y, for one-quarter-ahead results for VAR models. 


Vayu 2 —4 for four-quarter-ahead results for VAR. models. 
A mimi u 7 Ya for one-quarter-ahead results for AC and AR models. 
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one-quarter-ahead results in Table 2, it is not 
statistically significarit, and for the four- 
quarter-ahead results the weight on the AC- 
E17 forecast is much smaller. These results 
may be interpreted as indicating that the 
Fair model captures most of the information 
in the components of GNP. 


V. Conclusion 


The procedure uséd in this paper for ex- 
amining models appears to be useful in com- 
paring the different models. Using this pro- 
cedure we have learned that the Fair model 
does very well relative to the other models. 
The Fair model cannot be dismissed as being 
based on the same information used in the 
other forecasts. The fact that the forecasts 
from the Fair model are significant shows 
. that they are not collinear with the other 

forecasts and that the differences between 
the Fair model and the other models are 
meaningful. 

We have also learned that information 
about components matters. The information 
about components of the kind incorporated 
in the AC models does help improve fore- 
casts when compared with the VAR and AR 
models, but it is at best of modest benefit in 
improving the Fair model forecasts. In this 
sense the useful information in the Fair 
model not in the VAR or AR models in- 
cludes information about components of 
GNP. We have also learned something about 
how to combine forecasts. While it appears 
that the VAR and AC forecasts do not con- 
tain a lot of information not in the Fair 
model forecasts for one-quarter-ahead fore- 
casting, it may be that a combination of the 
forecasts from the Fair and. VAR models is 
useful for four-quarter-ahead forecasting. 
While the AC model was dominated by the 
Fair model, it clearly contains information 


not in the VAR model. The. VAR models . 


seem to be losing useful information by ig- 
noring the components of GNP and the GNP 
identity. 

We should conclude with a warning about 
the interpretation of our results. The fact 
that one model does well or poorly for one 
sample period (in our case 1976 III-1986 IT) 
does not necessarily mean that it will do well 
or poorly in future sample periods. The re- 
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sults could change if the structure of the 
economy is changing, which is, of course, 
true of any econometric result. In our case, 
however, the results could also change if the 
magnitudes of the forecast errors of the dif- 
ferent models are changing at different rates 
through time. The errors could, for example, 


. be changing at different rates because the 


data are providing different rates of im- 
provement of the models' parameters. 


APPENDIX 
Components of AC Models 


AC-6: 

. Personal consumption expenditures 

. Gross private “ixed investment : 

. Change in business inventories 

. Government purchases of goods and services 
. Exports 

. Imports 


ON t^ 4 CO P2 rS 


AC-13: 

. Personal consumption expenditures, durable goods 
. Personal consumption expenditures, nondurable 
. goods i 

3. Personal consumption expenditures, services 

4. Gross private fixed investment, nonresidential 

5. Gross private fixed investment, residential 

6. Change in business inventories, nonfarm 
7 
8 


Ne 


. Change in business inventories, farm 
. Government purchases of goods, federal 
9. Government purchases of goods, state and local 
10. Government purchases of services, federal 
11. Government purchases of services, state and local 
12. Exports ' 
13. Imports 


AC-17: 
1. Personal consumption expenditures, durable goods 
2. Personal consumption expenditures, nondurable 
goods 

3. Personal consumption expenditures, services 

4. Gross private fixed investment, nonresidential, firm 
sector 

5. Gross private fixed investment, nonresidential, fi- 
nancial sector 

6. Gross private fixed investment, nonresidential, 
household secter 

7. Gross private fixed investment, residential, firm sec- 
tor 

8. Gross private fixed investment, residential, financial 
sector 

9. Gross private fxed investment, residential, house- 
hold sector ; yim 

10. Change in business inventories, firm sector 

11. Change in business inventories, household sector 

12. Government purchases of goods, federal 

13. Government purchases of goods, state and local 

14. Government purchases of services, federal 

15. Government purchases of services, state anc local 

16. Exports 


E 
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17. Imports : 

Note: See Ray C. Fair (1984) for the definitions of firm, 
financial, and household sectors. This breakdown is 
from the Flow of Funds Accounts. 


AC-48: 
Personal consumption expenditures, durable goods: 
1. Motor vehicles and parts 
2. Furniture and household equipment 
3. Other 


Personal consumption expenditures, nondurable goods: 
4. Food 

5. Clothing and shoes 

6. Gasoline and oil 

7. Fuel oil and coal 

8. Other 


Personal consumption expenditures, services: 
9. Housing 

10. Household operation, electricity and gas 

11. Household operation, other 

12. Transportation 

13. Medical care 

14. Other 


Gross private fixed investment: 

15. Nonresidential structures 

16. Nonresidential producers’ durable equipment 
17. Residential 


Change in business inventories: 

18. Farm 

19. Nonfarm, manufacturing, durable goods 

20. Nonfarm, manufacturing, nondurable goods 

21. Nonfarm, merchant wholesalers, durable goods 

22. Nonfarm, merchant wholesalers, nondurable goods 
23. Nonfarm, nonmerchant wholesalers, durable goods 
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24. Nonfarm, nonmerchant wholesalers, nondurable 
goods 

25. Nonfarm, retail trade, durable goods 

26. Nonfarm, retail trade, nondurable goods 

27. Nonfarm, other, durable goods 

28. Nonfarm, other, nondurable goods 


Government purchases of goods and services, federal: 

29. Durable goods 

30. Nondurable goods 

31. Services, compensation of employees, national de- 
fense, military 

32. Services, compensation of employees, national de- 
fense, civilian 

33. Services, compensation of employees, nondefense 

34. Services, other services 

35. Structures 


Government purchases of goods and services, state and 
local: 

36. Durable goods 

37. Nondurable goods 

38. Services, compensation of employees 

39. Services, other services 

40. Structures 


Exports of goods and services: 
41. Merchandise, durable goods 


:42. Merchandise, nondurable goods 


43. Services, factor income 
44. Services, other 


Imports of goods and services: 

45. Merchandise, durable goods 
46. Merchandise, nondurable goods 
47. Services, factor income 

48. Services, other 


TABLE A— PREDICTED VALUES AND ERRORS FOR THREE MODELS 
ONE-QUARTER-AHEAD FORECASTS 


Change 


i Actual Forecast 

Quarter Change Change Error 
1976.3 0.0042 0.0115 0.0073 
4 0.0099 0.0104 0.0004 
1977.1 0.0136 0.0157 0.0021 
2 0.0159 0.0136 — 0.0023 
3 0.0200 0.0122 — 0.0078 
4 — 0.0027 0.0116 0.0143 
1978.1 0.0088 0.0105 0.0016 
2 0.0310 0.0103 - — 0.0207 
3 0.0087 0.0112 0.0025 
4 0.0123 0.0066 — 0.0057 
1979.1 0.0000 0.0049 0.0048 
2 — 0.0010 0.0102 0.0112 
3 0.0091 0.0087 — 0.0003 
4 — 0.0019 0.0076 . 0.0095 
1980.1 0.0099 0.0009 — 0.0090 
T2 — 0.0238 0.0055 ` 0.0293 
3 0.0006 0.0133 0.0127 
4 0.0127 0.0188 0.0062 


VAR4P3 AC-E17 
Forecast Forecast 
Change Error Error 
0.0050 0.0008 0.0147 0.0105 
0.0043 — 0.0056 0.0106 0.0007 
0.0051 — 0.0085 0.0221 0.0085 
0.0060 ~ 0.0099 0.0150 — 0.0010 
0.0061 — 0.0139 0.0152 — 0.0048 
0.0066 0.0093 0.0124 0.0151 
0.0038 — 0.0051 0.0092 0.0004 
0.0033 — 0.0277 0.0019 — 0.0291 
0.0054 — 0.0033 0.0142 0.0056 
0.0042 — 0.0081 0.0110 — 0.0013 
0.0032 0.0031 0.0043 0.0042 
0.0010 0.0020 0.0062 0.0073 
0.0001 — 0.0090 0.0017 — 0.0073 
0.0013 0.0032 0.0074 0.0093 
0.0002 — 0.0097 0.0059 — 0.0040 
0.0005 0.0243 0.0054 0.0291 
— 0.0005 — 0.0010 0.0008 0.0002 
0.0017 0.0051 — 0.0076 


— 0.0109 
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TABLE A— CONTINUED 





FAIR. VARAP3 AC-E17 
Actual Forecast Forecast Forecast 

Quarter "Change Change Error Change Error Change Error 
1981.1 0.0191 0.0059 — 0.0132 0.0020 —0.0171 0.0101 —0.0090 
: 2 — 0.0033 0.0021 0.0054 0.0036 0.0069 0.0032 0.0065 
3 0.0045 0.0030 — 0.0014 0.0023 — 0.0022 0.0084 0.0039 
4 —0.0140 — 0.0031 0.0109 0.0022 0.0163 — 0.0005 0.0135 
1982.1 — 0.0153 —0.0017 0.0136 0.0026 0.0179 0.0024 0.0177 
2 0.0030 0.0052 0.0022 0.0021 —0.0009 — 0.0008 --0.0037 
3 — 0.0080 0.0044 0.0124 0.0048 0.0128 0.0049 0.0129 
4- ) 0.0016 0.0034 - 0.0019 0.0060 0.0044 0.0018 0.0002 
1983.1 0.0085 0.0229 0.0143 0.0081 — 0.0004 0.0200 0.0115 
2 0.0223 0.0198 — 0.0025 0.0099 — 0.0124 0.0158 — 0.0065 
3 0.0147 0.0190 0.0044 0.0125 — 0.0021 0.0136 — 0.0011 
4 0.0177 0.0125 —0.0051 0.0119 — 0.0058 0.0191 0.0014 
1984.1 0.0233 0.0154 — 0.0079 0.0112 — 0.0121 0.0146 — 0.0087 
2 0.0123 0.0078 — 0.0044 0.0113 — 0.0010 0.0078 —0.0044 
3 0.0057 0.0072 0.0015 0.0100 0.0042 0.0073 0.0016 
4 0.0037 0.0070 0.0033 0.0079 0.0043 0.0076 0.0039 
1985.1 0.0076 0.0128 0.0053 0.0079 0.0003 0.0032 , —0.0044 
2 0.0058 0.0131 0.0073 0.0084 0.0026 0.0072 0.0014 
3 0.0101 0.0125 0.0024 0.0093 — 0.0008 0.0044 — 0.0056 
4 0.0052 - 0.0145 0.0094 0.0107 0.0055 0.0094 0.0042 
1986.1 0.0092 0.0179 0.0087 0.0103 0.0011 0.0074 ~~ 0.0018 
2 0.0027 0.0139 0.0112 0.0107 0.0080 0.0031 0.0004 

Four-Quarter-Ahead Forecasts 
FAIR VAR4P3 AC-E17 
Actual Forecast Forecast Forecast 

Quarter Change Change Error Change Error Change Error 
1977.2 0.0436 0.0559 0.0122 0.0163 — 0.0274 0.0524 0.0088 
3 0.0595 0.0569 — 0.0025 0.0161 — 0.0434 0.0548 ~ 0.0047 
4 0.0468 0.0693 0.0224 0.0189 — 0.0279 0.0695 0.0226 
1978.1 0.0421 0.0539 0.0118 0.0205 —2.0216 0.0482 0.0061 
2 0.0571 . 0.0492 — 0.0079 0.0192 — 0.0379 0.0429 ~ 0.0142 
3 0.0458 0.0467 0.0010 0.0196 — 9.0261 0.0321 — 0.0136 
4 0.0608 0.0437 — 0.0171 0.0131 — 0.0477 0.0218 ~ 0.0389 
1979.1 0.0520 0.0402 —0.0117 0.0122 — 0.0398 0.0234 —0.0286 
2 0.0200 0.0364 0.0165 0.0156 — 0.0043 0.0293 0.0094 
3 0.0204 0.0375 0.0171 0.0125 — 0.0078 0.0222 0.0018 
4 - 0.0062 0.0267 0.0205 0.0091 0.0029 0.0181 0.0120 
1980.1 0.0161 0.0337 0.0177 0.0042 —0.0118 0.0145 —0.0016 
2 —0.0067 . 0.0245 0.0311 0.0028 0.0095 ‘ 0.0096 0.0163 
3 — 0.0152 0.0213 0.0365 0.0066 0.0218 0.0208 0.0359 
4 —0.0006 0.0158 0.0164 0.0017 0.0023 0.0276 . 0.0282 
1981.1 0.0086 0.0151 0.0066 0.0015 — 0.0071 0.0322 0.0237 
2 0.0290 0.0315 0.0025 0.0084 — 0.0206 0.0058 ~ 0.0233 
3 : 0.0329 0.0494 . 0.0165 0.0142 — 0.0187 0.0263 — 0.0066 
4 0.0062 0.0159 0.0097 0.0075 0.0013 0.0335 0.0273 
1982.1 —0.0282 0.0113 0.0395 0.0105 0.0387 0.0275 0.0557 
2 —0.0219 0.0081 0.0300 0.0092 0.0311 0.0241 0.0460 
3 —0.0344 — 0.0154 0.0190 0.0102 0.0446 0.0104 0.0448 
4 — 0.0188 0.0040 0.0227 0.0177 - 0.0364 0.0223 0.0411 
1983.1 0.0050 0.0155 0.0105 2.0169 (6.0119 0.0270 0.0220 
2 0.0244 0.0164 —0.0080 . 0.0249 0.0006 0.0317 0.0073 
3 0.0471 0.0227 —0.0244 0.0296 — 9.0174 0.0273 — 0.0197 
4 0.0632 0.0788 0.0156 0.0368 — 0.0264 0.0563 — 0.0069 
1984.1 0.0779 0.0680 — 0.0099 0.0394 ~ 0.0385 0.0516 — 0.0264 
2 0.0679 0.0621 — 0.0058 0.0447 — 0.0231 . 0.0581 |: —0.0098 
3 i 0.0590 - 0.0486 — 0.0104 0.0407 — 0.0182 0.0583 — 0.0006 
4 0.0450 0.0501 0.0052 0.0374 — 0.0076 0.0385 — 0.0064 
1985.1 0.0292 0.0309 0.0017 0.0364 0.0071 0.0205 — 0.0087 
2 0.0228 0.0230 0.0002 0.0338 0.0110 0.0191 — 0.0037 
3 0.0271 0.0246 — 0.0025 0.0298 0.0026 0.0106 — 0.0165 
4 0.0286 0.0416 0.0130 0.0327 0.0041 0.0105 — 0.0181 
1986.1 0.0303 0.0407 0.0104 0.0348 0.0045 0.0147 — 0.0156 


2 0.0272 0.0415 0.0143 0.0382 0.0110 0.0134 — 0.0137 
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Money, Output, and the Nominal National Debt 


By BRUCE CHAMP AND SCOTT FREEMAN* 


This paper presents a model of finitely lived rational agents in which unantici- 
pated innovations in the stock of fiat money affect real variables. An unantici- 
pated inflation reduces the real value of the nominally denominated national debt, 
thereby reducing the crowding-out of capital and/or the tax burden. Both effects 
_ Stimulate increased investment in capital, which leads to an increase in real output 
and wages in the following periods. In contrast with price-surprise models, these 
real effects occur even if the monetary innovation is instantly and perfectly 


observed by agents. (JEL 311) 


The possibility that innovations in the 
. monetary base are correlated with innova- 
tions in important real variables raises dif- 
ficult yet fascinating questions of theory and 
policy for macroeconomists committed to 
the rigor imposed by the assumption of 
choice-theoretic models of rational agents. 
How might the printing of intrinsically use- 
less pieces of paper be linked to the real 
output of the nation’s workers and ma- 
chines? Might there be a way to systemati- 
cally exploit real effects of the monetary 
base? 

These questions were first addressed by a 
choice-theoretic model with the publication 
of “Expectations and the Neutrality of 
Money” (Robert Lucas, 1972). Following the 
lead of this greatly influential paper, recent 
studies of the effects of monetary forces on 
real economic aggregates have often focused 
on the reaction of agents who have misinter- 
preted nominal price shocks as shocks to the 
real demand for their output. While the 
Lucas model can account for a difference in 
- the correlations of anticipated and unantici- 
pated inflation with real variables, the as- 
sumption in price-surprise/information-con- 
fusion models that agents lack information 
on the current money stock and price level is 
inconsistent with the ready availability of 


*Respectively, Department of Economics, University 
of Western Ontario, London, Ontario N6A 5C2, and 
Department of Economics, University of California, 
Santa Barbara, CA. 93106. 
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current information on prices and monetary 
aggregates, most of which are reported with 
delays of only a week or a month. We are 
led, therefore, to look for alternative expla- 
nations for the transmission of nominal 
monetary innovations to real variables. 

Another means by which monetary policy 
can conceivably affect real variables is 
through the real revenue raised by the print- 
ing of fiat money. If this seigniorage is ap- 
plied to the purchase of government bonds 
in an economy where government bonds are 
net wealth, money creation will reduce the 
real interest rate, stimulating investment and 
output. The use of seigniorage as a means of 
reducing the burden of the national debt is 
explored in Lloyd Metzler (1951), Preston 
Miller (1981), Thomas Sargent and Neil 
Wallace (1981), Wallace (1984), and Douglas 
Waldo (1985). However, the revenue from 
the tax on the monetary base is a small 
fraction of the size of total investment, out- 
put, and government revenue in most indus- 
trialized countries! One wonders how this 
small tax can account for major changes in 
output. 

A revenue effect of inflation that has been 
a recent focus of attention is the change in 
the real value of the outstanding nominally 
denominated government debt (see Jeremy 
Siegel, 1979, and Robert Risner and Paul 
Pieper, 1984). Given that the U.S. federal 


!See Barro (1982) or Fischer (1982). 
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debt is roughly ten times the size of the 
monetary base, the revenue effect of an in- 
flation-induced change in the real value of 
the national debt will exceed the revenue 
from the taxation of the monetary base 
through seigniorage.? With this in mind, we 
propose in this paper a model in which the 
primary effect of monetary policy is the 
change in the real value of the nominally 
denominated national debt. In this model 
unanticipated inflation reduces the real value 
of the national debt in much the same way 
as a default. The effect of unanticipated 
inflation. on the real national debt has 
captured the attention of those (such as 
Guillermo Calvo, 1978, 1988; Lucas and 
Nancy Stokey, 1983; and Robert Barro and 
David Gordon, 1983) examining the time 
consistency problem of government finance. 
Our work follows their lead with one impor- 
tant difference—agents in our model have 
finite planning horizons (lives). The assump- 
tion of finite planning horizons implies that 
the change in the real debt will affect the 
distribution of personal wealth among agents 
at different stages in their life cycle and thus 
consumption, capital accumulation, output, 
and other macroeconomic variables. 

The model is a version of Peter Diamond's 
(1965) overlapping generations model in 
which fiat money is held to satisfy legally 
imposed reserve requirements and where the 
government debt is denominated in units of 
fiat money. Agents may save for old age with 
either capital or government bonds, which 
are viewed as net wealth because of the finite 
planning horizon of each agent. Money is 
assumed to offer no transaction services in 
order to demonstrate that the real effects of 


monetary policy follow from its fiscal effects, | 


not its effects on transaction costs or the 
mechanics of exchange. 

This model shares some features with other 
models. As in the IS-LM model, for exam- 
ple, monetary policy works only to the ex- 
tent that it lowers the real interest rate, 


The Federal Reserve Bulletin of October 1988 puts 
the federal debt at $2,493 billion (Table 1.40, page 
A-30) and the monetary base at $263 billion (Table 
1.20, page A-12). 
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stimulating investment and output. As in 
disequilibrium models, monetary policy has ` 
an effect because a critical value (the na- 
tional debt in our model, prices in disequi- 
librium models) is fixed in nominal terms. 
However, unlike the static IS-LM model or 
fixed-price disequilibrium models, the model 
of this paper is constructed from a fully 
explicit general equilibrium model of utility 
maximizing rational agents, with no ad hoc 
restrictions on thé behavior of agents or 
prices. As a result it makes clear distinctions 
between the effects of changes in the mo- 
ments of the monetary policy process and 
the effects of different realizations of a given 
monetary policy process. In this way the 
model is consistent with the classical belief 
in the “long run” neutrality of monetary 


policy. 
1. The Model 
A. The Environment 


The model is an adaptation of Diamond’s 
(1965) version of Samuelson’s (1958) over- 
lapping generations model. In each period 
t21, N, two-period lived agents are born. 
Let N,=nN,_, for each period t, where n 
is a positive constant. An agent born at t 
maximizes the expected value of the twice 
continuously differentiable utility function 
U(c,,) t E,c;,, where c, denotes the agent's 
consumption of the economy's sole con- 
sumption good in the ith period of life. The 
function U(-) is strictly concave and strictly 
increasing, with lim, , qQU'(c) = oc. 

Each agent born in period £21 is en- 
dowed with one unit of labor when young 
(and nothing when old), which is supplied 
inelastically to the labor market. There also 
exist N members of generation 0 (to be 
called the initial old) who are alive only in 
period 1. The utility of each initial old agent 
is a strictly increasing function of personal 
consumption in period 1. 

In any period, one unit of capital can be 
created from every unit of the consumption 
good that is not consumed. We adopt the 
constant-returns-to-scale production func- 
tion of Diamond (1965), where production 
of the good at ¢ is a function, F(K,_,, Li) 
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of capital created at 1 — 1, K, ,, and labor 
supplied at f, L, The function F(-,-) is 
twice continuously differentiable, concave, 
and increasing in each argument. Let k, rep- 
resent the units of capital created by an 
individual in any period t. The assumption 
of constant returns to scale allows us to 
define production at ¢+1 as a function of 
the capital created per young person at t: 
f(k,) = F(k,, n) units of the consumption 
good in: period ¢+1. It follows that the 
function f(k,) is strictly increasing, concave, 
and twice continuously differentiable. Capi- 
tal created at ¢ completely disintegrates at 
the end of period t +1. 

The initial old are endowed with M, units 
of fiat money, which consists of unbacked, 
intrinsically useless pieces of paper costlessly 
produced by the government. The stock of 
fiat money changes according to the rule 
M,=2,M,_,, where M, denotes the money 
stock at time 7 after the change in the money 
stock has taken place. The change in the 
money stock at t is accomplished through 
lump-sum subsidies of a,2(z,—1)M,. ,/ 
N,_ units of fiat money to each old person. 
The gross rate of change of the fiat money 
Stock is z, = z*/(1-— e,), where z* is a posi- 
tive random variable and e, is a zero-mean, 
serially uncorrelated random variable with 
e€, <1 Vt. The realizations of z* are known 
by all with complete certainty at ¢—1, but 
the realizations of e, are not known until t. 
We will therefore label z as the “antic- 
ipated" rate of fiat money creation at 1, 
while e, will represent the unanticipated in- 
novation at ¢ in the money creation process. 
Each young agent is required by law to hold 
fiat money worth at least y units of the 
consumption good. Let .p, denote the price 
of a good in units of fiat money. 

In each period ¢ the government uses up 
g, goods per young agent in some project 
that has no direct effect on the utility or 
consumption of agents. In each period ¢ the 
government is also assumed to have an out- 
standing nominally denominated debt worth 
B, units of fiat money. This government debt 
has a one-period maturity and pays the nom- 
inal gross rate of return R, at ¢+1. In the 
initial period the initial old own a total of By 


JUNE 1990 


units of nominal government debt and so are 
owed RoB, units of fiat money in period 1. 
To finance its expenditures and debt obli- 
gations, the government collects a lump-sum 
tax of +, units of the consumption good from 
each young person in each period t. Note 
that seigniorage revenue is assumed to be 
used only to finance subsidies so that it will 
not finance any part of government expendi- 
tures, interest payments, or debt retirement. 
This is assumed so that all the revenue ef- 
fects of the expansion of the fiat money 
stock in this model will come solely from the 
effects of inflation on the real value of na- 
tional debt. The government budget con- 
straints in period ¢ can now be written as: 


(1) Np, + R,,B,.,— By + PN To 
(2) M, = M,_1+ Na). 
B. Equilibrium Conditions 


Define a rational expectations competitive 
equilibrium in this economy as values of the 
endogenous variables (listed below) such that 
(1) firms chocse capital and labor to maxi- 
mize profits while taking factor prices as 
given, (2) agents choose their assets to maxi- 
mize expected utility taking the probability 
distributions cf rates of return and the price 
level as given, (3) the probability distribu- 
tions of rates of return and the price level 
anticipated by agents when solving their 
maximization problems are the actual equi- 
librium probability distributions, and (4) 
markets clear. 

Perfectly competitive markets in capital 
and labor will ensure that x,_,, the marginal 
gross real return in period £ to a unit of 
capital created at ¢—1, is equal to its 
marginal product, or 


(3) xy = Fe(k, n) = f(k) , 


and that the real wage, the return to a unit 
of labor at ¢ (w;), is its marginal product, 


(4) w= F,( k, a, n) 


NE ETE A 


VOL. 80 NO. 3 


The maximization problem of an individ- 


ual agent born at ¢ can now be expressed as . 


the choice of personal holdings of capital 
(k,), bonds, (b), and fiat money (m,) to 


maximize expected utility U(c,,)+ E,c;, sub- . 


ject to his budget constraints: 
(5) cu= w= n k, b/p m/p 
(6) e 7 xk, R,bi/ Piit M,/ Pii 
+ a,41/ Peri 
0) - m,/ p, 2 y. 
The following market clearing constraints 
(8) B,= N,b,, 
(9) M,= Nm, 


and the government budget constraints (1) 
and (2) must also be met in equilibrium. 
We will confine our attention to equilibria 
in which the legal requirement to hold fiat 
money is binding (m,/ p, = y V t), equilibria 
in which the rate of return on capital exceeds 
the expected rate of return on fiat money 
[/ (Kk) > n] so that fiat money is not viewed 
as a substitute for capital. Notice first that 
equations (2) and (9) imply that m,_,+ a,— 
nm,. When the legal restriction (7) holds 
with equality, the problem of an agent born 
at t is simplified to the choice of real capital 
(k,) and nominal bonds (b) to maximize 
expected personal utility. The budget con- 
straints (5) and (6) can now be written as: 


(10) 
(11) 


The resulting first-order conditions for a 
maximum can be written as: 


(2) Uu =~ y= b/p- v] - f'(&) 


and 


(13) 


Cu = WT Tr ki B/D, Y 


Cop 7 X,k, + R,b,/ p, 1 ny. 


Xx,— f k,) E E|[R,p,/ p, i]. 


CHAMP. AND FREEMAN: THE NOMINAL NATIONAL DEBT 393 


The first (12) describes an agent’s saving 
behavior, and the second (13) represents an 
arbitrage condition guaranteeing the equality 
of the expected real rates of return of-capital 
and bonds. 

We can use (13) to find an expression for 
the nominal interest rate R, as a function of 
capital and expected inflation. First, we will 


. need an expression for the inverse of the 


inflation rate, the rate of return on fiat 
money. From (6) and (8), 


(14) p,7 M,/Ny 
(15) = P,/ Ppa. MN aY/M, NY 
—/Zji 


—n (1 = bj) / ta 


Then E|[R,p,/ Piil aa R,E,[nQ — £(43)/ 
z,41*] 7 R,n/zA, and from (13): 


(16) R,= f'(k)zi/n. 


The nominal interest rate on bonds is there- 
fore a function solely of the expected rate of 
money creation, the rate of growth of the 
economy (n), and the marginal product of 
capital. Notice that R,>1 if, as assumed, 


fk) - n/Z9y. 


The Real Effects of Inflation. We are now 
ready to examine the effects of anticipated 
and unanticipated inflation on the capital 
stock, the real interest rate, and output. The 
central real variable in this economy is the 
capital stock. The real interest rate at ¢ is the 
marginal product of capital, /'(k,) a de- 
creasing function of capital. Real output and 
the real wage at ¢+1 are increasing func- 
tions of capital created at ¢. The effects of 
anticipated and unanticipated inflation on 
the capital stock are easy to determine and 
are presented in Propositions 1 and 2. 


PROPOSITION 1: The real values of capi- 
tal, output, the wage, and the interest rate are 
independent of z,, the anticipated rate of 
money creation. 
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PROPOSITION 2: A positive realization of 
e, (an unanticipated positive innovation in the 
fiat money stock) is associated with an in- 
crease in the current capital, a decrease in the 
current real and nominal interest rates, and 
an increase of real output and the real wage 
one period later. 


We will prove these propositions by finding 
capital as a reduced form function of z and 
e,- From the government budget constraint 
(1), we can find the following expression for 
real government debt per young agent at t: 


(17) B,/p,N, Ry AB, A/p,N t g,—7.. 


Substitution of this expression for b,/ p, into 
the first-order condition describing desired 
saving (12) yields 


(18) U'[w, UE R, B, íA/ p, N, 


= gk oy] = f'(k,). 


Define d, R, B, ,/p,N, as the real bur- 
den of the national debt passed on to each 
young agent at 1, that is, the current value of 
the last period's debt (including interest). 


LEMMA 1: Capital at t per young agent 

_ (kj) is a decreasing function of the burden of 
the national debt at t per young agent (d,) 
and of government expenditures at t per young 
person (g,) and an increasing function of the 
real wage at t (w,). . 


Lemma 1 can be quickly proven by applying 
the implicit function theorem to (18). 

It remains to determine the effect of zt 
and e, on the current value of last period's 
debt. 


(19) d,= R, 4,B, ,/p,N, 
= R.a[Ba/ p aAN adG a pn) 
= RLa[B a7 p GN ul l 
X(1—.e,)/z* from (14) 
-f'(a Gt [Ba pa Nul 
x(1—s&)/z* from (15) 


= [z k,-1)/n] [Bi pi Ni] 
X(1—&). 
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Notice from (18) that the current value of 
last period's debt is unaffected by the rate of 
anticipated money creation (z*) but is a 
function of £, A positive value of e, repre- 
sents an unanticipated positive innovation in 
the fiat money stock and, hence, a negative 
innovation in the rate of return of fiat money. 
Since the debt is denominated in units of fiat 
money, an unanticipated inflation acts like a 
partial default and lowers the value of the 
outstanding national debt. Since the capital 
stock at t is a decreasing function of govern- 
ment debt at ¢ (see Lemma 1), such an 
unanticipated inflation will increase the for- 
mation of new capital, reducing the marginal 
product of capital created at t (the expected 
real interest rate of assets paying a return at 
t +1) and increasing output at t + 1. (Output 
responds to unanticipated inflation with a 
lag because of the assumption that it takes 
one period for new capital to augment pro- 
duction. A more general assumption would 
be that new capital contributes to both cur- 
rent and future output, resulting in' both a 
contemporaneous and lagged response of 
output to unanticipated inflation. This case 
is presented in the appendix.) 
. Unanticipated inflation  redistributes 
wealth from the current old to future genera- 
tions. The current young react by increasing 
investment regardless of the method of fi- 
nancing the burden of past debt. If the gov- 
ernment responds to the smaller burden of 
past debt with a reduction in new det, the 
young replace bonds with capital in their 
portfolios. If, on the other hand, it responds 
with a reduction in taxes, the young are 
wealthier and thus desire to save more 
(through increased capital) for their old age. 
The nonneutrality of government debt is 
crucial to the implications of the model. If 
all agents had infinite lives, changes in gov- 
ernment debt would have no effect on cur- 
rent consumption or the capital stock—the 
wealth increment from any increase in real 
government debt would be exactly offset by 
the expected burden of future taxes to ser- 
vice the debt. To pay the expected tax bur- 
den, an infinitely lived agent would inzrease 
his savings by exactly the amount of the 
increase in the real debt, preventing the in- 
creased national debt from crowding out 
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capital. As Barro (1974) has argued, agents 
who value the utility of their offspring and 
actively leave them bequests will react to 
changes in the national debt in the same way 
as infinitely lived agents. 

The model’s implication that the resulting 
change in the capital stock is of the same 
magnitude whether the burden of the past 
debt is financed by current taxes or ad- 
ditional debt follows from the constant 
marginal utility of consumption by the old. 
It is a great convenience because we are able 
to describe the current effects of the burden 
of debt in an equation as simple as (18) 
without having to refer to the tax policy 
through which the debt is to be financed in 
future periods. 

In this model the means of financing the 
added burden of past debt at ¢ matter only 
to the values of the capital stock after t. If 
the added burden of past debt is financed 
through new bonds, the new debt will con- 
tinue to reduce capital in the periods after t; 
if the added burden of past debt is financed 
through taxes, the reduction of capital takes 
place only at t. 

Independent of the means by which the 
government finances the burden of past debt, 
serially uncorrelated, unanticipated innova- 
tions in the money stock lead to serially 
correlated innovations in output. Unantici- 
pated inflation at f, by increasing capital 
formed at 7, increases output and wages at 
t 1, which in turn increase capital at £ -1 
and so output at t +2. 

Anticipated inflation has no effect on real 
activity through changes in the real national 
debt because the anticipated real return on 
nominal bonds is tied through arbitrage to 
the real return on capital. This implication 
follows from two special features of the 
model—the demand for money is fixed and 
additions to the fiat money stock are dis- 
tributed back to agents. If the demand for 
fiat money were not fixed, real money bal- 
ances would respond to changes in antici- 
pated inflation with the effects on real capi- 
tal and output suggested by James Tobin 
(1965). If the seigniorage from the expansion 
of the fiat money stock were not returned to 
agents but used to help finance government 
expenditures or to retire the debt, an antici- 
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pated expansion of the fiat money stock 
would have the same qualitative effects as an 
unanticipated expansion. The effects would 
be of smaller magnitude because an antici- 
pated inflation taxes only fiat money bal- 
ances, while an unanticipated inflation taxes 
both fiat money balances and nominally de- 
nominated national debt. 

Unanticipated monetary innovations have 
real effects in this economy even if the real- 
ization of the money growth process is in- 
stantly and perfectly observed by all. In this 
way the model differs from that of Lucas 
with its assumption that agents confuse real 
and nominal shocks because they cannot 
observe the behavior of the money stock. 
Since information on monetary aggregates is 
available weekly, the lack of information 
about the current money supply is an im- 
plausible explanation for output fluctuations 
that last more than a week. In contrast, in 
the model of this paper, a monetary innova- 
tion may be known with no delay yet will 
nonetheless affect output through a devalua- 
tion of the outstanding nominal stock of 
government debt. It affects output and the 
real value of the debt because bonds are 
purchased at fixed nominal rates of interest 
before the realization of the money stock is 
known. 


If. Concluding Remarks: 


We have proposed in this paper a means 
through which changes in the stock of fiat 
money may affect such real variables as in- 
vestment, the real interest rate, and output. 
It has an effect when unanticipated infla- 
tionary innovations reduce the real value of 
the national debt. Of course, this effect has 
been considered before, in the recent work 
of Lucas and Stokey (1983) and Barro and 
Gordon (1983) and in work as early as that 
of Arthur Pigou (1943) and Irving Fisher 
(1933). The model of this paper can be dis- 
tinguished from these efforts by its explicitly 
modeled life-cycle properties. In this model 
inflation-induced reductions in the national 
debt increase output through increased capi- 
tal holding in one of two ways. To the extent 
that the reduction of the value of the old 
national debt reduces current taxes it redis- 
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tributes wealth from the owners of the bonds, 
the current old, with no propensity to save, 
to the young, who at this point in their life 
cycle are ready to invest. To the extent that 
it reduces the new level of government debt, 
it reduces the crowding out of capital in 
agent portfolios. Neither effect is an implica- 
tion of models of infinitely lived representa- 
tive agents, where reductions in the burden 
of the debt matter only through the reduc- 
tion of the distortionary taxes needed. to 
finance the debt. The potential importance 
of the redistribution of wealth through infla- 
tion was recognized by economists as early 
as Fisher (1933),? but not in explicit models 
(by today's standards) of the means through 
which the redistribution affected output. The 
model presented here, with its explicit dif- 
ferentiation of the investment behavior of 
the owners of mature bonds (the current old) 
and that of those about to invest in capital 
and bonds (the current young), is offered as 
a structure in which we may examine the 
macroeconomic effects of the redistribution 
of wealth by unanticipated inflation. 

The model is also offered as an alternative 
to Lucas's price-surprise explanation of the 
observed difference in the effects of antici- 
pated and unanticipated inflation, an alter- 
native that does not require the obviously 
counterfactual assumption that the current 
money stock is not observable. With its as- 
sumption of rational expectations, this model 
does share the policy implications of the 
Lucas model inasmuch as inflationary inno- 
vations cannot be systematically used to 
stimulate output. 


APPENDIX ` 


The model presented in the main body of the paper 
predicts that output will respond to unanticipated inno- 
vations in money with a lag. The lag comes from the 
model’s adoption of Diamond’s production technology, 
which assumes that investment produces output after a 
one-period delay. Given that one period represents half 
of a lifetime in this model, the assumed delay in produc- 
tion seems very long relative to actual production lags 
and the observed response of output to money. We may 


34 clear summary and comparison of these views 
may be found in Tobin (1980). 
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capture a more contemporaneous response of output to 
unanticipated monetary innovations in a version of the 
model in which capital produces goods in. both the 
period of its creation and in the following period. In 
particular, consider a version of the model in which the 
creation at 1 of K, units of capital produces ^, K, goods 
in period ¢ in addition to the previously assumed 
F(K,, L,,1) goods in period t--1. 

The budget constraints of a young agent born at t 
corresponding to (10) and (11) can now be written as: 


(20) eum ERE, - kb] p — v. 
Q1) Coy = X'k, + Rb, / p + ny. 


The resulting firs:-order conditions for a maximum can 
be written as: 


n-k b/p- yY] 
= x,/(1— hy) 
-f'(&)/0- 4) 
= E[Rip/ pil. 


Q2) U'[w, — 


When combined with the government budget constraint 
we can define capital per young person, k,, as a func- 
tion of the real burden of past debt, d,, as we did in 
(18): 
Q3) U'[w -4,- & - kK - Y) h) = f(k). 
As before, we find that k, is a decreasing function of 
real burden of past debt, d,. 

In this economy, aggregate real output in period ¢ is 
the sum of producüon from previously and currently 
created capital: 


Q4) Nf (a) + Nh. 


Therefore, unanticipated inflation at £, by reducing the 
real burden of past debt, will stimulate capital creation 
at t, increasing cctput at both ¢ and ¢+1. By reducing 
the assumed lag between investment and output in this 
way, we find both a contemporaneous and a lagged 
response of outpu: to monetary innovations.* 


^We might alsc have found arbitrarily more “realis- 
tic" timing of the response of output to money by 
assuming that agents live T > 2 period lives. As long as 
T is finite, the rzal burden of past debt will affect 
capital and then output. The lag between the two can 
then be set to amy length that appropriately represents 
the time that it takes for new capital to start to produce. 
However, a model with more general life spans and 
production delays immediately loses a great deal of 
tractability, which influenced us to remain within the 
friendly confines of the simple model of two-period 
lives. 
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To maintain the tractability of the model [the closed 
form solution of (23)], we have assumed that the con- 
temporaneous production, h,, from newly created capi- 
tal does not require a labor input. If it did, unantici- 
pated inflation would stimulate not only current output 
but the current real wage as well, as the increased 
capital raises the marginal product of labor. 
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Mean Reversion in Equilibrium Asset Prices 


By STEPHEN G. CECCHETTI, POK-SANG LAM, AND NELSON C. MARK* 


This paper demonstrates that negative serial correlation in long horizon stock 
returns is consistent with an equilibrium model of asset pricing. When investors 
display only a moderate. desire to smooth their consumption, commonly used 
measures of mean reversion in stock prices calculated frora historical returns data 
nearly always lie within a 60 percent confidence interval of the median of the 
Monte Carlo distributions implied by our equilibrium model. From this evidence, 
we conclude that the degree of serial correlation in the data could plausibly have 
been generated by our model. (JEL 313) 


Recent research into the behavior of the 
stock market reports evidence that returns 
are negatively serially correlated. James 
Poterba and Lawrence Summers (1988) find 
that variance ratio tests reject the hypothesis 
that stock prices follow a random walk, and 
Eugene Fama and Kenneth French (1988) 
show that there is significant negative auto- 
correlation in long-horizon returns. It is well 
known (see Stephen Leroy, 1973; Robert 
Lucas, 1978; and Ronald Michener, 1982) 
that serial correlation of returns does not in 
itself imply a violation of market efficiency.” 


*Department of Economics, The Ohio State Univer- 
sity and NBER, Department of Economics, The Ohio 
State University and Stanford University, and Depart- 
ment of Economics, The Ohio State University, respec- 
tively. This paper is a revised version of NBER Work- 
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Boston University, British Columbia, Claremont, Michi- 
gan, Ohio State, the Federal Reserve Board, Stanford, 
U.C.S.D., and the NBER. We thank Andy Abel, David 
Backus, Ben Bernanke, John Campbell, Steve Cosslett, 
Paul Evans, Jim Hamilton, Alan Gregory, Hu McCul- 
loch, Robert Shiller, Larry Summers, and two anony- 
mous referees for useful comments on earlier drafts. 
Cecchetti acknowledges financial support from the Sa- 
lomon Bros. Center for the Study of Financial Institu- 
tions at NYU and the National Science Foundation. 

Poterba and Summers find negative serial correla- 
tion for stock returns over long horizons using monthly 
and annual data. Interestingly, Andrew Lo and A. Craig 
MacKinlay (1988) find that stock returns are positively 
correlated, using weekly observations. 

?Sanford Grossman and Robert Shiller (1981) make 
this same point in showing that the "excess" volatility 
implied by-variance bounds tests can be partly ex- 


Nevertheless, there is a tendency to conclude 
that evidence of mean reversion in stock 
prices constitutes a rejection of equilibrium 
models of rational asset pricing. Fama and 
French suggest this interpretation as a logi- 
cal possibility, while Poterba and Summers 
argue that the serial correlation in returns 
should be attributed to “price fads.” In this 
paper we demonstrate that the serial correla- 
tion in returns that is computed from stock 
market data is consistent with an equilib- 
rium model of asset pricing. 

Our approach is to combine the methods 
of model calibration and statistical inference 
to critically evaluate the conclusions that can 
be drawn from the available data. We begin 
by specifying both an economic model of 
asset pricing and a stochastic model for the 
exogenous forcing process driving the eco- 
nomic fundamentals. The forcing process is 


. then calibrated to actual data from the U.S. 


398 


economy over a long historical period. From 
this structure, we coinpute the Monte Carlo 
distributions of the statistics that previous 
investigators have used, under the null hy- 
pothesis that our;rational equilibrium model 
is true. Finally, we can state the likelihood 
that those statistics, computed with histori- 
cal returns, were actually generated from a 
model in the class that we consider. 


plained by risk aversion in a consumption beta: model. 


More recently, Fischer Black (1988) has discussed the 
relation between mean reversion and consumption 
smoothing. 
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Specifically, we begin with a variant of the 
Lucas (1978) model of an exchange economy 
in which the parametric representations for 
preferences and the stochastic process gov- 
erning the exogenous forcing variable (i.e., 
the endowment stream) admit a closed form 
solution to the asset pricing problem. We 
assume that the period utility function be- 
longs to the constant relative risk aversion 
family. For these preferences, the coefficient 
of relative risk aversion is also the inverse of 
the elasticity of intertemporal substitution in 
consumption so that it is not possible to 
separate agents’ tolerance for risk from their 
desire to have smooth consumption. Since 
our focus is on the serial correlation in asset 
returns implied by a model where agents 
confront an intertemporal consumption/ 
investment decision, we believe that it makes 
more sense to interpret the concavity of the 
utility function in terms of the consumption 
smoothing motive. 

The theory provides little guidance as to 
which time-series (i.e., consumption, output, 
or dividends) should serve as the endowment 
and from which to calibrate the model. That 
is because in the Lucas model, equilibri- 
um consumption equals output, which also 
equals dividends. Since none of these time- 
series seem to be more appropriate than the 
others a priori, we examine each of the three 
series separately. The stochastic process fol- 
lowed by the growth rate of the endowment 
is assumed to follow James Hamilton’s (1989) 
Markov switching model. This characteriza- 
tion of the forcing process has two impor- 
tant attributes. First, it allows us to model 
both the negative skewness and the excess 
kurtosis that is present in the growth rates of 
the raw data we employ.’ And second, the 
Markov switching model admits a closed 
form solution to the intertemporal asset pric- 
ing problem. Neither of these objectives can 
be accomplished with standard linear 
ARIMA models. 


*The negative skewness of growth rates for many 
macroeconomic time-series, and hence their asymmetric 
behavior over the cycle, is also discussed in Salih Neftci 
(1984). 
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The parameters of the Markov switching 
model are estimated by maximum likelihood 
employing annual observations on each of 
the series. Using these estimates, the empiri- 
cal soundness of the Markov switching model 
is demonstrated by showing that it matches 
all three time-series well in the dimension of 
the mean, variance, skewness, kurtosis, and 
first-order serial correlation. Furthermore, 
comparisons of k-step ahead in-sample fore- 
cast errors of the Markov switching model 
with autoregressions reveal roughly similar 
predictive capabilities. 

We then study the measures of mean re- 
version that have appeared in the literature. 
These are the variance ratio statistics used 
by Poterba and Summers and the long-hori- 
zon regression coefficients calculated by 
Fama and French.‘ First we calculate these 
statistics from historical data on returns 
drawn from the Standard and Poors’ index. 
The asset pricing model is calibrated by set- 
ting the parameters of the endowment pro- 
cess equal to the maximum likelihood esti- 
mates. Using the calibrated equilibrium 
model, we construct Monte Carlo distribu- 
tions for these statistics. Inferences regard- 
ing the model can then be drawn using clas- 
sical hypothesis testing procedures and the 
Monte Carlo distributions as the null. We 
are primarily interested in two hypotheses. 
The first is the random walk model of stock 
prices, which is an implication of the.Lucas 
model when agents have linear utility. The 
second hypothesis is that observed asset 
prices are determined in equilibrium but 
agents attempt to smooth their consumption. 
In this setting, asset returns can be nega- 
tively serially correlated even though they 
rationally reflect market fundamentals. 

To summarize our results at the outset, we 
find that for all return horizons both the 
variance ratio statistics and the long-horizon 
regression coefficients calculated from the 
actual Standard and Poors' returns lie near 


^We might also have examined variance bounds tests. 
But as John Campbell and Shiller (1988) point out, 
there is an equivalence between variance ratio tests of 
the type in Poterba and Summers and variance bounds 
tests pioneered by Shiller (1981). : 
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the 60 percent confidence band of the me- 
dian of the Monte Carlo distribution gener- 
ated under the linear utility (random walk) 
model. When investors display only a mod- 
erate desire to smooth their consumption, 
these same statistics calculated from the data 
lie at or near the median of the Monte Carlo 
distribution. When we test the null hypothe- 
sis against a diffuse alternative, we cannot 
réject the random walk model at the stan- 
dard 5 percent significance level, but the 
p-values of the test are much higher when 
the null distribution is generated assuming 
the utility function is concave. We conclude 
that much of the serial correlation in histori- 
cal stock returns can be attributed to small 
sample bias. However, the serial correlation 
of returns found in the data better resembles 
that of the model when the utility function is 
concave.’ 

The remainder of the paper consists of 
three sections. Section I presents the solution 
to the equilibrium asset pricing problem of 
the Lucas model when agents have constant 
relative risk aversion preferences, and the 
endowment process is assumed to follow the 
Markov switching model. We include in this 
section the maximum likelihood estimates of 
- the stochastic model, along with an evalua- 
tion of the model’s performance. Section II 
describes the Monte Carlo experiments and 
reports the main results of the paper. The 
final section offers some conclusions. 


I. The Equilibrium Model 
` A. A Case of the Lucas Model 


Consider the economy studied by Lucas 
(1978) in which there are a large number of 
infinitely lived and identical agents and a 
fixed number of assets that exogenously pro- 
duce units of the same nonstoreable con- 
sumption good. Let there be K agents and 
N productive units. Each asset has a single 


5Myung Kim, Charles Nelson, and Richard Startz 
(1988) and Matthew Richardson (1988) have recently 
examined the issue of small sample bias in these tests of 
serial correlation in stock returns. 
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perfectly divisible claim outstanding on it, 
and these claims are traded in a competitive 
equity market. The first-order necessary con- . 
ditions for a typical agents’ optimization 
problem are 


(1) P,,U'(C) 


= BE, U'(G.a)[B; ji t D; ond 
j=1,2,..., N 


where P, the real price of asset j in terms 
of the consumption good. 
U'(C) — marginal utility of consumption, 
C, for a typical consumer 
investor. 
B=a subjective discount factor, 0 < 
B <1. 
D,= the payoff or dividend from the 
jth productive unit. 
E,=the mathematical expectation 
conditioned on information 
available at time t. 


In equilibrium, per capita ownership of 
asset j is 1/K. It follows that equilibrium 
per capita consumption, C, is the per capita 
claim to the total endowment in that period, 
a /K >a 1D;. Now, make this substitution 
in equation (i) and sum over j to obtain an 
equilibrium condition involving economy- 
wide or market prices and quantities on a 
per capita basis: That is, 


(2) EU D,) = BEU'( D,,1) [Pa + Di]. 


where P, (1/K)XP;, is the share of the 
Plo s value owned by a typical agent and 
= (1/K )=D, ,. Since each productive unit 
i only a sin e share outstanding and the 
number of productive units are fixed, these 
are the theoretical value-weighted market in- 
dices adjusted for population. 
Let preferences be given by constant rela- 
tive risk aversion utility: 


U(C) - (1* y) ce, 


where — œ < y x 0 is the coefficient of rela- 
tive risk aversion. Now (2) simplifies to a 
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stochastic difference equation that is linear 
in PDY. That is, 
(3) PD? = BEP, Di + BEDS". 

E (3) forward, the current market 
value, P, can be expressed as a nonlinear 


function of current and expected future pay- 
offs, 


(4) P, =D! Y BYE, DEED. 
k=1 


To obtain a closed form solution, we must 
specify the stochastic process governing (D,), 
and this is done in Section I, Part C. We will 
refer to the exogenous forcing variable as 
dividends in the next two subsections. We do 
this because it helps to clarify the exposition, 
not because we restrict our attention to divi- 
dends when assessing the performance of the 
model. In fact, we consider alternatives as 
well. 


B. Characteristics of the Data 


The theory provides little guidance regard- 
ing the appropriate empirical counterpart 
to the exogenous forcing variable D. Be- 
cause equilibrium consumption equals out- 
put, which also equals dividends, there are 
three natural variables to serve this role. We 
consider all three candidate time-series in 
real, per capita terms: dividends, consump- 
tion, and GNP. 


SThe standard procedure in the literature has been to 
calibrate the endowment process to consumption (for 
example, Rajnish Mehra and Edward Prescott, 1985; 
Thomas Reitz, 1988; Shmuel Kandel and Robert 
Stambaugh, 1988; and George Constantinides, 1988, to 
name a few). It turns out that our results are robust to 
the particular time-series to which the endowment pro- 
cess is calibrated, whether it be consumption, dividends, 
or GNP. Because the variability of consumption, divi- 
dends, and GNP are very different from each other, the 
choice of the time-series to which the model is cali- 
brated will have different implications for other aspects 
of the model such as the implied size and volatility of 
returns and the risk premium. The aim of this paper is 
quite modest, however, in that it seeks only to examine 
conclusions that can be drawn from serial correlation in 
returns. We make no claims that our model can match 
every dimension of the data (see Kandel and Stambaugh, 
1988, who undertake a more ambitious project). 


CECCHETTI ET AL: EQUILIBRIUM ASSET PRICES > 401 


To choose an appropriate time-series 


"model for the endowment process, it is use- 


ful to know some of the details of the data. 
Table. 1 reports various summary statistics 
computed for the growth rates of dividends, 
GNP, and consumption. The data sources 
are described in the appendix. The following 
observations emerge from the table. Relative 
to a normal distribution, growth rates of the 
raw data are negatively skewed and have 
excess kurtosis. The coefficient of skewness 
is a measure of asymmetry, while the coef- 
ficient of kurtosis in excess of 3 implies that 
the distribution of the data is “fat tailed.” 
The negative skewness indicates that, rela- 
tive to a normal distribution, the data con- 
tain too many large negative values or 
“crashes.” As Reitz (1988) suggests in his 
examination of the equity premium puzzle, 
these crashes are potentially important for 
studying the dynamics of asset returns. 

The negative skewness is statistically sig- 
nificant at the 5 percent level for consump- 
tion and at the 1 percent level for dividends 
and GNP. Consequently, conventional time- 
series models, such as linear ARIMA models 
with Gaussian innovations, will be inappro- 
priate. That is, ARIMA models with normal 
error terms can never explain nonzero third 
moments in the distribution of the raw data. 

Excess kurtosis is also statistically signifi- 
cant at the 5 percent level for consumption 
and at the 1 percent level for dividends and 
GNP. Thus standard linear models will not 
capture this important characteristic of the 
data either. While conditionally normal but 
heteroskedastic models such as Robert 
Engle’s (1982) ARCH model can give rise to 
fat tails, they cannot model asymmetry. 

Our objective is to find a model that cap- 
tures these important features of the data 
and at the same time admits a solution to 
the intertemporal asset pricing problem set 
forth in Section 1, Part A. Hamilton’s (1989) 
Markov switching model meets both of these 
criteria. 


C. onto s Maro Switching Model 
Hamilton (1989) had suggested modeling 


the trends in nonstationary time-series as 
Markov processes, and has applied this ap- 
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TABLE 1 —SUMMARY STATISTICS FOR GROWTH RATES IN SAMPLE 


Consumption Dividends GNP 
Mean 0.0184 — 0.9038 0.0183 
Std. Dev. 0.0379 0.1359 0.0547 
Skewness —0.4097* — 0.5979 "0.7574? 
Coefficient (0.247) (0.227) (0.225) 
Kurtosis 3.8750* 5.830485 7.66305 
Coefficient (0.495) (0.455): (0.451) 
Minimum —0.1044 — 0.4673 — 0.2667 
Maximum 0.0989 0.4056 0.1662 
First Autocorrelation — 0.067 0.134 0.390* 
(0.101) (0.093) (0.092) 


Source: See data appendix. 


Notes: Standard errors for the skewness and kurtosis coefficients are reported in 
parentheses and are computed under the null hypothesis that growth rates of the data 
are distributed as i.i.d. normal. Significance tests are based on E. S. Pearson and H. O. 


- Hartley (1976), Tables 34.B and 34.C. 


*Significantly different from normal at the 5 percent level. 
"significantly different from normal at the 1 percent level. 
*Greater than two standard errors from zero. 


proach to the study of post-World War II 
real GNP. One of the attractive features of 
this approach is its ability to model the 
asymmetry and the leptokurtosis reported in 
Table 1. Let d, denote the logarithm of the 
endowment, D,. The Markov switching mod- 
el can be written as 


(5) i d,=d,ı+ £, + & + 0,8, 4, 


where {e,} is a sequence of independent and 
identically distributed normal variates with 
zero mean and variance o7, and {S,} is a 
- sequence of Markov random variables that 
take on values of 0 or 1 with transition 
probabilities, 


(6) Pr[S, =1|S,_,=1] =P, 

Pr[ S, = 01$, 1 71] eI P; 

Pr[S, =0|S,1= 0] =, 
and —- Pr[S,=1|S,_, =0] =1—q. 
The endowment process thus follows a ran- 
dom walk in logarithms (d, = d, , +-e,) with 
stochastic drift (a) + a,S,_,). As a normal- 


ization we restrict a, to be negative. The 
economy is said to be in a high-growth state 


or boom when S=0, and in a low-growth 
state or depression when S =1. The proba- 
bility of a boom next period given that the 
economy currently enjoys a boom is q, while 
the probability of a depression next period 
given a current depression state is p. The 
probabilities of transition from boom to de- 
pression and depression to boom are then 
l-q and 1-p, respectively. The endowment 
grows at the rate a, during a boom, and 
Qo'+ a, during a depression. The process ( S, } 
can be represented as a first-order autore- 
gression with an autocorrelation coefficient 
of (p q—1) that can be interpreted as a 
measure of persistence in the forcing pro- 
cess. 

It is also useful to think of the process 
loosely within the following context. The 
theory relates dividends to asset prices. In 
actual economies, future nominal dividend 
payments are announced in advance so a 
good deal of next period’s dividend growth 
is currently known. This is captured by the 
timing of the state in the Markov trend, and 
in the next subsection agents in the artificial 
economy will be assumed to observe the 
current state of the economy. From <5), the 
forecastable part of dividend growth during 
period t—1 is ag + a,S,_;, which is revealed 
at t —1. The unforecastable part of real divi- 
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'TABLE 2— MAXIMUM LIKELIHOOD ESTIMATES OF THE MARKOV TREND MODEL 


Yai = Ji + 0g + HS, + Eri 
Prob[s,, , 71|s, =1] = p, Prob[s,,, = 0|s, =1] - 1— p, 
Prob[s,, 7 0|s, 0] = q, Prob[s,, ; =1|s, = 0] -1— q, 
e ii.d. ~ N(0,c?). 


Parameter Consumption Dividends GNP 
P 0.5279 0.1748 0.5096 
(1.985) . (0.832) ` (2.034) 
q 0.9761 0.9508 0.9817 
(46.525) (40.785) (76.705) 
c 0.0320 0.1050 0.0433 
(12.297) (13.682) (14.932) 
ap- 0.0228 0.0171 0.0246 
(6.467) (1.579) (5.950) 
a — 0.0926 — 0.3700 — 0.1760 
(— 4.894) (— 6.548) (—7.116) 
Pr(S, =1) 0.0482 0.0563 0.0360 


Note: Asymptotic f-ratios in parentheses. 
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dend growth, eg, might be thought of as a 


combination of unanticipated inflation and 


productivity shocks. 

We note at this point that it is the data 
and not the discretion of the investigator 
that will choose the regime. That is, when we 
calculate the Monte Carlo distributions 
implied by the model, the parameters (a, o4, 
p.4,0) of the forcing process will be set 
equal to maximum likelihood estimates ob- 
tained from the data. 


D. Maximum Likelihood Estimates of 
the Markov Switching Model 


This section reports maximum likelihood 
estimates of the Markov switching model 
described above for annual dividends, GNP, 
and consumption. The model is nonlinear in 
the sense that the current minimum mean 
square error predictor of future values is a 
nonlinear function of current and lagged ob- 
servations. Even though the state, S, is un- 
observable to the econometrician, given the 
normality assumption on the e’s, the param- 
eters of the process, (p,q,a9,0,,0) can be 
estimated by maximum likelihood. The in- 
terested reader is directed to Hamilton (1989) 
for details on estimation or Pok-sang Lam 
(1988), who generalizes the Hamilton model. 


The estimation results and some summary 
Statistics are reported in Table 2. For the 
most part, the parameters are accurately esti- 
mated. When the economy is in a boom this 
year, the estimated probability that it contin- 
ues in a boom next year is q. This is esti- 
mated to be 0.95 for dividends, and 0.98 for 
GNP and consumption. The estimates of 
growth during a boom, a, are 0.017, 0.025, 
and 0.023 for dividends, GNP, and con- 
sumption, respectively. When in a boom, the 
estimated probability of a transition to a 
negative growth state next period, 1— q, is 
0.05 for dividends, and 0.02 for GNP and 
consumption. While in a depression state, 
expected growth, a + a,, is —0.35 for divi- 
dends, —0.15 for GNP and — 0.07 for con- 
sumption. 

The table also reports the unconditional 
probability of observing a depression state, 
Pr(S,—1) These are 0.056 for dividends, 
0.036 for GNP, and 0.048 for consumption. 
In other words, we expect real dividends, the 
most volatile of the three series, to crash by 
one-third in roughly 7 of the 116 years of the 
sample. While this may seem surprising, it is 
consistent with the historical experience. For 
example, the dividend model estimates imply 
that, if the economy is currently in the bad 
state (S, « 1), the 95 percent confidence in- 
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terval for growth in per capita real dividends 
is (— 0.14, — 0.56). The same confidence in- 
terval given that the economy is in the good 
state (S, = 0) is (0.23, — 0.19). Consequently, 
if dividends fall by 20 percent or more, we 
can be fairly certain that S, —1. Of the 116 
years in the sample, 8 meet this criterion." 


Once the economy finds itself in a depres- . 


sion, the probability that it will be in a 
depression the following year, p, is esti- 
mated to be 0.1748 for dividends, 0.5096 for 
GNP, and 0.5279 for consumption.® 


E. Evaluating the Markov Switching Model 


To assess the quality of the Markov 
switching model, we now compare it with 
some popular alternatives. The upper panel 
of Table 3 reports the results of two diagnos- 
tic tests. The first is a test of the Markov 
switching model against the simple nested 
null hypothesis that the data follow a geo- 
metric random walk with iid. innovations. 
Because the Markov switching model is not 
identified under thé null of the geometric 
random walk, the likelihood ratio statistic 
does not have the standard chi-squared dis- 
tribution. Therefore, we have tabulated the 
distribution of the pseudo-likelihood ratio 
statistic in order to perform this test. Our 
Monte Carlo experiment involved 1000 rep- 
lications where we first drew samples of 116 
for growth rates of GNP and dividends, and 
96 for growth rates of consumption under 
the null of a normal distribution with vari- 
ance set to values computed from the data. 
Next, we fitted the Markov switching model 


TReal per capita dividends fell by more than 30 
percent during 4 years, from 20 percent to 30 percent 
during four years, and by 10 percent to 20 percent 
during nine years of the sample. 

® We note that the likelihood function is fairly flat for 
variations in p, particularly in the estimation of the 
dividend process. This is not surprising given the asym- 
metric behavior of dividends over the business cycle. 
That is, downturns have generally been short lived, 
lasting between four and six quarters. This makes it 
difficult to obtain a good estimate of p using -annual 
observations. Hamilton does not encounter this prob- 
lem since he estimates his model using quarterly GNP 
data. 


JUNE 1990 


to this artificial data, and finally, we com- 
puted a standard likelihood ratio statistic as 
twice the difference in the maximized log 
likelihood values of the null and alternative 
models. As reported in the table, the weakest 
case is for consumption, where we reject the 
random walk at the 0.8 percent level. For 
both GNP and dividends, we observed fewer 
than two cases in 1000 where the LR statistic 
in the Monte Carlo experiment exceed the 
value obtained in the sample. Given the re- 
sults in Table 1, where the data clearly reject 
the hypothesis that growth rates were drawn 
from a normal distribution, it is perhaps not 
surprising that a formal statistical test rejects 
the random walk model for the log-levels. 


The second test reported in Table 3 is for . . 


symmetry of the Markov transition matrix, 
which implies symmetry of the uncondi- 
tional distribution of the growth rates. This 
test examines the maintained hypothesis that 
p = q against the alternative that p < q.° The 
table reports statistics that are asymptoti- 
cally standard normal under the null. We 
can reject the hypothesis of symmetry at the 
5 percent level in all three cases. 

The lower panel of Table 3 reports the 
distributional characteristics for the Markov 
switching process implied by the estimates in 
Table 2. We report the population values of 
the mean, standard deviation, coefficient of 
skewness, coefficient of kurtosis, and first- 
order autocorrelation computed from the 
point estimates of the Markov switching 
model. The values implied by the model 
generally lie within two standard deviations 
of the sample values reported in Table 1.!° 
The lone exception is the coefficient of kur- 
tosis for GNP. We conclude that the Markov 
switching model can produce both the de- 
gree of negative skewness and the amount of 
kurtosis that are found in the data. 


?This is a one-sided test of symmetry. against the 
alternative of negative skewness. 

101+ is worth noting that distribution of growth rates 
implied by the Markov switching model is always lep- 
tokurtic. Consequently, it'is a natural candidate for 
modeling data that exhibit fat tails. 

‘The results in Table 3 demonstrate that the two- 
state model we employ is capable of matching the first 
four central moments of the data and the first-order 
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TABLE 3— EVALUATING THE MARKOV SWITCHING MODEL 


Data Set 
Consumption Dividends GNP 
L.R. Statistic? 11.39 17.27 27.87 
p-value (0.008) (0.001) (0.000) 
Symmetry test? 1.72 3.71 1.89 
p-value (0.04) (0.00) (0.03) 
Distributional Characteristics of the Process Implied by the Estimates 
Consumption Dividends GNP 
` Mean F 0.0184 — 0.0038 0.0183 
Std. Dev. 0.038 0.135 : 0.054 
Skewness —0.617 . — 0.965 — 1.096 
Coefficient 
Kurtosis 4.22 5.03 6.030 
Coefficient i 
First Autocorrelation 0.140 0.05 0.179 


ê Based on a Monte Carlo experiment with 1000 replications.. 
>This is the test that p=q. The alternative hypothesis is p < q. 


As a final test of the Markov switching 
model, we have compared its ability to fore- 
cast growth rates in the three series with that 
of first- and second-order autoregressions. 
Table 4 reports the in-sample root mean 
square forecast error at horizons from one to 
ten years for an AR(1), and AR(2), and the 
Markov switching model.!* The results show 
that for dividends, the root mean squared 
forecast error of the Markov model is one- 
half that of the other models at all horizons. 
For consumption and GNP, the predictive 
ability of the three models is roughly the 
same. The Markov model marginally outper- 
forms the autoregressions for consumption 
but marginally underperforms an AR(2) for 
GNP. - 


autocorrelation. By contrast, linear ARIMA models can 
match higher-order serial correlation, but not the 
higher-order moments. A three-state Markov switching 
model would have ten parameters and would in princi- 
ple be able to match the first through sixth autocorrela- 
tion of the data in addition to the first four central 
moments. We have explored the three-state model but 
have thus far been unable to obtain sensible MLE 
parameter estimates with the annual data series studied 
here. 

"since these models are not nested, there is no 
natural statistical procedure for model selection. . 


Tbe purpose of this section has been to 
demonstrate that the Markov switching 
model is a reasonable alternative to obvious 
linear models in standard use. In addition to 
its ability to capture certain prominent fea- 
tures of the data that linear models cannot, 
the added attractiveness of the Markov 
switching model for our purposes is its ana- 
lytical tractability. We conclude that while 
there is no obvious and clear winner, a credi- 
ble case can be made for the Markov switch- 
ing model. While a better model of the data 
(especially for GNP) might incorporate both 
regime switching and AR components, we 
have been unsuccessful in our attempt to 
solve the asset pricing problem when the 
endowment follows such a process.. Our 
choice of the simple Markov switching model 
thus embodies a tradeoff between a model 
that completely matches the data and one 
that is tractable. 


F. Equilibrium Asset Prices 


Assume that the process driving the en- 
dowment is given by the Markov switching 
model of equations (5) and (6). We now 
obtain a solution to the asset pricing prob- 
lem stated in Section I, Part A, using the 
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` TABLE 4~—-RooT MEAN SQUARE FORECAST ERROR COMPARISONS USING LEVELS? 


A. Consumption 


Model 1 2 3 4 
ARI 0.136 0205 0.254 0287 
AR2 0.136 0.205 0.257 0.288 
MARKOV 0.131* 0.196*  0.249* 0.286* 
B. GNP 

Model 1 2 3 4 
ARI 0.324 0.594 0.808 0.962 
AR2 0.318* 0.578" 0.790* 0.936 
MARKOV 0.351 0,608 0.796 0.936* 
C. Dividends 

Model 1 2 3 4 
ARI 0.017 0.021 0.025 0.029 
AR2 0.031 0.033 0.034 0.037 
MARKOV 0.009* 0.012* 0.014* 0.017* 
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Horizon 

5 6 7 8 9 10 
0.313* 0.340* 0.366 0.393" 0.415* 0428" 
0314 0341 0.367 . 0.396 0.417 . 0.431 
0.315 0.342 . 0.366* 0.396 0.419 = 0.432 

Horizon : 

5 6 7 8 9 10 
1.055 1.093 1.109 1.126 1.186 1.241 
1.023 1.009 — 1.079 1.005* 1.154*  1.202* 
1.000* 1.054* 1.074* 1.102 1.167 1.220 

Horizon 

5 6 7 8 9 10 
0.033 0.088 0.042 0.047 0.052 0.057 
0.040 0.044 0.047 0051 0.055 0.060 
0.020*  0.023*  0.026* 0,029*  0.032*  0.035* 


*A * indicates lowest root mean square forecast error. Some entries appear the same due to rounding. 


method of undetermined coefficients. Con- 
jecture the following solution: 


(7) 
The problem is to verify that (7) solves. (3) 


and to find the function p(S,). To do this, 
substitute (7) into (3) to obtain 


(8) P(S) D? = BE,D, A [oC Sa) +1]. 
Next, write (5) in levels, 


(9) 


Now substitute (9) into (8) and note that e is 
iid. normal with variance o? to obtain 


P,=(S,) D. 


D, = D,e Got Site), 


(10) e(S,) = fieles) +y)? 
x ela MSIE [ p( S41) +1] . 


Because S, can take on only two values, 0 or 


PUsing formulas derived by Hamilton (1989), one 
can also evaluate the series given in equation (4) di- 
rectly. The appendix to the working paper version of 
this paper shows that both methods yield the same 
answer. 


1, (10) is a system of two linear equations in 
p(0) and p(1). The solution is given by 


(11) e(0) - B[1- B&( p *- a—1)]/A, 
(12) p(1) 2 a[1- B( p * a-1)]/A, 


where B= =Belodt+n+d+y’s va áj- en *D, 


and A-1— B(pá,* q)* Pap +q-1). 
This, establishes that (7) is the solution to 
(3). 

A number of interesting features of the 
equilibrium price function emerge. First, as- 
set prices are proportional to the endow- 
ment.! Second, the factor of proportionality 
depends on the inverse of the elasticity of 
intertemporal substitution and whether the 


1t is possible to show that as long as both (0) > 0 
and p(1) > 0, the transversality condition is met and the 
power series (4) converges. In addition, this solution 
technique can easily be generalized to the case of n-states 
in the mean and the variance. 

151n the simple model studied here this implies that 
the price dividend ratio takes on one of two values, p(0) 
or p(1). This is a consequence of assuming that agents 
observe S. In the more realistic case in which S, is 
unobserved and- must be estimated, the price dividend 
ratio would be a continuous variable fluctuating be- 
tween the two bounds of p(0) and p(1). 
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economy is currently in the high-growth state 
or low-growth state according to 


p(0) 2 p(1) as y -1. 


The interpretation of this is straightfor- 
ward. For a given level of the current en- 
dowment, suppose that the economy is 
known to be in a high-growth state (S, = 0). 
By (6), this implies that the economy is likely 
to remain in a high-growth state into the 
future, and agents anticipate high future lev- 
els of the endowment. This has two effects 
on asset prices that work in opposite direc- 
tions. First, there is an intertemporal relative 
price effect in which the higher expected 
future endowment implies a lower relative 
price of future goods. This induces agents to 
want to increase saving and to increase their 
demand for assets. The increased asset de- 
mand arising from this intertemporal relative 
price effect works to raise current asset prices. 
Working in the opposite direction is a substi- 
tution effect arising from agents' attempts to 
smooth their consumption. When the ex- 
pected future endowment is high, the con- 
sumption smoothing motive leads agents to 
want to increase. current consumption in an- 
ticipation of higher future investment in- 
come. To finance higher current consump- 
tion, they attempt to sell off part of their 
asset holdings, which in equilibrium results 
in falling asset prices. 

Log utility (y = —1) is a borderline case in 
which the intertemporal relative price effect 
and the consumption smoothing effect ex- 
actly cancel out. This can be seen, perhaps, 
more clearly from (4), in which the solution 
for y-—1 is P,7(B/[1—8])D, In this 
case, the factor of proportionality relating 
prices to dividends is a constant. When the 
utility function is less concave than it is in 
the log case, the intertemporal relative price 
effect assumes greater importance, so that 
p(0) — p(1). In the limiting case of linear 
utility (y —0), the intertemporal relative 
price effect is all that matters since agents 
have no desire to smooth consumption. Con- 
versely, when the utility function is more 
concave than is implied by log utility, the 
intertemporal consumption smoothing effect 
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dominates the intertemporal relative, price 
effect causing p(1) » p(0). . 

From (5) and (7), equilibrium gross re- 
turns are computed as 


(13 R,=(P,+ D)/P.., 


= ( [oC S) +1]/e(S,-1)} 
xexp[a + a4S,.., ^ e]. 


Notice that because the gross return depends 
on £, it is a continuous random variable on 
[0, co) and not a two-point process. 


Il. The Serial Correlation of Equilibrium and 
` Historical Returns 


In this section, returns obtained from the 
equilibrium model of Section I are used to 
generate Monte Carlo distributions of the 
variance ratio statistics used by Poterba and 
Summers and the regression coefficients cal- 
culated by Fama and French. These distri- 
butions are generated both for the case of 
linear utility and for a case in which the 
utility function is concave. They are then 
used to draw inference about the equilibrium 
model and the model driving the exogenous 
forcing variable. For a given value of the 
elasticity of intertemporal substitution, the 
model is calibrated to the estimated divi- 
dend, consumption, and GNP processes re- : 
ported in Table 1. That is, the parameters of 
the forcing process, ( p, q, 0, %, o4) are set to 
the values in the columns of Table 2, and 
each case is considered in turn. The subjec- 
tive discount factor B is assumed to be 0.97 
throughout. 

The procedure is as follows: First, given p 
and q, we generate a sequence of 116 S,’s 
according to (6). Second, given c, 116 inde- 
pendent draws from a normal distribution 
with zero mean and variance o? are taken to 
form a sequence of e,'s. Third, given ag, o, 
B. v, {s,}, and {e,}, we generate a sample of 
116 returns according to equation (13). For 
each sample of returns, the variance ratio 
and regression coefficients are calculated for 
horizons 1 through 10. This experiment is 
repeated 10,000 times. The tabulation of 
these calculations is the Monte Carlo distri- 


408 l THE AMERICAN ECONOMIC REVIEW 


bution of the statistic from which we draw 
inference. The sample size of 116 is chosen 
to correspond to the 116 annual observa- 
tions available in the actual Standard and 
Poors’ returns. We calculate the median, 60 
percent confidence intervals about the me- 
dian of the distribution, and the p-value for 
the statistic under investigation. We refer to 
these as the “small” sample results. A me- 
dian is also calculated from 10,000 time- 
series samples of 1160 returns each, to get an 
idea of the rate of convergence of the vari- 
ance ratio or regression coefficient statistic to 
its true population value. We refer to this as 
the “large” sample median. The results we 
obtain when the model is calibrated to con- 
sumption is representative of the results for 
dividend and GNP models. To facilitate the 
exposition, the results for the consumption 
model are also displayed in figures. Each 
figure displays the small sample medians, the 
60 percent confidence intervals about the 
median, and the point estimates calculated 
from the historical Standard and Poors’ re- 
turns. 

From the Monte Carlo distributions of the 
variance ratio statistic and the autocorrela- 
tion coefficient on returns, we can determine 
the likelihood that the estimates obtained 
from the historical data were drawn from the 
Monte Carlo distribution implied by equilib- 
rium returns. 


A. Variance Ratios 


Let R, be the one period real rate of 
return, and RF be the iple k-period re- 
turn. That is, ‘Rk= E ZoR,-; Poterba and 
Summers define ‘the variance ratio for re- 
turns at the Ath horizon as 


Var( Rf) 


(14) VR(k) TXVa(R) 


It is easy to show that the variance ratio can 
be expressed in terms of the return's auto- 
correlations. That is, 


k-1 
(15) VR(K) «17 Y, (k- oy 


ja 


where p, is the jth autocorrelation of annual 
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returns. When returns are serially uncorre- 
lated, the variance ratio is equal to one for 
all k in large samples. This is usually taken 
as the null hypothesis in tests of "market 
efficiency," corresponds to the case where 
stock prices follow a random walk, and is 
true in the equilibrium model of Section I 
only when investors have linear utility. Stock 
prices are said to be “mean reverting” if 
returns are negatively serially correlated and 
evidence of mean reversion is inferred from 
variance ratios that lie below unity. This is 
the finding of Poterba and Summers. 

We consider the case of linear utility first. 
Figure 1 displays the results under linear 
utility for the model calibrated to the con- 
sumption process. Since these returns are 
uncorrelated by construction, all of the devi- 
ation of the median of the variance ratio's 
distribution from unity is due to small sam- 
ple bias." In the large sample (T — 1160), 
most of the bias has disappeared. It is also 
seen that the variance ratios calculated from 
the Standard and Poors' data fall within the 
60 percent confidence interval of the Monte 
Carlo distribution.!® The serial correlation of 
returns, and hence their predictability, is only 
apparent. 

This result can be viewed in the same light 
as the business cycle in which recessions 
occur with random periodicity. Although real 
GNP may appear to be mean reverting, this 
does not imply that business cycle turning 
points are predictable. In the equilibrium 
model of asset prices, the exogenous forcing 


léin small samples, as Poterba and Summers point 
out, the sample autocorrelations of returns are biased so 
E [VR(k)] «1 even when returns are independent. 

UIt bears mentioning that even in the case of linear 
utility and the geometric random walk, our empirical 
distributions differ from those reported in Poterba and 
Summers and Fama and French. The reason is that we 
assume a probability model for the endowment process 
and study the dynamics of the returns implied by an 
equilibrium mode, whereas these authors assume a 
probability model for the returns. Since the return is a 
nonlinear function of the endowment in our setup, these 
two approaches need not yield the same small sample 
results. 

18These estimates of the variance ratios are smaller 
than those reported by Poterba and Summers because 
they make a bias correction assuming a null hypothesis 
of a homoscedastic random walk for asset prices. The 
bias correction is irrelevant for our purposes. 
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FIGURE 1. VARIANCE RATIOS. EQUILIBRIUM RETURNS GENERATED BY LINEAR 
UTILITY USING CONSUMPTION 


=-=- 60% confidence intervals 
——— Small sample median 

€ Large sample median 

O Estimated from data 
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FIGURE 2. VARIANCE RATIOS. EQUILIBRIUM RETURNS GENERATED BY 
CONCAVE UTILITY USING CONSUMPTION 


variable has a business cycle interpretation. 
The stochastic process of the forcing variable 
implies that a boom will eventually be fol- 
lowed by a rėcession, which will quickly be 
followed by a boom. Since equilibrium asset 
prices are proportional to the forcing vari- 


able, the appearance of mean reversion of 
asset prices is produced, but this does not 
mean that returns are predictable. 

When agents’ utility function is concave, 
the results are even more favorable to the 
model. Figure 2 reports the results of the 
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TABLE 5— VARIANCE RATIOS FOR HISTORICAL AND EQUILIBRIUM RETURNS 
Linear Utility Concave Utility 
T —116 T =1160 T =116 T —1160 
k Actual Median p-Value Median Median p-Value Median Population 
(1) (2) (3) (4) (5 (6) (7) (8) (9) 
Endowment Calibrated to Consumption: 
2 1.0137 0.9865 - 0.5779 1.0004 0.9462 . 0.6811 0.9523 0.9524 
3 0.8664 0.9669 0.3049 1.0005 0.8941 0.4482 0.9212 0.9206 
4 0.8351 0.9385 0.3301 0.9977 0.8511 0.4742 0.9006 0.8987 
5 0.7978 0.9115 0.3227 0.9989 0.8181 0.4671 0.8812 0.8831 
6 0.7459 0.8926 0.2903 0.9950 0.7782 0.4515 0.8707 0.8716 
7 0.7259 0.8811 0.2877 0.9953 0.7576 0.4534 0.8613 0.8630 
8 0.7363 0.8678 0.3307 0.9912 0.7380 0.4965 0.8528 0.8564 
9 0.7102 0.8527 0.3238 0.9876 0.7264 0.4796 0.8474 0.8511 
10 0.7242 0.8268 0.3737 0.9869 0.7083 0.5216 0.8391 0.8469 
' Endowment Calibrated to Dividends: 
2 1.0137 0.9835 0.6151 0.9959 0.8721 0.8986 0.8786 0.8828 
3 0.8664 0.9683 0.2374 0.9934 0.8166 0.6350 0.8266 0.8340 
4 0.8351 0.9541 0.2420 ` 0.9912 0.7746 0.6425 0.8021 0.8086 
5 0.7978 0.9367 0.2401 0.9906 0.7478 0.6069 0.7848 0.7933 
6 0.7459 0.9230 0.2042 0.9891 0.7286 0.5355 0.7742 0.7831 
7 0.7259 0.9058 0.2168 0.9876 0.7103 0.5277 0.7654 0.7758 
8 0.7363 0.8906 0.2582 0.9875 0.6957 0.5759 0.7595 0.7704 
9 0.7102 0.8851 0.2490 0.9838 0.6821 0.5492 0.7547 0.7661 
10 0.7242 0.8682 0.3000 0.9840 0.6718 0.5858 0.7496 0.7627 
Endowment Calibrated to GNP: 
2 1.0137 0.9782 0.5877 0.9966 0.9401 0.6632 0.9463 0.9490 
3 0.8664 0.9403 0.3722 0.9936 0.8760 0.4868 0.9102 0.9153 
4 0.8351 0.9075 0.3837 0.9911 0.8296 0.5074 0.8839 0.8923 
5 0.7978 0.8806 0.3757 0.9859 0.7858 0.5168 0.8658 0.8761 
6 0.7459 0.8590 0.3451 0.9850 0.7516 0.4911 0.8492 0.8643 
7 0.7259 0.8385 0.3541 0.9778 0.7424 0.4800 0.8390 0.8554 
8 0.7363 0.8252 0.3909 0.9773 0.7136 0.5309. 0.8285 0.8486 
9 0.7102 0.8120 0.3763 0.9770 0.6933 0.5229 0.8204 0.8432 
10 0.7242 0.7857 0.4227 0.9730 0.6833 0.5487 0.8172 0.8389 


Notes: Under linear utility, y = 0. Under concave utility, y is set to —1.7 for the consumption model, — 14 for the 
dividend model, and —1.6 for the GNP model. 8 = 0.97 throughout. Column 1: Horizon, in years. Column 2: 
Variance ratios of historical Standard and Poors' returns. Column 3: Median of Monte Carlo distribution of variance 
ratios for 116 equilibrium returns generated with linear utility. Column 4: Percentage of Monte Carlo distribution 
having values less than the value in column 3. Column 5: Median of Monte Carlo distribution for variance ratios of 
1160 equilibrium returns generated with linear utility. Column 6: Median of Monte Carlo distribution of variance 
ratios of 116 equilibrium returns generated with concave utility. Column 7: Percentage of Monte Carlo distribution 
with values less than the value in column 6. Column 8: Median of Monte Carlo distribution for variance ratios of 
1160 equilibrium returns generated with concave utility. Column 9: Population variance ratio of equilibrium returns. 


above calculations assuming concave utility 
with y= —1.7 and the forcing process 
matched to consumption. Now the median 
of both the small and large sample distribu- 
tions of the variance ratio statistics are well 
below 1.0 at every horizon. The median of 
the small sample (T —116) distribution 
closely matches the variance ratios calcu- 


lated from the annual returns on the Stan- 
dard and Poors'.? 
Table 5 reports results for the model cali- 


brated to consumption, dividends, and GNP, 


When y = —2, the model yields much more mean 
reversion than is in the data. The entire 60 percent 
confidence band lies below the sample values. 
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from which we make the following observa- 
tions. First, the results for variance ratios are 
fairly robust to the choice of the time-series 
to which the model is calibrated. P-values 
between 0.2 and 0.8 imply that the sample 
variance ratios lie within the 60 percent con- 
fidence interval of the Monte Carlo distribu- 
tion median. Thus, it can be seen that even 
under linear utility, the model cannot be 
rejected at conventional significance levels. 
From column 4, the smallest p-value is ob- 
tained at the six-year horizon in the dividend 
model ( p-value — 0.204). When the sample 
size is increased tenfold, (T =1160), most of 
the small sample bias disappears. When the 
utility function is concave, the median of the 
distribution matches up well with the values 
implied by the data as the p-values in col- 
umn 7 are generally in the neighborhood of 
0.5. A sizable small sample bias remains 
present, but as in the linear utility case, most 
of this bias vanishes if the sample is made 
ten times longer. 


B. Regression Coefficients on Returns of 
Varying Horizons 


Consider estimating the first-order serial 
correlation coefficient on 7-year returns by 
running the following regression: 


(16) Ratt” a, +b, Ririt Us tar 


T —],2,...,10 years, 


where R,,,, is the continuously com- 
pounded real stock return from ¢ to t+ v. It 
is easy to show that the relation between the 
autocorrelations of one-period returns and 
the autocorrelation of the 7-period return is 


3 py +2p,+ +++ +p, +(T—1) P41 
7 7+2(7-1)p,+2(7—2)p, 


x +++ +2po, o 3t84a 
qose t*t2p.i 


Using monthly returns on the CRSP index, 
Fama and French find that the slope coeffi- 
cient b, is negative for + greater than one 
year. From this they infer that stock prices 
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are mean reverting. We examine their result 
by computing the empirical distribution of 
these regression coefficients implied by the 
model in Section I. 

We begin with the linear utility (y= 0) 
case. Figure 3 displays results for the model 
calibrated to consumption, the small sample 
median, and 60 percent confidence intervals 
of the Monte Carlo distribution of the re- 
gression coefficient b,, the population values 
implied by the model, and the estimates 
obtained from the Standard and Poors’ re- 
turns. Again, the deviation of the median of 
the small sample (T =116) distribution from 
zero is due to small sample bias. This bias 
increases as 7 gets larger, because the effec- 
tive sample size, as measured by the number 
of independent pieces of information (non- 
overlapping observations), decreases with +. 
For example, at the ten-year horizon, there 
are only ten nonoverlapping observations 
available in the Standard and Poors’ data, 
and six nonoverlapping observations avail- 
able in the CRSP returns! 

The median of the large sample distribu- 
tion (T —1160), on the other hand, is reason- 
ably close to the true value of zero. The 
regression coefficients calculated from the 


Standard and Poors' data uniformly lie be- 


low the median of the small sample Monte 
Carlo distribution in the consumption model. 

Figure 4 displays the details of the Monte 
Carlo distributions of the regression coeffi- 
cients obtained from the equilibrium returns 
when y - —1.7 in the consumption model. 
As in Figure 3, the regression coefficient 
uniformly lies within the 60 percent confi- 
dence interval of the median. The distance 
between the small sample medians and the 
actual estimates tend to be smaller here than 
when agents have linear utility. 

Table 6 reports results for the model cali- 
brated to consumption, dividends, and GNP, 
from which we make the following observa- 
tions. As we found with the variance ratios, 
the results on the regression coefficients are 
robust to the series to which the model is 
calibrated. Under linear utility, the strongest 


: evidence against the model comes when the 


model is calibrated to dividends at the two- 
year horizon ( p-value = 0.1099). Most of the 
small sample bias vanishes when T 1160. 
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FIGURE 3. RETURN REGRESSION COEFFICIENTS. EQUILIBRIUM RETURNS ` 
GENERATED BY LINEAR UTILITY UsING CONSUMPTION 
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FIGURE 4. RETURN REGRESSION COEFFICIENTS. EQUILIBRIUM RETURNS 
GENERATED BY CONCAVE UTILITY USING CONSUMPTION 
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TABLE 6— REGRESSION COEFFICIENTS FOR HISTORICAL AND EQUILIBRIUM RETURNS 
Linear Utility : Concave Utility 
.T=116 T-1160 T=116 T =1160 
k Actual Median p-Value Median Median p-Value Median Population 
(1) Q) G) a (5) (6) o (8) (9) 
Endowment Calibrated to Consumption: : 

1 0.0147 — 0.0108 0.5747 0.0002 — 0.0541 0.6788 — 0.0466 — 0.0475 
2 — 0.1671 — 0.0416 0.1761 — 0.0006 — 0.0905 0.3039 . -—0.0555 — 0.0564 
3 — 0.1059 — 0.0543 0.3654 — 0.0024 —0.1052 0.4989 —0.0532 — 0.0531 
4 — 0.0836 — 0.0658 0.4565 — 0.0039 — 0.1109 0.5650 —0.0510 — 0.0470 
5 — 0.1530 — 0.0758 0.3273 — 0.0067 —0.1092 0.3940 — 0.0474 — 0.0409 
6 — 0.1345 — 0.0866 0.3952 — 0.0076 —0.1175 0.4629 — 0.0430 —0.0357 
7 — 0.1601 — 0.1000 0.3816 — 0.0096 — 0.1283 0.4335 — 0.0405 —0.0314 
8 — 0.2098 — 0.1108 0.3148 — 0.0099 — 0.1400 0.3586 — 0.0365 —0.0279 
9 — 0.2488 —0.1267 0.2883 — 0.0127 — 0.1482 0.3119 — 0.0362 — 0.0251 

10 — 0.3246 —0.1472 0.2158 — 0.0125 — 0.1643 0.2277 — 0.0330 —0.0227 

Endowment Calibrated to Dividends: 

i 0.0147 — 0.0129 0.6047 — 0.0035 — 0.1259 0.8992 — 0.1215 —0.1171 
2 — 0.1671 — 0.0254 0.1099 — 0.0039 — 0.1058 0.2943 — 0.0865 — 0.0840 
3 — 0.1059 — 0.0383 0.3095 — 0.0031 — 0.0948 0.4681 — 0.0645 — 0.0609 
4 — 0.0836 — 0.0472 0.4055 — 0.0045 — 0.0985 0.5379 — 0.0511 — 0.0473 
5 — 0.1530 — 0.0587 0.2914 — 0.0051 — 0.1030 0.3827 — 0.0434 — 0.0386 
6 — 0.1345 — 0.0756 0.3813 — 0.0064 — 0.1050 0.4322 — 0.0393 — 0.0325 
7 — 0.1601 — 0.0919 0.3617 — 0.0069 — 0.1141 0.4052 — 0:0356 — 0.0282 
8 — 0.2098 —0.1078 0.3150 — 0.0078 — 0.1269 0.3451 — 0.0342 — 0.0248 
9 — 0.2488 —0.1275 0.2896 — 0.0101 — 0.1387 0.3124 —0.0311 — 0.0222 

10 — 0.3246 —0.1410 0.2080 — 0.0108 — 0.1589 0.2321 — 0.0289 — 0.0200 

Endowment Calibrated to GNP: 

1 0.0147 — 0.0187 0.5862 — 0.0022 — 0.0602 0.6659 — 0.0538 — 0.0509 
2 — 0.1671 — 0.0577 0.2209 — 0.0070 — 0.1094 0.3584 — 0.0656 — 0.0597 
3 — 0.1059 — 0.0667 0.3988 — 0.0084 — 0.1181 0.5299 — 0.0631 — 0.0557 
4 — 0.0836 — 0.0731 0.4734 —0.0117 —0.1170 0.5814 — 0.0594 — 0.0489 
5. —0.1530 — 0.0822 0.3337 — 0.0121 — 0.1188 0.4202 0.0544 — 0.0424 
6 — 0.1345 — 0.0951 0.4132 — 0.0126 — 0.1227 0.4739 — 0.0506 — 0.0369 
7 — 0.1601 — 0.1041 0.3840 — 0.0128 — 0.1322 0.4413 — 0.0454 — 0.0324 
8 — 0.2098 — 0.1160 0.3213 — 0.0133 — 0.1409 0.3665 — 0.0424 — 0.0288 
9 — 0.2488 — 0.1269 0.2819 — 0.0154 —0.1458 0.3133 — 0.0406 — 0.0258 

10 — 0,3246 — 0.1433 0.2082 — 0.0136 — 0.1657 0.2262 — 0.0403 — 0.0234 








Notes: Under linear utility, y — 0. Under concave utility, y is set to — 1.7 for the consumption model, — 1.4 for the 
dividend model, and —1.6 for the GNP model. 8 —0.97 throughout. Column 1: Horizon, in years, Column 2: 
Regression coefficients of historical Standard and Poors' returns. Column 3: Median of Monte Carlo distribution of 
regression coefficients for 116 equilibrium returns generated with linear utility. Column 4: Percentage of Monte Carlo 
distribution having values less than the value in column 3. Column 5: Median of Monte Carlo distribution for 
regression coefficients of 1160 equilibrium returns generated with linear utility. Column 6: Median of Monte Carlo 
distribution of regression coefficients of 116 equilibrium returns generated with concave utility. Column 7: Percentage 
of Monte Carlo distribution with values less than the value in column 6. Column 8: Median of Monte Carlo 
distribution for regression coefficients of 1160 equilibrium returns generated with concave utility. Column 9: 
Population regression coefficients of equilibrium returns. 


The model matches the data more closely 
when the utility function is concave. From 
column 7 it can be seen that the regression 
coefficient for one-year returns for the divi- 
dend model lies near the 95 percent confi- 
dence bound. At the remaining horizons in 


the dividend model and at all horizons for 
the consumption and GNP models, the re- 
gression coefficients computed with the data 
uniformly lie within a 60 percent confidence 
interval of the median of the Monte Carlo 
distribution. The distance between the small 
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TABLE 7—THE SOURCE OF THE NEGATIVE SERIAL CORRELATION, 
MEDIAN OF DISTRIBUTION OF VARIANCE RATIOS OF RETURNS FOR 


MODEL CALIBRATED TO CONSUMPTION 


GRW MSM GRW 
y=0 y=0 y=0 
k Actual  T-116 T=116 T=1160 
a) (2) Q (4) (5) 
2 1.0137 0.9913 0.9865 0.9983 
(0.50) (0.58) (0.70) 
3 0.8664 0.9732 0.9669 0.9953 
(0.21) (0.30) (0.00) 
4 0.8351 0.9605 0.9385 0.9944 
(0.22) (0.33) (0.00) 
5 0.7978 0.9458 0.9115 0.9937 
(0.21) (0.32) (0.00) 
6 0.7459 0.9366 0.8926 0.9904 
(0.17) (0.29 (0.00) 
7 0.72259 0.9208 0.8811 0.9922 
(0.20) — (029) (0.00) 
8 0.7363 0.9062 0.8678 0.9892 
(0.25) (0.33) (0.00) 
9 0.7102 0.8887 0.8527 0.9884 
(024) — (0.32) (0.00) 
10 0.7242 0.8817 0.8268 0.9858 
(0.29) | (037) (0.00) 
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MSM GRW MSM GRW MSM 
y=0 y7-17 y=-17  ys5-17 y=-17 
T-1160 T=116 T=116 T=1160  T--1160 
(6) 0) (8) (9) (10) 
1.0004 0.9907 0.9462 0.9991 0.9523 
(0.62) (0.60) (0.68) (0.69) (0.90) 
1.0005 0.9758 0.8941 0.9964 0.9212 
(0.02) (0.21) (0.45) (0.00) (0.22) 
0.9977 0.9587 0.8511 0.9955 0.9006 
(0.02) (0.22) (0.47) (0.00) (0.22) 
0.9989 0.9474 0.8181 0.9927 0.8812 
(0.01) (0.22) (0.47) (0.00) (0.18) 
0.9950 0.9302 0.7782 0.9921 0.8707 
(0.00) (0.18) (0.45) (0.00) (0.10) 
0.9953 0.9227 0.7576 0.9912 0.8613 
(0.00) (0.18) (0.45) (0.00) (0.09) 
0.9912 0.9077 0.7380 0.9908 0.8528 
(0.01) (0.18) (0.50) (0.00) (0.15) 
0.9876 0.8938 0.7264 0.9874 0.8474 
(0.01) (0.24) (0.48) (0.00) (0.12) 
0.9869 0.8718 0.7083 0.9874 0.8391 
(0.01) (0.29) (0.52) (0.00) (0.17) 


Note: GRW is the geometric random walk model. MSM is the Markov switching model. 


sample medians and the actual estimates tend 
to be smaller when agents have concave util- 


ity. 


C. Mean Reversion, Small Sample Bias, 
and Consumption Smoothing 


There are three features of our model that 
contribute to apparent mean reverting asset 
prices. They are (1) the nonlinearity of the 
endowment process (i.e., the presence of S,), 
(2) pure small sample bias (T'— 116), and (3) 
consumption smoothing (y « 0). The role of 
consumption smoothing can be deduced 
from a comparison of columns 4 and 7 in 
Tables 5 and 6. For the values of y that we 

. Work with, introducing concavity to the util- 
ity function shifts roughly an additional 15 
percent to 20 percent of the variance ratio 
distribution and 1 percent to 10 percent of 
the regression coefficient distribution below 
the data. Table 7 reports additional results 
that allow us to separate these three influ- 


ences and quantify the contribution of each . 


in generating negatively serially correlated 
returns. While the table only includes evi- 
dence for the variance ratio statistic when 
the model is calibrated to consumption, these 
results are representative of the other cases 
that we do not report to economize on space. 

We compare our results to those obtained 
when the endowment is assumed to follow a 
geometric random walk with drift and iid. 
innovations with variance set equal to the 
sample values obtained from the consump- 
tion data. The table reports the median value 
of a Monte Carlo experiment with 10,000 
replications, along with the percentage of the 
empirical distribution that lies below the 
value computed from the data. 

To isolate the effects of the nonlinear en- 
dowment process, we first examine the impli- 
cations for linear utility. A comparison of 
columns (3) and (4) reveals that the Markov 
switching model makes the small sample bias 
larger than it is in the case of the geometric 
random walk. While the variance ratio statis- 
tics are very near their population values of 
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TABLE 8— SERIAL CORRELATION IN EXCESS RETURNS, EQUILIBRIUM RETURNS 
IMPLIED BY THE MODEL CALIBRATED TO CONSUMPTION 

Linear Utility Concave Utility 
T —116 T=116 
k Actual Median p-Value Median p-Value Population 
Q) Q) @) 4) (5) (6) () 
Variance Ratios: 
2 1.0492 0.9866 0.7048 0.9878 0.7423 0.9993 
3 0.9300 0.9600 0.4712 0.9729 0.4366 0.9988 
4 0.9121 0.9325 0.5080 0.9514 0.4678 0.9985 
5 0.8639 0.9124 0.4721 0.9424 — 0.4184 0.9982 
6 0.7848 0.8998 0.3937 0.9231 0.3308 0.9981 
7 0:7246 0.8793 0.3543 0.9044 0.2846 0.9980 
8 0.7123 0.8577 0.3736 0.8968 0.3021 0.9979 
9 0.7038 0.8421 0.3924 0.8854 0.3245 0.9978 
10 0.7148 0.8345 0.4363 0.8665 . 0.3842 0.9977 
Regression Coefficients: 
l 0.0511 . — 0.0133 0.6809 — 0.0114 0.7209 —0.692e — 03 
2 —0.1193 — 0.0407 0.2871 — 0.0281 0.2229 —0.783e — 03 
3 —0.1021 — —0.0558 0.3834 — 0.0387 0.3330 —0.714e — 03 
4 — 0.1230 — 0.0663 0.3609 — 0.0499 0.3224 —0.616e — 03 
5 —0.0973  —0.0778 0.4504 — 0.0654 0.4268 —0.527e — 03 
6 0.0557 —0.0894 0.7838 — 0.0816 0.7569 —0.454e — 03 
7 0.1094  —0.0952 0.8470 — 0.0956 0.8401 —0.396e — 03 
8 0.0904 | —0.1090 0.8298 — 0.1033 0.8226 — 0.349e — 03 
9 0.1202 —0.1248 0.8618 —0.1225 0.8527 —0.312e — 03 
10 0.0863  —0.1436 0.8331 —0.1342 0.8133 —0.281e — 03 


Notes: Column 1: Horizon, in years. Column 2: Statistics computed using historical 
excess returns. Column 3: Median of Monte Carlo distribution of statistics for 116 
equilibrium excess returns generated with linear utility. Column 4: Percentage of Monte 
Carlo distribution having values less than the value in column 3. Column 5: Median of 
Monte Carlo distribution of statistics of 116 equilibrium excess returns generated with 
concave utility. Column 6: Percentage of Monte Carlo distribution with values less than 
the value in column 5. Column 7: Population statistics for equilibrium excess returns. 


1.0 for T=1160, the distribution shifts far- 
ther down in a sample of T —116 when the 
endowment follows the Markov switching 
process. The results in column (5) show that 
for the Markov switching model, roughly 10 
percent more of the empirical distribution 
lies below the data than for the geometric 
random walk model reported in column (1). 

The effect of consumption smoothing 
alone can also be deduced from the table. 
Columns (7) and (9) of Table 7 replicate the 
well known result that if the endowment 
process follows a geometric random walk, 
concave utility cannot produce mean rever- 
sion. This can also be seen from our solution 
for returns in equation (13). When there is 
no switching of states, the function p is 
constant. Because the e,'s are independent, 


returns are serially uncorrelated regardless of 
the elasticity of intertemporal substitution. 
Columns (7) and (8) report on the impor- 
tance of the interaction between consump- 
tion smoothing and the Markov switching 
model. Comparing column (8) to column (3) 
shows the importance of the combined im- 
pact of these two effects in small samples. By 
comparison with the model calibrated to the 
geometric random walk with linear utility, 
the nonlinear model with concave utility 
yields roughly an additional 25 percent of 
the empirical distribution below the data 
when 7-116. Furthermore, the results in 
column (10) show that even in large samples, 
consumption smoothing implies negative se- 
rial correlation in returns when the endow- 
ment follows the Markov switching process. 
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D. Excess Returns 


Table 8 reports results on mean reversion 
in excess returns when the model is cal- 
ibrated to consumption.” Poterba and 
Summers also find evidence of mean rever- 
sion in excess returns. The top panel of the 
table shows the results for the variance ratio 
statistics. Column (2) of table reports the 
values computed from historical excess re- 
turns. These are only slightly larger than the 
values for real equity returns reported in 
Table 5. Although the model generates 
time-varying excess returns for the parame- 
ter values that we consider (i.e., the implied 
variance of excess returns are not zero), they 
do not exhibit very much serial correlation. 
However, the estimates in column (2) still lie 
well within the standard 95 percent confi- 
dence intervals of the median of the Monte 
Carlo distributions. 

The bottom panel of the table reports the 
results for the regression coefficients al- 
though Fama and French do not report re- 
sults for excess returns. From column (2), it 
can be seen that the coefficients computed 
from historical excess returns are substan- 
tially different from those computed with 
real equity returns, and are strikingly differ- 
ent from any of the results reported by Fama 
and French. Rather than becoming more 
and more negative as the horizon lengthens, 
they become positive at six years, and grow 
larger and larger. Nevertheless, because of 
small sample bias in the calculation of the 
regression coefficients at long horizons, the 
data still lie within a 90 percent confidence 
interval of the median of our Monte Carlo 
distributions. 


M. Conclusion 


This paper demonstrates that the findings 
of Poterba and Summers (1988) and Fama 
and French (1988), that stock prices are mean 
reverting, are consistent with an equilibrium 
model of asset price determination. The 
question we addressed was whether the em- 
pirically observed serial correlation proper- 
ties of stock returns can be generated by an 


20The results with the model calibrated to dividends 
and GNP are similar and are not reported to save space. 
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equilibrium model of asset pricing. Monte 
Carlo distributions of Poterba and Summers’ 
variance ratio statistics and Fama and 
French's long-horizon return regression co- 
efficients are generated using equilibrium re- 
turns derived from the Lucas (1978) model 
and Hamilton's (1989) Markov switching 
process governing consumption, dividends, 
and GNP. When economic agents care about 
smoothing their consumption, the equilib- 
rium model implies that stock prices are 
mean reverting. It is possible that this is 
what was detected by Poterba and Summers 
and Fama and French. However, even with a 
linear utility function, the variance ratios 
and regression coefficients calculated with 
the historical Standard and Poors' returns 
data are not significantly different from the 
median of our Monte Carlo distribution. This 
latter result underscores the problem that 
116 annual observations do not contain very 
much information when computing statistics 
based on returns at five- or ten-year hori- 
zons. Both the bias and the size of confi- 
dence intervals generated by sampling varia- 
tion grow as the effective sample size gets 
smaller. The implication for testing the null 
against local alternatives is complementary 
to Summers' (1986) point that most tests of 
market efficiency have virtually no power 
against what he calls fad alternatives. Since 
we have shown that a properly constructed 
equilibrium model can generate rational as- 
sets prices that exhibit a good deal of nega- 
tive serial correlation, it follows that, given 
the available data, the test of any fad model 
will have very little power against the rather 
wide class of equilibrium alternatives. More 
precise estimates and more powerful tests 
can only come through the passage of time 
and not by sampling the data more fre- 
quently.” If there had been a well-function- 
ing asset market since the time of the 
Norman invasion (A.D. 1066) and we had all 
the necessary price and dividend data, then 
we might begin to distinguish among some 
of the competing theories. We conclude that 


"That is, in computing the autocorrelation of ten- 
year returns, what is needed is more ten-year time 
periods and not weekly or daily observations. All we 
can do is wait. 
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the evidence drawn from variance ratios and 
return regression coefficients are not suffi- 
cient to rule out equilibrium models. It is 
important to emphasize, however, that these 
results do not prove that the equilibrium 
model is true, since it is impossible to prove 
rationality or irrationality. 


DATA APPENDIX 


The dividend data are annual observations from the 
Standard and Poors’ index from 1871 to 1985 deflated 
by the CPI. This is the Standard and Poors’ historical 
data used by Poterba and Summers. Observations on 
returns and the CPI from 1871 to 1926 are from Jack 
Wilson and Charles Jones (1987) and Wilson and Jones 
(1988), respectively. From 1926 to 1985, the data are 
from Roger Ibbotson and Rex Sinquefeld (1988). Ob- 
servations on nominal dividends are those used by 
Campbell and Shiller (1987). We use these data as a 
benchmark because they represent the longest available 
time-series, and we believe that the characteristics of 
these data are representative of equity returns and divi- 
dend disbursements in general. Also, the Standard and 
Poors’ index is one of the data sets used by Poterba and 
Summers, so a direct comparison can be made with 
some of their results. We follow both Poterba and 
Summers and Fama and French in deflating by the CPI. 

The risk-free rate series is the ex post real return to 
holding a one-year U.S. Treasury security, or the equiv- 
alent, computed by subtracting realized inflation from 
the nominal interest rate. The nominal interest rate 
series was computed using data from four separate 
sources. The constructed series is intended to come as 
close as possible to a measure of the yield to maturity 
on a one-year U.S. security. For the period from 1920 to 
1929, the basic data are drawn from the column giving 
the compound annual return from rolling “3- to 6-month 
Treasury notes and certificates,” Table No. 122, page 
460, of the Banking and Monetary Statistics of the United 
States. For a given year, the one-year yield was com- 
puted by assuming that the security was rolled over in 
July. For the period from 1930 to 1950, the data are the 
one-year yield for December of the previous year re- 
ported in the appendix to Stephen Cecchetti (1988). 
From 1951 to 1985, the data are the one-year zero 
coupon yield reported in J. Huston McCulloch (1988). 
For the period from 1871 to 1919, there is no direct 
information on government yields. To obtain a consis- 
tent series, we began by regressing the one-year yields 
from 1920 to 1987 (as described above) on the commer- 
cial paper rate constructed by Campbell and Shiller 
(1987). Since the commercial paper rate series extends 
back to 1871, we were able to estimate the implied 
government yield as the fitted values from this regres- 
sion. 

The real GNP data are constructed by combining 
data from 1869 to 1928 from Christina Romer (1989) 
with data from 1909 to 1928 from Romer (1985), and 
observations from 1929 to 1985 from the National 
Income and Product Accounts. 

The consumption data are constructed by splicing 
the Kendrick consumption series reported in Nathan 
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Balke and Robert Gordon (1986), from 1889 to 1928, 
with the National Income and Product Accounts data 
from 1928 to 1985. This series is the longest available 
series on aggregate personal consumption expenditure 
we are aware of. 

In order to express quantities in per capita terms, we 
divided each time-series by annual population esti- 
mates. The estimates used are as follows. From 1869 to 
1938 the data are from the Historical Statistics of the 
United States, Series A7 from 1869 to 1928 (with the 
data in footnote 1 for 1917 to 1919), and Series A6 from 
1929 to 1938. From 1938 to 1985 the data are from the 
Economic Report of the President, 1989, Table B-31. 
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is the European Community an Optimal Currency Area? 
Optimal Taxation Versus the Cost of Multiple Currencies 


By MATTHEW B. CANZONERI AND CAROL ANN ROGERS* 


We propose a view of optimal currency areas that is based on the principles of 
public finance. Inflation taxes are distortionary, and an optimal spreading of tax 
distortions may require high inflation in one region and low inflation in another. 
Each region would need its own currency to do this. On the other hand, multiple 
currencies imply valuation and currency conversion costs, which impede trade 
between regions. This tradeoff is explored in the context of the European 
Community’s debate over a common currency, using a two-country variant of 
Lucas and Stokey’s cash-in-advance model. (JEL 430) 


The European Community seems to be 
moving rapidly toward economic integra- 
tion, Barriers to trade in goods and financial 
assets are to come down by 1992. However, 
the fate of the European Monetary System 
seems to be in some doubt, especially in light 
of the elimination of capital controls. With- 
out capital controls, the maintenance of fixed 
exchange rates may require more conver- 
gence in monetary policy and inflation rates 
than has yet been observed. The “Delors 

"Report" has gone so far as to propose a 
common currency for the EC.! But is the EC 
an optimal currency area? We would like to 
suggest a new way of looking at this ques- 
tion. 

Over the last fifteen years, arguments for 
fixed ‘exchange rates have focused on stabi- 
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Washington, D.C. 20057. We wish to thank Hans Gen- 
berg, Kent Matthews, Bennett McCallum, Patrick Min- 
ford, Anne Sibert, Alan Stockman, Lars Svensson, Car- 
los Vegh, Paul Wood, and an anonymous referee for 
their helpful comments. We would also like to thank 
seminar participants at the Center for Economic Re- 
search (Tilburg), the Institute for International Eco- 
nomic Studies (Stockholm), the National Bureau of 
Economic Research, the University of Chicago, Eras- 
mus University, Georgetown University, the University 
of Houston, Indiana University, and New York Univer- 
sity. However, we alone are responsible for what fol- 
lows. 

"See J acques Delors (1989). Jacques Delors is Presi- 
dent of the European Commission. The Committee for 
the Study of Economic and Monetary Union prepared 
the Delors report for the European Council. 
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lization problems created by sticky prices or 
on inflation problems caused by policymak- 
ers’ lack of credibility.? Sticky prices were 
thought to imply adjustment costs, and the 
case for fixed rates was analyzed along the 
lines of William Poole (1970) or Canzoneri 
and Gray (1985). Alternatively, it was ar- 
gued that “weak” policymakers could com- 
bat inflationary expectations by tying their 
currencies to a more stable currency, and the 
case for fixed rates was analyzed along the 
lines of Robert Barro and David Gordon 
(1983). In this paper, however, we eschew all 
problems associated with sticky prices or the 
policymakers’ credibility. We do not mean to 
suggest that these problems are unimpor- 
tant; we simply want to focus attention on a 
neglected aspect of the optimal currency area 
question. 

Our purpose is to present a view of opti- 
mal currency areas that is based on the 
principles of public finance.) Seignorage is 


?Hans Genberg (1988) provides an excellent discus- 
sion of this literature and relates it to earlier work; 
however, he focuses almost exclusively on small coun- 
tries. Matthew Canzoneri and Jo Anna Gray (1985) and 
Matthew Canzoneri and Dale Henderson (1988) extend 
the discussion to large countries, using a game theoretic 
approach. 

3We are following a research agenda suggested by 
Robert Lucas (1986). Indeed, a macroeconomic/public 
finance literature on the international aspects of fiscal 
policy is already developing; see Patrick Kehoe (1987), 
Jacob Frenzel and Assaf Razin (1988), Carlos Vegh and 
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one tax among many that can be imposed to 
raise revenue. As Table 1 shows, southern 
European countries raise between 5 and 10 
percent of their revenues through seignorage, 
while northern European countries hardly 
depend upon seignorage at all.* Why do tax 
structures differ across Europe? The public 
finance literature provides one answer. Tax 
rates should be set to spread the distortions 
that taxes create; the marginal disutility from 
the last revenues raised should be equalized 
across .all revenue sources. Optimal tax rates 
will depend upon characteristics of the activ- 
ities being taxed, including collection costs; 
goods and services that are easily taxed in 
one. region may be very difficult to tax in 
another. There is no reason to think that the 
optimal inflation tax for Germany will be 
the same as that for Italy. For that matter, 
there is no reason for the inflation tax in 
southern Italy to be the same as that in 
northern Italy. In any case, the implications 
of the principles of public finance are clear. 
Regions that require the same inflation tax 
may form an optimal currency area. 

Since the characteristics that determine 
optimal tax rates may vary widely across 


Pablo ‘Guidotti (1988) and Roberto Chang (1988). Gre- 
gory Mankiw (1987) and Vittorio Grilli (1989) report 
. Some empirical evidence that U.S. and European infla- 
tion rates have been used for intertemporal tax spread- 
ing; Grilli (1989) also finds some evidence for the 
intratemporal tax spreading discussed here, if allowance 
is made for constraints imposed by the EMS. 

Rudiger Dornbusch (1988a,b) has already argued 
that “countries for whom the efficient tax structure 
implies the use of an inflation tax...should not merge 
with others for whom zero inflation is the policy objec- 
tive.” He suggests that even the small loss of seignorage 
revenue that has resulted from the convergence of Euro- 
pean inflation rates can have explosive consequences for 
the level-of debt when the real rate of interest exceeds 
the rate of growth. 

Inflation rates did fall in southern Europe during 
the EMS period: However, the southern European 
countries also increased reserve requirements during 
this period. This broadened the seignorage tax base and 
maintained revénues to some extent; see Allan Drazen 
(1988) and Francesco Giavazzi (1988). 

It seems unlikely that the southern European coun- 
tries will be able to maintain their higher reserve re- 
quirements once banking services are allowed across 
borders. The 1992 agenda may put more fiscal pressures 
on the southern European countries than we have seen 
to date. Again, the comments of Dornbusch in fn. 3 are 
relevant. 
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TABLE 1 — SEIGNORAGE AS A PERCENT OF 
TAX REVENUE, 1979-1988 











Greece l 9.1 Belgium 0.4 
Italy 6.2 France 5s x3 
Portugal 11.9 Germany 0.8 


Spain 5.9 UK. 0.5 


Note: This table was drawn from Drazen (1988), Table 
1. Drazen’s table was in turn drawn from tables in 
Giavazzi (1988). 


countries, or even across regions within a 
country, one might expect that the public 
finance view will call for many different in- 
flation taxes and many small currency areas. 
As Robert Mundell (1961) has observed, this 
does not accord with a long held view that 
trade in many currencies is intrinsically 
costly, so that currency areas should be large. 
Mundell writes (p. 662): 


..It will be recalled that the older ^ 
economists of the nineteenth century 
were internationalists and generally fa- 
vored a world currency. ...Mill, like 
Bagehot and others, was concerned 
with the costs of valuation and mcney- 
changing,..., and it is readily seen that 
these costs tend to increase with the 
number of currencies. Any given money 
qua numeraire or unit of account ful- 
fills this function less adequately if the 
prices of foreign goods are expressed in 
terms of foreign currency and must 
then be translated into domestic cur- 
rency prices. Similarly, money in its 
role of medium of exchange .is .less 
useful if there are many currencies; 
although the costs of currency conver- 
sion are always present, they loom ex- 
ceptionally large under inconvertibility 
or flexible exchange rates. 


Similar notions have been expressed by those 
who favor a continuation of the EMS or the 
creation. of a single European currency, 
though the actual cost of multiple currencies. 
has never been measured.? 


5Some have tried to measure the effects of exchange 
rate volatility on trade flows; see Martin Bailey and 
George Tavlas’ (1988) recent summary of this literature. 
Few have found these effects significant. However, ex- 
change rate volatility would not seem to be a good 
proxy for the concerns Mundell and others had in mind. 
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Thus, we come to the factors we want to 
consider in drawing the boundaries of an 
optimal currency area. Multiple currencies 
allow separate inflation rates in each country 
or region, and this flexibility may well pro- 
duce tax spreading efficiencies. However, 
these benefits must be weighed against the 
valuation and currency conversion costs that 
come with a multiplicity of currencies. 

To illustrate this tradeoff, we construct a 
two-country model of what we call the “EC.” 
We give “Italy” tax spreading reasons to opt 
for high inflation; we model “Germany” as a 
low inflation country. However, we also 
model valuation and currency conversion 
costs, costs that can be eliminated if Italy 
and Germany adopt a common currency. 
The EC is an optimal currency area if the 
costs of multiple currencies outweigh the 
benefits of tax spreading. 

In Section I, we outline our model of the 
EC. The model is essentially a two-country 
version of the model used by Robert Lucas 
(1987) and Robert Lucas and Nancy Stokey 
(1987). Each country is populated -by in- 
finitely lived households that consume home 
cash goods, home credit goods, and foreign 
credit goods; the cash good is subject to a 
cash in advance constraint. Each country has 
a given level of public spending that must be 
financed through seignorage or a value added 
tax. We model only one asymmetry between 
the countries: Italy has a black market that 
cannot be taxed directly. Germany has no 
black market, and all German production is 
subject to the VAT. The black market is 
modeled as a cash good. So, Italy can use 
inflation to tax its black economy indirectly. 

The model comes in two versions. In the 
first, each country has its own currency, and 
doing business with two currencies wastes 
resources; it costs: more to produce a credit 
good that is sold to a foreigner. The extra 
costs reflect the time and effort it takes to 
quote prices in a foreign currency and to 
convert the proceeds back into domestic cur- 


This way of modeling black markets was suggested 
to us by Paul Wood in the context of Latin American 
countries. Robert Barro (1987) has also noted that the 
inflation tax may be the only way of taxing the black 
economy. 


CANZONERI AND ROGERS: MULTIPLE CURRENCIES COST 421 


TABLE 2— THE IMPORTANCE OF THE BLACK ECONOMY 


Percent of GNP Percent of: 
in Italy GNP in Germany 
Source: 
Kent Matthews 15 to 20 7 to 10 
de Grazia (1984) 20 to 25 2 
Contini (1982) 14 to 20 - 
Langfeldt (1983) - 5to10 


Note: Matthew's estimates, based on his reading of a 
number of studies, were given to us in private conversa- 
tion. : 


rency. In the second version of the model, 
Europe has a common currency, and selling 
to a foreigner entails no additional costs. 

In Section II, we will give a formal defini- 
tion of the optimal currency area problem. 
The EC policymaker's problem is to choose 
the number of currencies (and to set the 
available tax rates) so as to maximize a 
weighted sum of the utilities of Italian and 
German households. 

In Section III, we find the best outcome 
that can be achieved if two currencies are 
retained. With two currencies, the EC poli- 
cymaker has enough taxes to achieve an 
efficient outcome. The optimal tax rates have 
to make the relative price of cash and credit 
goods in each country equal to the rate of 
transformation in production. In Italy, the 
inflation tax on cash goods (the black econ- 
omy) must balance the VAT on credit goods. 
Germany's VAT is not distortionary, since it 
falls on both cash goods. and credit goods. 
However, the relative price on cash and credit 
goods will be distorted in Germany if the 
nominal interest rate is greater than zero, 
since money must be held instead of bonds 
to purchase a cash good. The optimal infla- 
tion rate in Germany is equal to minus the 
rate of time preference, as Friedman has 
suggested. From a public finance point of 
view, the EC is not an optimal currency area. 
Any EMS-like constraint that makes Italian 
and German inflation rates converge must 
decrease welfare in one country or the other. 

The relevance of our modeling of the tax 
spreading problem depends on the impor- 
tance of the black market in Italy. Table 2 
presents estimates of the relative size of the 
black economy in various countries; the esti- 
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mates vary widely, but Italy’s black economy 
appears to be two or three times the size of 
black economies in other European coun- 
tries. One might conclude that Italy does 
indeed have more reason to resort to the 
inflation tax. Of course, we do not mean to 
suggest that the black economy is Italy’s 
only reason for adopting a high inflation 
rate. Similar results would obtain if Italy 
simply faced higher tax collection costs on 
cash goods than Germany. And more gener- 
ally, Italy could differ from Germany in any 
number of ways that would result in a higher 
optimal inflation rate. 

In Section IV, we ask if the EC edid be 
better off with a common currency. The costs 
associated with a multiplicity of currencies 
would be eliminated, but Italy and Germany 
could no longer run the independent infla- 
tion rates they need for efficient tax spread- 
ing. The answer one might expect is that it 
depends on the size of the valuation and 
currency conversion costs: if the costs are 
high, then the EC is an optimal currency 
area; if the costs are small, then separate 
currencies should be maintained for tax 
spreading. We show that this answer is cor- 
rect as long as home goods are not close 
substitutes for imported goods. High valua- 
tion and currency conversion costs drive out 
international trade. If home goods are close 
substitutes for imports, then the switch to 
home goods involves no great loss in utility; 
in this case, it may pay to keep the tax 
spreading flexibility afforded by multiple 
currencies, even if the extra costs do drive 
out trade. 

While these arguments are relatively 
straightforward, our model of the EC is 
rather complex. This complexity makes more 
detailed results hard to obtain. Therefore, in 
Section V, we present some numerical exam- 
ples that suggest the following propositions: 
(1) Very small valuation and currency con- 
version costs can make the EC an optimal 
. currency area. (2) High levels of government 
spending make the EC less likely to be an 
optimal currency area. (3) Openness makes 
the EC more likely to be an optimal cur- 
rency area. While the general validity of 
these propositions has yet to be demon- 
strated, our examples are suggestive and the 
propositions are plausible. 


JUNE 1990 


Finally, in Section VI, we discuss the limi- 
tations of our analysis. We also make sug- 
gestions for future research. 


I. A Simple Model of the EC 


The model comes in two versions. We 
begin with the multiple currency version, 
and then we explain how it has to be modi- 
fied to get the single currency version. 

There are only two countries in our styl- 
ized model of the EC: Italy and Germany. 
Italy is inhabited by identical, infinitely lived 
households whose utility depends upon con- 
sumption of an Italian cash good, x, an 
Italian credit good, y, and a German credit 
good, z: 


t 


u(x,, Jo z,). 








D v-E 


= 1+8 


The function u(x, y, z) is increasing, strict- 
ly concave and twice continuously differ- 
entiable; in addition, ĝu(0, y, z)/dx = 
ðu(x,0, z)/3y = Qu(x, y,0)/0z = 00. Ger- 
many is similarly populated by identical, 
infinitely lived households whose utility’ de- 
pends on consumption of a German cash 
good, x*, a German credit good, y*, and an 
Italian credit good, z*: : 


oo 


a9 v-Y 


u(xf, yt zě). 





1 t 
3L1 6 | 
*s denote German variables, but the utility 
functions in Italy and Germany are the same. 

Households in each country are endowed 
with n units of labor each period. This labor 
can be converted one for one into home cash 
goods or home credit goods. Since Italy and 
Germany have separate currencies, produc- : 
ers incur extra resource costs when they sell 
to foreigners: c >1 units of labor are needed 
to produce one credit good sold to a for- 
eigner. Thus, the resource constraints for 
Italy and Germany are 


(2) x,+y%ta*kt+ gen 


and x*¥ + y*+ez,+¢ =n. 
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Government spending, g, is exogenously de- - 


termined and the same in Italy and Ger- 
many. 

The cash in advance framework and the 
distinction between cash goods and credit 
goods are borrowed from Lucas.’ Each pe- 
riod begins with a financial exchange and 
ends with a goods market. In the financial 
exchange, households pay for the credit 
goods they purchased last period and ac- 
quire the cash they will need in the goods 
market that follows; governments collect 
taxes, pay interest on debt, and issue new 
debt (in the form of money or bonds). In the 
goods market, households divide into work- 
er-shopper pairs. Workers produce cash 
goods and credit goods; shoppers visit the 
workers of other households and buy goods. 
Shoppers must pay cash for the cash goods; 
credit goods are invoiced in the shopper’s 
currency for payment in the next financial 
exchange. 

Now we can be more explicit about the 
extra costs workers incur when there are two 
currencies. Credit goods are, by assumption, 
invoiced in the buyer’s currency, and in the 
next period the payment for them comes in 
the buyer's currency.5 A worker incurs two 
additional costs in selling credit goods to a 
foreigner. First, the worker has to value the 
good in another unit of account. Second, the 
foreign currency received must be converted 
into domestic currency, so that the house- 
hold can purchase domestic cash goods. Both 
of these activities take time and effort, so we 
have modeled them as direct resource costs. 
They are our model’s incarnation of the val- 
uation and currency conversion costs de- 
scribed by Mundell and others. 

Certain arbitrage conditions are immedi- 
ate, and it will facilitate our discussion to 


"See Lucas (1987) or Lucas and Stokey (1987) and 
the references therein for a more detailed description of 
the framework outlined in this paragraph. We have 
extended their structure to a two-country setting. 

Here we are applying Elhanan Helpman and Assaf 
Razin's (1984) "buyer's system" to the invoicing of 
credit goods. Some restriction on the use of currencies is 
required if exchange rates are to be determinate; this is 
a reflection of the fact that there is no deep theory of 
currency substitution embodied in the cash in advance 
paradigm. 
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take advantage of them. A worker will charge 
the same price for cash goods and home 
credit goods, since each takes one unit of 
labor to produce and the sale of either re- 
sults in cash that cannot be used until the 
next financial period. Let p, be the lira price 
of the Italian goods, and let p* be the mark 
price of the German goods. It takes c units 
of labor to produce a credit good for a 
foreigner, and the worker wants sales to all 
customers to result in the same cash value in 
the next financial exchange; so, letting e, be 
the lira price of marks, 


(3) Př = e, Pic and p;* = ( Pi/ei1) c 


are the prices charged to Italian and German 
shoppers for foreign credit goods. 

An Italian household's budget constraint 
for the financial exchange is 


(4) m, by, e,bg, + AAI Yt 9:21) 
S R, bg + e R$ Gb, 
+ P. n— 9,17 d, 12,-1)- 


On the left-hand side, we have the house- 
hold's acquisition of money and bonds and 
its tax payments. “J” subscripts denote Ital- 
ian government bonds, which pay the gross 
nominal return R, and “G” subscripts de- 
note German government bonds, which pay 
R*. The Italian government levies a value 
added tax, 8,, on the household’s purchases 
of domestic and foreign credit goods; q, 
(= p// p,) is the relative price of the foreign 
credit good. The black economy is embodied 
in the Italian cash good; market transactions 
go unrecorded and untaxed. On the right- 
hand side, we have the gross return on bonds 
purchased last period, plus receipts from last 
period’s sales, minus payments for credit 
goods invoiced last period. 

German households face a similar budget 
constraint: 


(4*) mf + bit /e, bg, + q prn 
S R,_bf_1/e,+ RIEGO. 


T peat n= VE v q*uzti) . 
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q* (= pf */pf) is the relative price.of for- 
eign credit goods; note that q, = c?/g*. The 
only difference between (4) and (4*) is the 
tax structure. There is no black economy in 
Germany. The German government levies a 
tax, 7,*, on all German value added. 

In deriving these household constraints, 
we have assumed that the cash in advance 
constraints were binding in the previous pe- 
riod; that is, 


. (5) m,—p,x, and mf-pfxf. 


_If R, and R* are greater than one, then 
money is a dominated asset, and the con- 
' straints will indeed be binding. In one of the 
equilibria we derive, R* is equal to one, and 
German investors are indifferent between 
holding money and bonds; in this case, we 
assume (without loss of generality) that (5) 
: still holds and savings go into bonds. 
Households maximize utility subject to 
budget constraints. Their first-order condi- 
tions are 








Qu /8y, 
—— — =R! +8 
(6) du/dx, ` $ 
du/dz N 
S d cae) 
t 
ðu/ðx Ta 
/ t 8) =R. P | 
0u/8x,,, Pri 
du/dy,* du *—1 
Qu/àx* ©? 
du/dz* AS 
Qu/üÓx* "H? 
Qu/Oxf* xe 
Mtr ide 5) = R* P: . 
du/OX A, piel 


Marginal rates of subscription are set equal 
to relative prices. 
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Governments also face budget constraints 
in the financial exchange:. 


(7) Ry. (bya + 64-1) + pa(g- T) 
< bk +b, +m- m, 
* 6 py * qz) 
RE (berit 56) + piti g + ata) 
€ bg, + bg, + mF 
—mf,T*p?n. 


On the left-hand side of each of these con- 
straints, the first term is principal plus inter- 
est on the debt, while the second is a settling 
of last period's credit transactions. T is a 
transfer from the German government to the 
Italian government; we will consider com- 
bined EC policies that make such transfers 
possible. Governments are credit consumers; 
g and g* are bought on credit. We also 
assume that inzergovernmental transfers are 
credits that are settled in the next financial 
exchange. On the right-hand side of the 
constraints, the first terms are new debt is- 
sue, and the last is tax proceeds. 

We let A, and Af be the growth rates of 
the two monies, 


(8) h, (m,— m, )/m, 


htz(mf-mt)/mf. 
Seignorage collections can be expressed as 
(9) m,— m, ,—7h,m, 
and 


£_ yy *® —h*g* 
m, m, hfmf; 


?More precisely, T is in units of the Italian cash 
good; so, the mark value of the credit for last period's 
transfer in the current financial exchange is P,_,T/e,. 
Governments incur the same valuation and currency 
conversion costs as the private sector; so, the total cost 
of the transfer is cp, ,T/e, in marks or c(p, ,7/ 
epe AX p / Pë) = gta (pa pA) in units of the 
German cash good. 
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so, h, and h* may be viewed as seignorage 
tax rates. — 

In this paper, we only consider stationary 
equilibria.'? A number of results for station- 


ary equilibria follow immediately from the . 


arbitrage conditions, (3), the cash in advance 
constraints, (5), and the first-order condi- 
tions, (6): 


(10) h,=%,> (5.7 Pra) /Prs 
hi = utum (př ptu) 7p. 


l- ê= (bes) ",*). 


^ 


e, [e,— ex es 
Ry73(1+ 8) = m,/m,,—1— hu 
RY (1+8) =mž/mž =l- hž. 


These results are standard. Inflation rates are 
equal to money growth rates. The rate of 
depreciation is approximately equal to the 
difference in inflation rates, and nominal in- 
terest rates increase with money growth rates. 

In a stationary equilibrium, the first-order 
conditions become 











du/dy l-h 

11 —-——— 

d) Qu/àx iva ^^ 
du /dz l-h á 
du/ax | TES EAE 
du/Oy* 1—h* 
du/ax* 146 

i Qu/Oz* " 1—h* 

75 Qu/Ox* 4 146 


Given our assumptions, there is no reason to con- 
sider nonstationary solutions. Government spending is 
constant over time; so there is no need for intertempo- 
ral tax spreading. There would be issues of time consis- 
tency with the monetary policies we consider, but we 
have assumed that policymakers are precommitted to 
cooperative policies. 
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The household budget constraints become 


(12 x+0(y+qz) =(1-A) 


X(n— y — qz) 
x*-- 7*n = (1— h*) 


X(n-— y*— q*z*), 


and the government budget constraints be- 
come! 


03) (1-5)(g- T) -0(y * qz) thx 
and 
(1—h*)(g+q*T) =7*n + h*x*. 


In addition, there is no borrowing or lending 
in equilibrium, so trade balances must equal 
the intergovernmental transfer. For example, 
the Italian trade deficit, in units of the Ital- 
ian cash good, is equal to’? 


(14) qz — cz* =T. 


The EC policymaker's constraint is found 
by aggregating the Italian and German gov- 


H]f governments were cash customers, instead of 
credit customers, the budget constraints would be 


g=O(y+qz)th(x+g) 
and 
gt=r*n* + h*(x*-- g*), 


which are equivalent to (13). 

Some governments try to increase the base for their 
seignorage tax by making the private sector use domes- 
tic currency for government transactions. In the present 
framework, such efforts are pointless. If g is a cash 
good, the seignorage revenue goes up by hg, but if g is 
bought on credit, payments are inflated away by a 
factor wg = hg. 

12To verify (14), use the Italian government’s budget 
constraint to eliminate @(y + qz) in the Italian house- 
hold's budget constraint, and subtract the result from 
the Italian resource constraint. Equivalent operations 
with the German constraints give the German trade 
surplus, cz — q*z* = q*T, which: is equal to the Italian 
deficit since q = c?/q*. 
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ernment budget constraints: 


(15) g+g/q*=[8( y+qz) +hx]/(1-h) 
+ (7*n + h*x*) /g*(1— &*). 


T, the intergovernmental transfer, is now 
implicit. Finally, it should be noted that not 
all of these budget constraints are indepen- 
dent; using the resource constraints and the 
trade deficit equation, the household budget 
constraints, (12), imply the EC policymaker’s 
constraint, (15). 

Summarizing, a stationary equilibrium in 
the two currency version of the model is de- 
fined by the two resource constraints, (2), the 
four first-order conditions, (11), and the two 
household budget constraints, (12). If an 
equilibrium exists, the variables x, y, z, 
x*, y*, z*, q, and the tax rates h,0, h*, 7* 
must satisfy these eight equations. 

Two modifications. must be made in the 
model if Europe adopts a common currency, 
the ECU. First, since there are no longer any 
valuation and currency conversion costs, c — 
1. Second, the cash in advance constraints, 
(5), reduce to a single equation, l 


(16) m,= p,x,+ p*xf = p,(x,+ qx) 
= pf (x,/q, E x). 


where m, is the supply of ECU's. 
In a stationary equilibrium, 
(17) v—-m7*—H, 
where H is the growth rate of ECU's. The 


household first-order conditions, (11), be- 
come 














du/dy 1-H 
= + 
ue ðu/ðx 1+8  ' 
Qu/àz = | 
=q — + a 
du/dy 146 
du/dy* 1-H 
du/dx* 146 l 
: du /dz* | 
= 9u/dy* 1 | 1+8 | 
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_ The household budget constraints, (12), be- 


come 
(19) xd ee qz) = (1-H) 
x(n-y-4z) 
x* T*n — (1-H) 


X(n = y*- z*/4), 


and the EC policymaker's constraint, (15), is 
replaced by 


Q0) (1- H)(g- ag) 
= (y qz) 


+ qr*n+ H(x + qx*). 


Once again, the policymaker’s budget con- 
straint is redundant; using the resource con- 
straints, it can be derived from the house- 
hold budget constraints. 

Summarizing, a stationary equilibrium in 
the single currency version of the model is 
defined by the two resource constraints, (2), 
the four first-order conditions, (18), and the 
two budget constraints, (19). If an equilib- 
rium exists, the variables x, y, z, 
x*, y*, z*, q, and the tax rates H, 0, and 7* 
must satisfy these eight equations. 


II. The Optimal Currency Area Problem 


If the EC keeps its two currencies, then 
there are four tax rates (h, h*, 0, and 7*) to 
be set. The seignorage tax distorts the deci- 
sion between cash and credit goods; so does 
the Italian VAT, since it falls only on credit 
goods. The German VAT is not distor- 
tionary because it falls symmetrically on cash 
and credit goods. Having four separate taxes 
helps in spreading tax distortions, but the 
multiplicity of currencies imposes extra costs 
on producers who sell abroad. If the EC 
adopts a common currency, these extra costs 
disappear, but there will only be three tax 
rates (H, 0, and r*) available for tax spread- 
ing. The EC will be an optimal currency area 
if valuation and currency conversion costs 
are more important than tax spreading. 
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To be more precise, we assume that the 
EC policymaker chooses the number of cur- 
rencies (as well as the tax rates) to maximize 
a weighted sum of the utilities of Italian and 
German households. Since Italian and Ger- 
man households have the same utility func- 
tions, labor endowments and production 
possibilities, weights are chosen to ensure 
that U=U*. (A more appealing approach 
might have been to use the Nash Bargaining 
solution. However, since it was beyond the 
scope of the present paper to model a non- 
cooperative game between Italy and Ger- 
many, there was no obvious way of finding a 
status quo point.) ; 

First, we calculate the best that can be 
done if multiple currencies are retained; that 
is, we solve the problem . 


(21) maxW"*z wU--(1— w)U* 
(xz, x", y*; z*, q h, h*, 0, 7%, w) 


‘subject to the conditions that define 
equilibrium in the two currency ver- 
sion of the model and the requirement 
that U = U*. 


Then, we calculate the best that can be done 
if a common currency is adopted; that is, we 
solve the problem 


(22) max W“ = wU --(1— w)U* 
(x»z x*, y z*, 4 H, 8, 7%, w) 


subject to the conditions that define 
equilibrium in the one currency ver- 
sion of the model and the requirement 
that U = U*. 


Finally, we compare the results of these two 
problems. W/** is the utility each country 
obtains under a common currency, and W"* 
is the utility each country obtains with mul- 
tiple currencies. If W** is greater than W^, 
then the EC is an optimal currency area. 


Ill. Tax Spreading with Two Currencies 


Proposition 1 states that with two curren- 
cies there are enough tax instruments avail- 
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able to eliminate all of the tax distortions 
and obtain an efficient solution. 


PROPOSITION 1: The solution to problem 
(21) is also the solution to the social planner 's 
problem, ` 


Max W = (1/2)U+(1/2) U* 
(x, y, 2, x*, y*,2*} 
subject to the resource constraints, (2). 


The social planner, unlike the EC policy- 
maker, can allocate goods and services with- 
out going through markets or satisfying cash 
in advance constraints. U is equal to U* in 
the planner's solution because of the symme- 
try in our model. In addition, the planner's 
allocation is the best that can be achieved 
given the resource constraints, (2); that is, 
Ww < W. To prove Proposition 1, it suffices 
to show that the EC policymaker can find 
tax rates that make a market economy sup- 
port the planner's allocation. 

The first-order conditions for the planner's 








: problem are 
Qu/à du/oy* 
(23) fay 9u/0y" — 
Qdu/OÓx | Ou/Ox* 
and 


Qu/Oz* du/dz 
du/dx 3 du/dx* 


In the planner’s solution, marginal rates of 
substitution in consumption are set equal to 
marginal rates of transformation in produc- 
tion. One unit of labor produces one cash 
good or one home credit good, but c units of 
labor are required for a credit good con- 
sumed abroad. So, the marginal rate of sub- 
stitution between cash goods and home credit 
goods is equal to one, and the marginal rate 
of substitution (in the planner’s welfare 
function) between cash goods and credit 
goods .consumed abroad is equal to c. Tax 
rates must be chosen so.that after tax rela- 
tive prices reflect these tradeoffs. 

First, consider the margin between cash 
goods and home credit goods. The marginal 
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rate of substitution between these goods must 
equal one. The tax rates that achieve this 
relative price can be deduced from the 
household first-order conditions, (6) or (11). 
R*, the gross nominal return on German 
bonds, must be equal to one to avoid distor- 
tions between the German cash and credit 
goods. A consumer has to hold cash instead 
of bonds to buy a cash good, but this causes 
no distortion if money and bonds bring the 
same return. To make R* equal one, the 
German seignorage tax must be set in accor- 
dance with Friedman’s rule; that is, 


(24) ht=—6. 


In Italy, the story is a little. more compli- 
cated. The VAT can also cause a distortion 
between cash and home credit goods because 
the black market escapes taxation; @ and h 
must be set so that 


l-h 
-1 unc = 
(25) R'"-8 mat? 1. 
The seignorage tax on cash goods balances 
the VAT on credit goods, and the relative 
price is unaffected.. 

Next, consider the margin between cash 
goods and credit goods sold abroad. The 
marginal rate of substitution (in the planner’s 
welfare function) between cash goods and 
credit goods consumed abroad must equal c. 
However, by the symmetry of the planner’s 
solution, 


du/dz* du/dz 
du/dx a Qu/àx 








(26) 


and 
.Qu/Oz du/dz* 
Qu/Ox* n Qu/Ox*' 


The tax rates that make the marginal rate of 
substitution between cash goods and im- 
ported credit goods equal to c can again be 
inferred from the household first-order con- 
ditions. If tax rates satisfy restrictions (24) 
and (25), and if g=q*=c, then this margin 
‘will also be met. 
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The planner’s solution is symmetric. So, z 
is equal to z*, and (14) implies that q (and 
thus g*) is equal to c if the intergovernmen- 
tal transfer, T, is equal to zero. 

Summarizing, if tax rates satisfy restric- 
tions (24) and (25) and if the intergovern- 
mental transfer is zero, then the first-order 
conditions for the planner's problem are sat- 
isfied. Finallv, tax revenues must finance 
government spending in each country. So, 
the tax rates must also satisfy the govern- 
ment budget constraints, (13). This will be 
possible if (as we assume) g and 6 are not 
too big. This completes the proof of Proposi- 
tion 1. ; 

Corollary 1 emphasizes the fact that opti- 
mal tax spreading requires higher infiation in 
Italy than in Germany, and therefore a con- 
tinuous depreciation of the lira. Any attempt 
to limit this depreciation would be counter- 
productive; tax distortions would be intro- 
duced, and welfare would be lower. 


COROLLARY1: In the optimal multiple 
currency solution, m* = — 8, m > «*, and ê= 
7—7*—(1-35)820. 


The corollary follows from equation (10) and 
restrictions (24) and (25). 


IV. When Is the EC an Optimal . 
Currency Area? 


If the EC adopts a common currency, then 
the valuation and currency conversion costs 
are eliminated; c reverts to unity. However, 
since there is also one less tax rate for the 


` EC policymaker to set, there are too few 


instruments to eliminate tax distortions en- 
tirely. One might expect that the answer 
hinges on the size of c: if c is small, then the 
EC should retain its separate currencies to 
facilitate tax spreading; if c is large, then the 
EC is an optimal currency area. ` 

More specifically, one might expect some- 
thing like Figure 1. W"* (welfare in the 
multicurrency solution) should be a decreas- 
ing function of c. When c is equal to 1, 
multiple currencies impose no extra costs. 
So, tax spreading advantages should make 
multiple currencies dominate a common cur- 
rency; that is, W/"*(1) should be greater than 
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FIGURE 1 


W** (welfare in the common currency solu- 
tion). However, as c increases, the attractive- 
ness of multiple currencies should diminish. 
For c greater than some critical value, c, the 
EC should be an optimal currency area. 

Lemma 1 states that welfare under multi- 
ple currencies is indeed a decreasing func- 
tion of c, and that the EC cannot be an 
optimal currency area for values of c close 
to one. 


LEMMA 1: W"* is a function of c, 
dW"*(c)/dc <0, and W"*(1) > W“. 


The planner's problem described in Proposi- 
tion 1 has a unique interior solution and 
W"* is a differentiable function of c be- 
cause of the restrictions placed on u(x, y, z). 
W": is a decreasing function. of c because 
dW "*(c)/dc = — Xz* ~ A*z, where À and A* 
are the positive Lagrange multipliers in the 


planner's problem.? W"*(1) > W** since the 
planner's solution cannot be improved upon. 
W"*(1) > W** follows from the fact that the 
planner’s solution cannot be duplicated in 
the common currency version of the model. 
What remains to be shown in Figure 1 is 
that W"*(c) « W** for large values of c. It 
turns out that this need not be the case, even 
as c goes to infinity. This seemingly paradox- 
ical result can be explained as follows. As c 
rises, international trade is driven out by the 
high cost of selling to foreigners. However, if 
home goods and imports are very close sub- 
stitutes, then consumers lose little utility in 
switching to home goods, and the policy- 


This fact follows from the symmetry of the plan- 
ner's problem and the envelope theorem; see, for exam- 
ple. Appendix A.11 in Hal Varian (1978). 
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FIGURE 2 


maker can continue to take advantage of the 
tax spreading opportunities afforded by a 
multiplicity of currencies. If on the other 
hand home goods and imports are not close 
substitutes, then consumers do lose utility in 
the switch from imports to home goods, and 
W""*(c) < W for large values of c. 

Tt is convenient at this point to restrict our 
attention to the class of CES (constant elas- 
ticity of substitution) utility functions; let 


(27) u(x, Y, z) = [px + p, yY 


PEPEN 
+ paz!71/7]" /Y : 


where p+ fy 471 and 0 « y «oo; y is 
the elasticity of substitution. 

Lemma 2 describes W""*(c) for this class 
of utility functions. 


LEMMA2: If utility is CES, then 


(n-g) 
x [pT + wh + uc 77] 
fory #1 
(n— g) upige" 
for y=1 


-)1 


W'™(c)= 


JUNE 1990 
and as c > oo, 
0 
forü «y x1 
mc = as 
FONE NG us g) [urea] 
forl1« y «oo. 


Lemma 2 is proved by solving the planner's 
problem with CES utility functions. Figure 2 
illustrates W'"*(c) for various values of y. If 
y is less than or equal to one, W'"* goes to 
Zero as c gets very large. 

Proposition 2 states our basic result. The 
EC will be an optimal currency area if valua- 
tion and currency conversion costs are big 
enough and if the elasticity of substitution is 
not too big. 


PROPOSITION 2: If utility is CES, then 


i. if the elasticity of substitution is less than 
or equal to one, there exists a critical 
value, c, of the valuation and currency 
conversion costs such that W"*(c) > W°: 
if c « and W""(c) «W* if c» c. 

ui. if the elasticity of substitution is high, then 
a critical value, c, need not exist; W"*(c) 
may be greater than W** for all values 


of c. 
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TABLE 3— THE IMPORTANCE OF y 


y W" (o) we 


1.0 a: 0.000 0.271 
5.5 0.291 0.291 
10.0 0313  . 0.307 


To prove the first part of Proposition 2, it 
suffices to show that W**- 0, because 
Lemma 2 implies that W' goes to zero as c 
goes to infinity and Lemma 1 states that 
W*(1) > W**, The restrictions on u(x, y, z) 
guarantee that the consumer's problem has 
an interior solution. Therefore, if (as we as- 
sume) problem (22) has a solution, it is an 
interior solution. The CES utility function 
gives strictly positive utility if all of its argu- 
ments are positive. 

To prove the second part of Proposition 2, 
it suffices to find an example where W'"'*(oo) 
>W“. In Table 3, we report a set of numer- 
ical examples in which ô = 0.05, n=1, g= 
0.2, u,/p5:71.5 and gu; — p4. When y=1, 
W'"* goes to zero as c goes to infinity; W"* 
is less than W° for large values of c. When 
y ^10, W"* goes to 0.313 as c goes to 
infinity, while W is equal to 0.307; W"* is 
greater than W** for all values of c. 


V. Some Illustrative Examples 


The “simple” model outlined in Section I 
is really rather complex, and this complexity 
makes it difficult to say more about the 
factors that determine the size.of the critical 
value, c. In this section, we suggest some 
propositions about the size of c based on 
two sets of numerical examples. 

The examples all use a Cobb-Douglas util- 
ity function 


(28) u(x, y, z) = xhyPzms, 


where p, + p, + u4 =1. This is of course the 
CES utility function with y =1. And in all of 
the examples, 8 = 0.05 and n =1. 

The first set of examples illustrates how 
the critical value of c might depend on the 
size of government spending. p; = 44 = u3 = 
1/3 and g is allowed to vary between 0 and 
0.4. The results are reported in Table 4; they 
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TABLE 4— ¢ AND THE SIZE OF g 
g 0.0 0.1 0.2 0.3 0.4 
é 1.00002 1.00171 1.00667 1.01623 1.03265 


TABLE 5—C AND OPENNESS 
i3 / 3 0.5 0.75 1.0 1.25 1.5 
e 1.0134 1.0089 1.0067 1.0053 1.0045 


suggest that the critical value of c is an 
increasing function of g. This is a plausible 
proposition. As g increases, more revenue 
must be raised. Tax spreading becomes more 
important, and the value of c for which the 
EC becomes an optimal currency area is 
higher. i 

The surprising result in Table 4 may be 
the low numbers obtained for the critical 
value of c. Even if the government consumes 
as much as twenty percent of output, valua- 
tion and currency conversion costs of only 
seven-tenths of one percent of the cost of 
producing the traded good make the EC an 
optimal currency area. The critical value does 
rise rapidly as g is increased. 

The second set of examples illustrates the 
effect of openness on the critical value of c. 
We measure openness by the relative impor- 
tance of imported goods in the utility func- 
tion; that is, we let p; =p, and we take the 
ratio u4/p, as our measure of openness. 
g=0.2 and p,/p, is allowed to vary be- 
tween 0.5 and 1.5. The results are reported in 
Table 5; they suggest that the critical value 
of c goes down as openness is increased. 
This too is a plausible proposition.!^ As im- 
ports become more valuable, lowering the 
cost of producing them becomes more im- 
portant. 

Summarizing, the examples presented in 
this section suggest the following proposi- 
tions: (1) Small valuation and currency con- 
version costs can make the EC an optimal 


14 Mundell (1961) also found that openness was a 
factor leading countries to form a currency union. 
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currency area. (2) High levels of government 
spending make the EC less likely to be an 
optimal currency area. (3) Openness makes 
the EC more likely to be an optimal cur- 
rency area. 


V. Limitations and Suggestions 
for Future Research 


Throughout this paper, we have assumed 
that the Italian black economy is a cash 
good, subject to a lira denominated cash in 
advance constraint. In Section III, this al- 
lowed the EC policymaker to tax the black 
market separately, using the Italian inflation 
tax. Alan Stockman has pointed out that 
there is an obvious incentive here for black 
market participants to switch to marks, which 
are subject to a lower inflation tax; this 
possibility is simply ruled out by the cash in 
advance constraints in our model. It would 
therefore be interesting to replace our cash 
in advance constraints with a restriction that 
allowed currency substitution. Would Stock- 
man’s concern constitute an argument for 
currency controls? We do not know. 

Throughout this paper, we have also as- 
sumed that all fiscal policy decisions are 
made by a central EC policymaker. Present 
proposals for European economic integra- 
tion do not go nearly so far. It would be 
interesting to introduce Italian and German 
policymakers who play a Nash game with 
each other in their attempts to maximize 
their own citizens’ welfare. In a multicur- 
rency setting, the “1992” proposals would be 
represented by set of rules of the game. For 
example, protective tariffs would be out- 
lawed. In a common currency setting, we 
might envision a central EC policymaker 
who issues the currency and distributes the 
seignorage.? The Italian and German poli- 
cymakers would be free to set the remaining 
tax rates. - 


PAlessandra Casella and Jonathan Feinstein (1988) 
demonstrate that difficulties arise if both the Italian and 
German policymakers are allowed to issue the common 
currency. 
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Finally, we have also assumed that policy- 
makers can precommit to monetary policies 
that may be time inconsistent. (The policies 
described in Section III do happen to be 
time consistent, since there are no distor- 
tions in the equilibrium that results; this 
would not be the case if, say, a labor-leisure 
decision were added.) We noted in the intro- 
duction that the EMS is sometimes justified 
on the grounds of credibility; Italy is thought 
to be able to solve its precommitment prob- 
lem by tying the lira to the mark.!6 It would 
therefore be interesting to drop our assump- 
tion about precommitment and investigate 
time-consistent solutions." We speculate 
that Italy would have the higher inflation 
rate, and an inflation bias. Italy may wish to 
tie the lira to the mark to reduce the infla- 
tion bias, even if this interferes with optimal 
tax spreading. In other words, credibility 
may have to be weighed along with the valu- 
ation and currency conversion costs when 
defining an optimal currency area. 


Canzoneri and Henderson (1988) question whether 
a credibility problem can be so easily solved: why is an 
announcement about an exchange rate policy more 
binding than an announcement about a money supply 


policy 
To do so, we would have to introduce a cost to 

actual (as opposed to anticipated) inflation; see Her- 

schel Grossman (1288) or Maurice Obstfeld (1988). 
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Foreign Exchange Rate Expectations: 
Micro Survey Data 


By TAKATOSHI ITO* 


This paper analyzes the panel data of biweekly surveys on the yen/dollar 
exchange rate expectations of forty-four institutions for two years, and contains 
four major findings. First, market participants are heterogeneous; that is, signifi- 
cant “individual effects" exist in their expectation formation. Second, the individ- 
ual effects have a characteristic of “wishful expectations": exporters expect yen 
depreciation, and importers expect yen appreciation (relative to others). Third, 
many violate the rational expectations hypothesis. Fourth, forecasts with long 
horizons showed less yen appreciation than those with short horizons. Cross-equa- 
tion constraints implied by the consistency of the forecast term structure are 


strongly rejected in the data. 


As rational expectations have become a 
popular benchmark for thinking about fi- 
nancial and macroeconomic hypotheses, 
many economists have become more inter- 
ested in directly measuring the expectations 
of market participants. Although survey data 
for many domestic variables, including inter- 
est rates and inflation rates, have been fre- 
quently analyzed by many investigators (see, 
for example, Frederic S. Mishkin, 1983, ch. 
4), it is only recently that survey data on 
foreign exchange rates have become avail- 
able and been analyzed. Kathryn M. 
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Dominguez (1986) and Jeffrey A. Frankel 
and Kenneth A. Froot (1987a,b) have ex- 
ploited the survey data made available by 
the Money Market Service (MMS), the Amex 
Bank Review and the Economist Financial 
Report. 

The surveys that were investigated by 
Dominguez and by Frankel and Froot have 
had only their median responses reported. 
Heterogeneity among market participants, if 
it exists, has been aggregated out. If the 
market consists of homogeneous agents that 
share the same forecasting model with com- 
mon beliefs (priors) and information, then 
the median response would sufficiently de- 
scribe the market in terms of forecasts. How- 
ever, if market participants differ in the fore- 


1Dominguez (1986) used the Money Market Service 
(MMS) data from 1983 to 1985 to test a rational expec- 
tations hypothesis. Unbiasedness and the independence 
of forecast errors from the forward premium were tested. 
She found that survey forecasts were no better than the 
spot rate in predictive power and that rationality was in 
general rejected. In addition to the MMS data, Frankel 
and Froot (1987a,b) exploited the survey data collected 
by Amex Financial Service and also the Economist, 
which have longer sample periods and different forecast 
horizons. They found that expectations do respond to 
exchange rate changes. Moreover, short-term forecasts 
are more “destabilizing” than long-term forecasts; that 
is, the response to the degree of forecasted appreciation 
in response to appreciation is larger in the short-term 
horizon than in the long-term one. 
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casting characteristics, then focusing on the 
median misses the most interesting ques- 
tions, such as whether the differences persist 
or are temporary, whether the differences are 
correlated with the participant’s traits, and 
whether a rationality hypothesis is more 
likely to be rejected in individual data. Only 
individual responses of survey data can an- 
swer these questions. 

In this paper, I will use the survey data 
collected by the Japan Center for Interna- 
tional Finance (JCIF) in Tokyo, which al- 
lows me to investigate the individual re- 
sponses in the survey. In particular the JCIF 
data set has two distinct advantages over the 
data used by Dominguez and by Frankel 
and Froot. First, the JCIF data consist of 
individual responses with no missing obser- 
vations. This is the first paper to study the 
individual responses of exchange rate expec- 
tations, although individual responses of in- 
flation expectations were studied before 
Stephen Figlewski and Paul Wachtel (1981). 
Second, not only financial institutions but 
other companies as well are polled in the 
JCIF survey. Therefore, there is a chance to 
associate possible heterogeneity to the traits 
of the forecasters’ industry. 

There are four major findings in this pa- 
per. First, market participants are found to 
be heterogeneous. There are significant “in- 
dividual effects” in their expectation forma- 
tion. Second, the individual effects have 
characteristics of “wishful expectations”: ex- 
porters expect a yen depreciation (relative to 
others), and importers expect a yen appre- 
ciation (relative to others). Third, many 
institutions are found to violate the rational 
expectations hypothesis. Most of them un- 
derestimated the degree of yen appreciation. 
Fourth, forecasts with long horizons showed 
less yen appreciation than ones with short 
horizons. Put differently, market participants 
appear to have a “bandwagon” expectation 
in the short run, but a “stabilizing” one in 
the lorig run. The “twist” in forecast term 
structure could be “consistent” (in the sense 
of Kenneth A. Froot and Takatoshi Ito, 
1989), if an iterated substitution of a short- 
term forecast yields a long-term forecast. 
However, .cross-equation constraints implied 
by the consistency are strongly rejected. 
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1. Data Summary 
A. The Data Description 


The JCIF has conducted telephone sur- 
veys twice a month, in the middle and at the 
end of the month, on Wednesdays, since 
May 1985. Forecasts of the yen/dollar ex- 
change rate for the one-, three-, and six- 
month horizons are obtained from foreign 
exchange experts in 44 companies, including 
15 banks and brokers, 4 securities compa- 
nies, 6 trading companies, 9 export-oriented 
companies, 5 life insurance companies, and 5 
import-oriented industries? Each respon- 
dent is asked to give a point forecast for 
each horizon. In this paper, I assume that 
reported forecasts are the subjective means 
of respondents. We do not have any data on 
the subjective variance or range. The survey 
is meticulously arranged so that all 44 com- 
panies on the permanent list respond every 
week. 

When a data set is analyzed as panel data, 
the mean across individuals and the mean 
across time should not be confused. In the 
following, the mean across 44 individuals at 
a time will be referred to as the (cross sec- 
tion, total) average; the mean across individ- 
uals at a time in an industry group will be 
referred to as the group average. The mean 
across time of an individual, of a group, or 
of the “average” will be referred to as the 
(time) mean of the individual, of the group, 
or of the average, respectively. 

The JCIF calculates the total average, the 
standard deviation, the maximum, and the 
minimum of the 44 responses and also the 
industry group averages and the group stan- 


"The first few surveys were conducted not on 
Wednesdays but on the middle and last business days of 
the month. However, the survey date was fixed on 
Wednesday after the fourth observation. A twice- 
a-month survey means that observations are usually 
biweekly, with a couple of exceptions in a year. That is, 
there are 24, instead of 26, observations in the JCIF 
data in 52 weeks. It is unfortunate that the interval is 
not fixed. In the following, I disregard the problem 
arising from a mix of two- and three-week intervals. The 
survey started with 42 companies and expanded to the 
current 44 after the fourth survey in July 1985. 
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TABLE 1—A. TIME MEAN OF (sj (t, k) — s(t)},* 
May 1985-JUNE 1987, NUMBER OF OBSERVATIONS = 51 

Horizon 1 Month 3 Month 6 Month 
AVE : ; —1.420 . —1.431 — 0.044 
BAN ; — 1.404 — 1.658 —0.957 
SEC : — 1.097 — 0.834 +0.621 
TRA — 1.956 — 2.453 — 0.948 
,. EXP $ : & — 0.715 — 0.137 +1.736 
INS ] . —1.746 — 2.309 4-0.302 
IMP ME | —1.937 —1.536 — 0.430 

ACT : — 2.064 — 5.970 


—11.987 





TABLE 1 —B. UNCONDITIONAL EXPECTED CHANGE, DISTRIBUTION AMONG 
INDIVIDUAL RESPONDENTS OF THE TIME MEAN OF FORECASTED CHANGES 
IN THE EXCHANGE RATE OVER THE SPECIFIED HORIZON 





Horizon 1 Month 
percent 
+5.0 
x 
0.0 x 
XX 
XXXX 
XXXXXXXXXXXXXXX 
XXXXXXXXXXX 
XXXXXXX 
xx 
x 
—5.5 
Max 141 
Min —3.10 





3 Month 6 Month 
x 
xx 
x 
Xxxx 
x XXXX 
x 
xx XXXX 
xx XXXX 
Xxx . XXXX 
XXXXXXXXX XXXXX 
x XXXXXX 
XXXXXXX . x 
XXXXXXXX xx 
XXXX ` x 
xx xx 
x x 
xx 
x. x 
x 
Max 3.25 Max 4.62 
. Min — 4.76 Min —5.20 


Mean of the (unconditional) expected changes (in percent). Not annualized or | 


adjusted for k. 
x =1 respondent. 


dard deviations. On the day after the survey, 
the JCIF informs its subscribers, including 
those who are polled, of the summary statis- 
tics. The total average is also released to the 
press and other media. 


I will use, in addition to the panel data of ' 


the 44 companies, the public information 
part of the survey, the cross-section average 


(AVE) and the group averages for the dif- 
ferent industries: banks (BAN), securities 
companies (SEC), trading companies (TRA), 
companies in the export industries (EXP), 
insurance companies (INS), and companies 
in the import industries (IMP). The unit is 
yen per one U.S. dollar, so that a negative 
movement indicates a yen "appreciation." 
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The spot exchange rate, s(t), is measured at 
the closing quote in Tokyo on Wednesday of 
the survey week. 


B. Overview 


Table 1 shows the time means of (uncon- 
ditional) expected changes (in percent) from 
the spot rate at the time of survey for the 
cross-section total average, the group aver- 
ages, and (in a separate distribution table) 
for each individual. For the purposes of dis- 
cussion, the actual (ex post) changes of the 
spot exchange rate (ACT) for each horizon 
are reported in the same table. For each 
horizon and each individual or group, sub- 
tracting the actual changes from the fore- 
casts produces the forecast errors. 

In the one-month horizon, the (total) aver- 
age on a typical week showed an expected 
1.4 percent yen appreciation. The group av- 
erages ranged from a 0.8 percent to a 2 
percent appreciation. Relative to the total 
average, the export industry was the most 
biased toward a yen depreciation, and the 
trading companies and the import industries 
were the most biased toward a yen apprecia- 
tion. Looking into individual data, one ex- 


treme predicted a 1.4 percent depreciation of. 


the yen, while the other extreme predicted a 
3.1 percent of appreciation. The distribution 
of individual forecasts has a nice unimodal 
distribution. The average expected apprecia- 
tion of the yen in the three-month horizon 
was 1.4 percent, about the same as in the 
one-month horizon. (Note that no adjust- 
ment is made with respect to the length of 
horizon.) 

As in the one-month horizon, the export 
industry shows a yen depreciation bias (from 
the total average), and the trading compa- 
nies show a yen appreciation bias in the 
three-month horizon. A wide disagreement 
among individuals begins to appear in the 
three-month forecasts. It becomes a bimodal 


distribution: one group believes that the yen _ 


depreciates from the one-month to three- 
month forecast horizon, while the other be- 
lieves that the yen continues to appreciate. 
For the six-month horizon, the total aver- 
age shows that the market expects the yen to 
return to nearly the prevailing level at the 
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time of forecast. This is a sharp turnaround 
from the forecast of a 1.4 percent yen appre- 
ciation in three months. In fact, each of the 
group averages indicates that the group an- 
ticipates less yen appreciation in the six- 
month horizon than in either the one- or the 
three-month horizon. 

The findings of this subsection can be 
summarized and related to the contents of 
the rest of this paper. First, the findings are 
highly suggestive of heterogeneous market 
participants. A rigorous analysis and inter- 
pretation of the heterogeneity will be pro- 
vided in Section III. Second, large forecast 
errors were recorded during the intermittent 
waves of yen appreciation after September 
1985. Econometric tests on various forms of 
the rational expectation hypothesis will be 
conducted in Section IV. Third, the total 
average and most of the group averages have 
a *twist" in their forecasts, a yen apprecia- 
tion in the short horizon and a yen deprecia- 
tion in the long borizon. Section V investi- 
gates whether such twists in expectations are 
internally consistent. 


II. Wishful Expectations and Heterogeneity 


A. Econometric Issue — A Special Case 
of Panel Data 


Recall that our micro survey data set con- 
sists of 44 individuals and 51 observations. 
Suppose that an individual forecast forma- 
tion at time f consists of a common struc- 
tural part based on public information, 
f(4(t)) and an individual effect, gj. For a 
given forecast horizon, k (suppressed nota- 
tion), the expected exchange rate for individ- 
ual j, j—1,..., J (where in this paper J= 
44) is 


(1) sf(t) -fü()) + g;* uy), 


where s/(t) is a k-step ahead forecast of the 
spot exchange rate at time 7, by individual j; 
u,(t) is a pure random disturbance (with 
respect to j and f) representing, for exam- 
ple, a measurement or a rounding error. The 
cross-section average of individual forecasts, 
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S$yg(t) is defined as 
(2) sfve(t) =f(1(1)) + gave + avg (10). 


where x ,yg(t) = (Ex,(t))/J; x=s*, g, and 
u. Assume f(I(t)) contains a constant term 
so that normalization, g 4), = 0, is possible. 
Then subtracting each side of (2) from the 
corresponding side of (1), we obtain 


(3) s§(t) — sáyg(1) 
eg +(u (0) — uavg(t)). 


The estimator of the individual effect, g 
be obtained by regressing the left-hand s side 
of (3) on a constant over the sample period 
(across time). This procedure is simple and 
robust. It is unnecessary for the econometri- 
cian to know the exact structure of f(I(t)) 
as long as it is common to everybody for 
every survey date. 

If the difference in the individual effects of 
two individuals is to be estimated, a similar 
method can be employed. 


(4) sj(0)— si) 
= g,— 8, * (uj(r) ^ u,(1)), h* j. 


A (composite) disturbance term in equations 
(3) and (4) has mean zero and no serial 
correlations if u;(1) is serially and cross-sec- 
tionally uncorre elated and f(/(t)) is exactly 
common to all individuals. 

If the difference in individual beliefs ex- 
tends to “idiosyncratic” coefficients on pub- 
licly available information in the structural 
part, f(I(1)), the above procedure needs to 
be modified but is still applicable. Suppose, 
for example, that the forecast is in an extrap- 
olative form: 


(5) sf()-sQo) 
=a,+b,,(s(t-1)—s(t)) 
*5(r-2) sr) wu, 


where g, is the difference in a, Then the 
idiosyncratic individual coefficients can be 
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estimated by regressing the following equa- 
tions, for all j: 


(6) s$(1) — ssyg (1) 
+{ by; ~ brave} (s(t—-1) - s(10)) 
+ { ba; — bo ave} 
(s(1—2)-s((-1)) 


+u,(t) — uae- 

The above procedure parallels the tech- 
nique in the panel data analysis, although in 
the usual examples of the panel data analy- 
sis, the right-hand-side variables take differ- 
ent values for different individuals. Instead, 
it is reasonable here to assume that the 
structural part and the values of regressors 
(i.e., the past values of the exchange rates) in 
exchange rate forecasts are identical for all 
individuals, but with possibly different co- 
efficients. 


B. Heterogeneous Participants in the 
Tokyo Market 


In search of hard evidence for (or against) 
heterogeneity among market participants, I 
estimate 44 individual effects, g,, and “ group 
effects.” In detecting the “group effect,” 
group average forecast calculated by the 
JCIF is treated as an individual j, then the 
total average (or another group average) is 
subtracted.” 

The individual (or group) effect g,, esti- 
mated using equation (3), are reported in 
Table 2.4 


3Since the micro panel data set was made available 
on the condition that the anonymity of the source 
should be honored, it is impossible to aggregate the 
individuals into groups. 

For some cases, an allowance had to be made for 
AR(1) serial correlation in u;(t)— u4yg(t), or in u; (1) 
— uj (t), contrary to the assumptions mentioned earlier. 
This may be due to either serial correlation in u, or 
deviations in f(/(t)) among individuals. However, many 
of rejection cases (ie. confirming heterogeneity) are 
found without AR(1) disturbances. 
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TABLE 2—A. GROUP DEVIATIONS FROM THE TOTAL AVERAGE, FOR EACH HORIZON 
1 Month 3 Month 6 Month 
Horizon a DW or RHO a DWor RHO a DW or RHO 
BAN 0.017" 0.284 —0.228 0.530 ~0.941 0.371 
t-stat (0.25) (2.04) (—1.28) (4.29) (—5.74)* (2.81) 
SEC 0.305 0.438 0.561 0.421 0.743 0.446 
t-stat (1.25) (3.38) (1.62) (3.14) (1.47) (3.49) 
TRA —0.536 DW -—213 —1.022 DW =1.61 — 0.908 0.467 
t-stat (—4.98)* (—7.56)* (—-2.57)* (3.61) 
EXP 0.645 DW=2.07 1.294 DW =1.62 1.832 0.435 
t-stat (8.55)* (12.68)* (6.11)* (3.41) 
INS — 0.326 0.474 — 0.815 0.645 0.301 ‘0.661 
t-stat (-1.54) (3.72) (—1.93)* (5.86) (0.54) (5.99) 
IMP —0.517 DW =1.47 — 0.079 0.301 — 0.434 0.422 
t-stat (—3.76)° (—0.29) (2.17) (—1.39) (3.27) 
“It shows the "heterogeneous" group at the level of 1 percent. 
TABLE 2—B. WISHFUL EXPECTATIONS, DISTRIBUTION OF INDIVIDUAL EFFECTS 
Horizon 
Horizon 1 Month 3 Month 6 Month 
percent 
+50 
x x 
xx 
x x 
x 
x xo 
xx o 
x xxo ` xo 
x i Xxxoo 
xxxoo 0000000000 000 
0.0 X000000000000 o 000 
X00000000000 0000000 00000 
XXXX000 00000000 00000 
XXXX XXXO Xxoo 
x xo, xo 
x xo 
xx x 
x xx 
x 
o 
—5.5 xx 


x — significant individual effects; 
o = insignificant individual effects. 


From Table 2, panel A, we learn that for 


any horizon, group effects are significant for 


the export industry, with a depreciation bias, 
and for the trading companies, with an ap- 
preciation bias. A significant appreciation 
bias was also detected for the import indus- 
try for the one-month horizon, for the insur- 


ance industry for the three-month horizon, 
and for the banking sector for the six-month 
horizon. 

The distinctive effect of exporters in con- 
trast to importers or to trading companies 
can be highlighted by measuring the differ- 
ence in individual effects directly, as in equa- 


440 THE AMERICAN ECONOMIC REVIEW 


JUNE 1990 


TABLE 3— IDIOSYNCRATIC EFFECTS: EXTRAPOLATIVE FORM 


SF (2) S4ve(2) = aj — aayg + (51; — bayg) GG — 1) — 5(0) + (bay — brave Gt —2) 7 s( 73) t+ 4 (2) — avg 


HO: No idiosyncratic coefficient effects, b = 0 
(allowing for individual effect of a constant bias) 


Hi: No idiosyncratic coefficient of individual (constant) effect: a = b — 0 





Lag 1 lag (b, = 0) 2 lag 
length 1 Month 3 Month 6 Month 1 Month 3 Month 6 Month 
Horizon HO Al HO Hl HO H1 HO Hl HO H HO Al 
BAN F 0.122 0103 2.51 248 2.34 18.19 0.433 0.338 2.60 3.13 0.732 173 
sig, 0.729 0.903 0.120 0.095 0.133 0.00* 0.651 0.798 0.086 0.035 0.487 0.00* 
SEC F 0.815 137 0.0057 141 0,000 1.03 0.609 1.31 0281 1.32 0.032 0.740 
sig 0.371 0.265 0.847 0.253 0.984 0.367 0.502 0.224 0.756 0.280 0.968 0.533 
TRA F 0.461 21.0 0.390 24.6 0.652 516 1.69 1676 191 180 0.583 401 
' sig 0.500 | 0.00* 0.535  0.00* 0.423  0.009* 0.196  0.00* 0.161 0.00 0.562 0.013 
EXPF 428 40.5 2.16 66.44 0.557 1833 229 52.1 2.88 648 0.186 18.6 
sig 0.044 0.00* 0.148  0.00* 0.459  0.00* 0.113  0.00* 0.067  0.00* 0.831  0.00* 
INSF 0429 212 229 3.56 3.49 1.89 0.347 168 1.12 2.71 146 114 
sig 0.516 0.132 0.317 0.037 0.068 0.162 0.708 (0.186 0.335 0.056 0242 0.345 
IMPF 3.68 730 | 136 0.726 1.29 2.31 1.73 541 1.08 0.763 1.07 1.93 
sig 0.061 0.001* 0.249 0.489 0.262 0.110 0.188 (0.003 0.347 0.521 0.352 039 
F: F-statistics. 


sig: significance level. 


tion (4). (This is not reported here; see 
Takatoshi Ito, 1988b.) Exporters have a de- 
preciation bias in their expectation forma- 
tion compared to importers and trading 
companies for any horizon. Table 2, panel B 
shows that, for any horizon, about half of 
the 44 individuals have a significant bias in 
their forecasts. The deviations are sometimes 
very large. 

One might object to a formulation of the 
individual effects in the form of biases in the 
constant term. They could have different 
models. Since it is not likely that the JCIF or 
the econometrician could persuade each 
forecaster to justify the forecast with a 
model every week, we have to guess the 
form, assuming that each market participant 
has a common autoregressive forecasting 
model, but with different coefficients on the 
lag terms (possibly because of differences 
in their prior beliefs). As discussed above, 
idiosyncratic coefficients can be estimated 
from equation (6). The results are shown in 
Table 3. 

Table 3 once again shows that exporters 
and trading companies are significantly het- 
erogeneous for each of the three horizons. 
However, the differences came from the bi- 


ases in the individual (constant term) effects, 
not from the idiosyncratic coefficients of the 
lagged variables. Importers for the one- 
month horizon and banks for the six-month 
horizon also show the individual (constant) 
effect, as in Table 2, panel A, but fail to 
Show the idiosyncratic coefficients on the 
lagged variables. Therefore, the heterogene- 
ity is more like a constant bias rather than 
the differences in reacting to the recent 
changes in the exchange rate. Table 2 and 
Table 3 show solid evidence for heteroge- 
neous expectation formations among market 
participants. 


C. Discussion: Heterogeneity and 
Rational Expectations 


Most of the modern theory of finance and 
macroeconomics assumes the existence of a 
representative agent whose decision is an 
aggregate of market participants. In fact, the 
hypothesis of rational expectations would 
require that market participants be homoge- 
neous in their formation of expectations since 
the true stochastic process is unique. There- 
fore, findings of heterogeneity in this section 
cast some doubts on the homogeneous agent 
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framework commonly used in finance and 
macroeconomics, 

One might argue that if agents have pri- 
vate information that econometricians do not 
observe, the existence of individual effects 
may not be inconsistent with rational expec- 
tations. However, important news and vari- 
ables in the foreign exchange market are 
generally common knowledge. In fact, even 
if the individual information sets are differ- 
ent, the difference in expectations condi- 
tional on a common (i.e. intersection of) 
information set should be unbiased. The 
constant term, which detects individual ef- 
fects, is certainly contained in the common 
information set. Thus, the finding of signifi- 
cant individual biases rejects rational expec- 
tations. 

Put differently, under the assumption of 
rational expectation but private information, 
the forecast differences across individuals, 
that is, the dependent variables in equations 
(3) and (4), must be serially uncorrelated, 
contrary to our findings, provided that lagged 
group average forecasts are part of the com- 
mon information set (which is the case in 
the JCIF survey as explained in Section II, 
Part A.) 

One possible explanation of heterogeneity 
consistent with rational expectations would 
be a slow learning process due to a strongly 
biased prior. However, one has to model a 


learning process to assert this. Then, we - 


would be able to discuss how biases can be 
related to individual priors and learning pro- 
cesses. This is beyond the scope of this 
paper. 


D. Discussion: Wishful Expectations 


Having established heterogeneity, a dis- 
cussion of why certain market participants 
have depreciation or appreciation biases is 
in order. From Table 2, we notice some 


SI owe the observation in the paragraph to an anony- 
mous referee. There have been some investigations ex- 
amining whether diverse expectations can be rational 
depending upon agents' information sets (see, for exam- 
ple, Feldman, 1987; Marcet and Sargent, 1989; and 
Frydman, 1982, 1987). 
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regularity in the group effects: in the one- 
month-ahead forecasts, exporters have a de- 
preciation bias, while importers have an ap- 
preciation bias. The exporters' forecasts show 
a continuing deviation from the mean, sig- 
nificantly biased toward a yen depreciation, 
as the forecast horizon lengthens. In the 
three-month- and six-month-ahead predic- 
tions, trading companies, as opposed to im- 
porters, show a bias toward appreciation. 

Exporters tend to be long in dollars and 
importers short in dollars. It is difficult to 
completely cover the exposure to the foreign 
exchange risk, since the forward markets ex- 
ist only up to a one-year horizon, and tim- 
ings of trade and financial transactions can- 
not be matched exactly. 

Therefore, exporters wish that the yen will 
depreciate in the future, enabling their profit 
margins to increase and their products to 
compete better in the foreign markets. (This 
argument rests on an assumption of incom- 
plete “pass-through,” which is documented, 
for example, by Paul Krugmen, 1987, and 
Kenichi Ohno, 1988.) Their responses, being 
biased toward a yen depreciation relative to 
the average, seem to agree with their wishes. 

On the contrary, importers’ responses re- 
flect their wish for a stronger yen so that 
their import costs will decrease given incom- 
plete path-through. Note that the group ef- 
fect of trading companies behaves like that 
of import industries. One might think that 
the change in the exchange rate may be 
neutral for trading companies, since they are 
just intermediaries of imports and exports. 
However, the leading Japanese trading com- 
panies handle more imports than exports. In 
1983, the revenues of the leading nine trad- 
ing companies were derived from export-ori- 
ented activities for 20.0 percent, import-ori- 
ented activities for 23.6 percent, domestic 
activities for 40.3 percent, and trade between 
foreign countries for 16.1 percent (Miyohei 
Shinohara, 1986, p. 164). 

Hence, the findings show that market par- 
ticipants apparently form “wishful expecta- 
tions.” (A “Chicago test” for the validity of 
survey data would be to check whether 
money is where the mouth is. But the result 
here shows that people “put the mouth where 
the money is.”) This “wishful expectation” 
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—or an “optimist” view in John D. Hey’s 
(1984) sense— may be a reflection of nonra- 
tional honest mistakes in expectation forma- 
tion. A straightforward interpretation would 
be for respondents to mix wishful thinking 
with objective forecasts. However, there are 
a few deeper explanations of wishful expec- 
tations.” 

The Japanese manufacturing and trading 
companies usually set an in-house exchange 
rate for internal accounting, and the rate can 
be used for coordinating the sales depart- 
ment with the other departments. It is possi- 


ble that these in-house rates are heteroge- 


neous, and moreover are slightly biased so 
that the sales department is encouraged. The 
survey responses from these companies may 
be influenced by the biased in-house ex- 
change rate, although the respondent is not 
from the sales department. 


One might think that intelligent people like profes- 
sional traders and dealers can separate wishful thinking 
from scientific forecasts. However, there is some evi- 
dence in the psychology literature, kindly suggested by 
Kenneth J. Arrow, that wishful thinking is rather com- 
mon in social cognition and views of the self. ` 


Theories of the causal attribution process, pre- 
diction, judgments of covariation, and other 
tasks of social inference incorporated the as- 
sumptions of the naive scientists as normative 
guidelines with which actual behavior could be 
compared. 

It rapidly became evident, however, that the 
social perceiver's actual inferential work and 
decision making looked little like these norma- 
tive models. Rather, information processing is 
full of incomplete data gathering, shortcuts, 
errors, and biases. In particular, prior expecta- 
tions and self-serving interpretations weigh heav- 
ily into the social judgment process. (Taylor and 
Brown, 1988, p. 194, with emphasis added) 

"One might think that there may be self-selection 
among entrepreneurs and dealers: Those who are opti- 
mistic about the yen appreciation (depreciation) de- 
velop import (export, resp.) business. However, the JCIF 
polls include only leading companies, so that it is dif- 
ficult to imagine that they change their types of business 
because of exchange rate expectations. Those who are in 
charge of foreign exchange expectations and trades in 
those companies are usually in-house staff, who are 
subject to a lifetime employment practice. It is hardly 


the case in Japan that foreign exchange professionals | 


hop companies according to their biases in expectations. 


JUNE 1990. 


If the announcement of the JCIF survey is 
very influential on the market, the respon- 
dent may be induced to try manipulating the 
announced survey result by answering with 
biased forecasts. Exporters respond to the 
JCIF by announcing a depreciated rate, but 
only slightly depreciated so as to avoid obvi- 
ous detection, in the hope that the survey 
mean is biased toward depreciation. Ex- 
porters hope that the mean expectation with 
an “unexpected” depreciating bias could 
cause others to start selling yen, thus creat- 
ing a self-fulfilling prophecy; if importers 
understand that exporters have incentives to 
lie, then importers would counter by manip- 
ulating their announcements; and vice versa. 
Thus, as a Nash equilibrium, the mean may 
not be biased after all, although exporters 
and importers are biased. 

Despite its appeal to economists who are 
trained to think seriously about expectation 
and manipulation, this story of a manipula- 
tive motive has a-few shortcomings. First, 
the size of survey, that is, 44 respondents, is 
large enough that a manipulation by one 
respondent is insignificant unless the bias is 
large enough to be easily detected by the 
JCIF. Second, if other participants under- 
stand that exporters and importers have in- 
centives to lie, then they would not take the 
JCIF survey seriously, thereby removing the 
incentive to lie. It may be the case that 
market participants are simply naive in 
forming wishful expectations. 


HII. Rationality of Expectations 


A. Tests of Unbiasedness and 
Orthogonality 


In this section, I will apply standard tests 
of rational expectations to this survey data.® 
First, if the forecasts are rational, the fore- 


For the aspects of econometrics, see Mishkin (1983). 
The same procedure has been applied to the MMS data 
by Dominguez (1986). In this paper, I assume that 
reported forecasts in the survey are the subjective means 
of respondents. However, if agents were reporting the 
medians of a skewed subjective distribution, -hen the 
results of rationality tests could be affected. 
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cast errors should be random. In other words, 
survey forecasts should be unbiased. Second, 
given rational expectations, forecast errors 
should be uncorrelated with (orthogonal to) 
any information available at the time the 
forecast is made. Otherwise, the variable cor- 
related with the ex post error could have 
been exploited to make a better forecast. 

Under the null hypothesis of rational ex- 
pectations, the realized spot rate is the sum 
of a forecast and a forecast error: 


(7) s(t+k) -s*(t k) - h(t,k), 


where A(t, k) is the mean zero forecast error, 
uncorrelated with any variables available at 
t. It is well known that forecast errors, 
h(t,k), would be serially correlated if the 
forecast horizon is longer than the observa- 
tional frequency, that is, k » 2. Therefore, 
rational expectations imply that a — 0 and 
b =1 in the following regression: 


(8) s(t+k)-s(t) 
=a+b(s*(t,k)—s(t))+u(t). 


The test statistics are calculated using the 
Generalized Method of Moments to take 
care of the serial correlations of u(t). Re- 
sults of this unbiasedness test are reported in 
Table 4, panel A. 

Unbiasedness is rejected for trading com- 
panies and insurance companies of the one- 
month horizon, for securities and import 
companies of the three-month horizon, and 
for all groups but banks and import indus- 
tries for the six-month horizon. These rejec- 
tions are evidenced for rejecting a rational 
expectation hypothesis, in that market par- 
ticipants had unbiased forecasts. We would 
miss some rejections if we were only to look 
at the average of the 44 participants, since 
for the one-month and three-month hori- 
zons, rejections by some groups are not de- 
tected in the average for all participants.? 


* However, Muth (1961) originally interpreted ration- 
al expectations as applying only to aggregate expecta- 
tions. 
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The second implication of rational expec- 
tations is the orthogonality: Under the null 
hypothesis, forecast errors, h(t, k) — s*(t, k} 
— s(t t k), are uncorrelated with any infor- 
mation, z(t), at time f. In the literature, the 
past forecast errors s*(f— k, k)— s(t), the 
forward premium, f(t, k)— s(t), or the re- 
cent actual change s(t — k)— s(t) have been 
popular candidates for variables in the infor- 
mation set. I will follow the standard proce- 
dure by regressing the ex post forecast errors 
on these candidate variables: 


(9) s*(t, k) - s(1 Kk) 


— ac b(z(t)—-s(t)) * e(t), 


where z(t) — s*(t - k, k), f(t, k), s(t — k). 
Rational expectations (orthogonality) is a 
null hypothesis of a=b=0. Results of 
the estimation of equation (9), with z(t) = 
s(t—k), and the test of null hypothesis is 
reported in Table 4, panel B. (Results for 
other cases of z(t) are essentially the same 
and not reported here. See Ito, 1988.) There 
are only a few instances of rejections of the 
one-month and three-month horizons. How- 
ever, for the six-month horizon, the rejection 
is unanimous. This is consistent with the 
results of unbiasedness tests. So far, there is 
little evidence rejecting the rational expecta- 
tion hypothesis for the shorter horizons. 

Variables in the information set are not 
restricted to those tested above. When the 
second lagged term is added, the number of 
rejection cases increases dramatically. The 
results of estimating the following equation 
are reported in Table 4, panel C: 


(10) s*(t,k)—s(t+k) 
=a+b,(s(t-—k)—s(t)) 
+ b,(s(t-k-1)—s(t-1)) 
+e(t). 
Table 4, panel C shows rejections for most 
groups in all horizons. Even if the orthogo- 
nality test is conducted at the individual’ 


level, about three-quarters of the individuals 
are judged to be irrational. 
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TABLE 4— TESTS OF RATIONAL EXPECTATICNS 
(A) Unbiasedness 
(i) Estimates and Standard Errors of a and 5; Chisq and signif. for AVE 
1 Month 3 Month ; 6 Month 

a b CHISQ a CHISQ a b CHISQ 
— 0.028 — 0.485 2.59 — 0.043 1.167 521 -0119 0.908 10.09 

(0.017) (0.969) 0.274 (0.034) (1.167) 0.074 (0.041) (0.741) 0.006* 
(ii) Number of Cases in Group Data 

1 Month 3 Mcnth 6 Month 

Fail to reject H (at 1 percent) 4 4 2 
Reject H (at 1 percent) 2 (TRA,INS) 2 (SEC, IMP) 4 (SEC, TRA EXP, INS) 


(B) Orthogonality, past exchange rate movement, with 1 lag 


(i) Estimates and Standard Errors of a and b; Chisq and signif. for AVE 


1 Month 3 Month 6 Month 
a b CHISQ a CHISQ a b CHISQ 
0.004 0.166 3.883 0.042 0.306 9.504 0.114 0.227 18.908 
(0.010) (0.203) 0.144 (0.025) (0.225) 0.009 (0.036) (0.358) 0.000 
(ii) Number of Cases in Group Data and Micro Data 
Group Data Micro Data 
1 Month 3 Month 6Month 1Month 3Month 6 Month 
Fail to reject H 5 3 0 37 26 1 
Reject H (at 1 percent) 1 (EXP) 3 (SEC, EXP, IMP) 6 7 18 33 
(C) Orthogonality, past exchange rate movement, with 2 lags 
(i) Estimates and Standard Errors of a and b; Chisq and signif. for AVE 
1 Month 3 Month 6 Month 
a b c CHISQ a b c CHISQ a b c CHISQ 
0.007 | 0247  —0.3223 3829 0.043 0.330  —0.095 14.35 0112 0.183 0174 21.75 


(0.011) (0.185) (0.207) (0.000) (0.025) (0.220) (0.093) (0.002) (0.034) (0.299) (0.342) (0.000) 


Gi) Number of Cases in Group Data and Micro Data 


Group Data Micro Data 
1 Month 3 Month 6 Month 1 Month 3 Month 6 Month 
Fail to reject H 0 2 (BAN, TRA) 1 (BAN) 11 22 21 
Reject H (at 1 percent) 6 4 5 33. 22 23 


B. Discussion: Peso Problem and 
Bubbles 


Failing the rationality test in small sam- 
ples may not imply that expectations are 
formed irrationally, due to the often in- 
volved caveats of peso problems and bub- 
bles? (See Maurice Obstfeld, 1987, and 


I91f the forward rate is used in place of expectation 
of survey data, as is the case in papers other than ones 
with survey data, risk aversion is another source of bias 


in Faracact errare 


George W. Evans, 1986, and references 
thereof for discussions of these issues.) Sup- 
pose that concitional forecasts were formed 
rationally taking into account a small proba- 
bility of “crash,” but that the trash did not 
occur in the (small) sample. Then forecasts 
appear to have been biased when judged 
from ex post forecast errors. This is known 
as the peso prcblem. The sample size of this 
study is admittedly small (about two years), 
and this could be a reason for a rejection of 
rationality. 
However, the sample period for this study 


imalwdan a lae waetnd nftaw tha Mann 
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Agreement of September 1985. (See 
Takatoshi Ito, 1987, for news analysis of the 
exchange rate volatility after the Plaza 
Agreement.) The process of the sharp yen 
appreciation after the Plaza agreement can 
be regarded as a long-awaited “crash” of the 
dollar value. However, market expectations 
underestimated the magnitude of this crash.! 


In summary, this section shows that most | 
of the market participants violate necessary . 


conditions of the rational expectations hy- 
pothesis. However, these results should be 
interpreted with caution, because they could 
be a case of a peso problem. 


IV. Expectational Twist 
A. Introduction to Twist and Consistency 


In this section, the consistency of expecta- 
tion formation of short- vs. long-term expec- 
tations, as discussed in Froot and Ito (1989), 
is explored. Frankel and Froot (1987b) 
showed that the short-term expectations are 
of the bandwagon type, while the long-term 
expectations show some regressive character- 
istics.!? Thus, I will first replicate their re- 
gressions, and then raise the question of how 
to interpret a “twist” found in the data. 

However, Frankel and Froot (1987b) ig- 
nored the consistency issue of short- and 


“Yn that sense it may seem inappropriate to invoke 
the peso problem explanation in the usual sense for this 
period. The biased forecast errors resulting from the 
underestimation of the magnitude of a crash could be 
called the “Plaza problem.” Both “peso problems,” 
which arise when am infrequent crash did not happen, 
and “Plaza problems,” which occur when an infrequent 
crash did happen, are small sample problems. More- 
over, the latter is a special case of peso problems: A 
policy switch, including interventions, could halt a dol- 
lar decline and reverse the movement with a small 
probability. That did not happen in the small sample. 

P Frankel and Froot (1987b) showed, using the MMS, 
the Economist, and the AMEX data. sets, that short- 
and long-term expectations seem to have different char- 
acteristics. The data set with the short-term horizon 
yields the estimates indicating a bandwagon type (ex- 
trapolative) effect, while the data set with the long-term 
horizon yields results with a more regressive nature. 
However, the direct comparison of the short-term and 
long-term horizons is limited in their study because of 
the spread of horizons across different data sets and 
different sample periods. 
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long-term expectations formation: If expec- 
tations formation is internally consistent, a 
long-term forecast should be identical to the 
results of sequential substitutions of short- 
term forecasts, given a function of expecta- 
tions formation. The consistency becomes a 
testable hypothesis in the form of cross- 
equation constraints on the coefficients of 
the short- and long-term forecast equations. 
This consistency problem is parallel to the 
cross-equation constraints implied in the 
context of the interest rate term structure 
(Thomas Sargent, 1979) and in the context 
of uncovered interest parity (Ito, 1988a; 
Takatoshi Ito and Danny Quah, 1989). Froot 
and Ito (1989) have applied the test of con- 
sistency to the data collected by Money 
Market Service (MMS) for one-week- and 
one-month-ahead ‘forecasts and the 
Economist Financial Report for three-, six-, 
and twelve-month forecasts. They also used 
the averages from the JCIF data. In this 
paper, the same test is applied to the group 


means of the JCIF data, where one-, three-, 


and six-month forecasts are available. 


B. An Example of Extrapolative 
Expectation with One Lag 


First, let us consider, following Frankel 


and Froot (1987b), the extrapolative expec- 
tation with one lag: 


(11) s*(t,k)—s(t) 
uerborum ne 


. In (11), b « 0 implies a (destabilizing) band- 


wagon effect, while b > 0 implies a stabiliz- 
ing expectation formation. Results are re- 
ported in Table 5, which shows that the 1 
percent yen appreciation would make the 
average individual expect a further 0.01 per- 
cent appreciation in one month. However, 
the table also implies that the shock would 
make the same individual form an expecta- 
tion of a 0.13 percent depreciation in three 
months and a 0.22 percent depreciation in 
six months. Although different groups have 
different biases, the pattern of coefficients, 


b(one month) l 
< b(three months) < b(six months), 
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TABLE 5— EXPECTATION FORMATION, EXTRAPOLATIVE EXPECTATION WITH ONE LAG 
s$(t, k) - s(t) =a + b{s(t-1)— s(0) + EC 
Cases: b «O belief in a bandwagon effect 
b = 0 belief in constant appreciation 
b > 0 distributed lag form 
H: a=b=0 belief in random walk 
Estimates of a and b and their (standard errors) 
CHISQ for Hypothesis H: chisq(df = 2) and (significance level) 
ARI process on E is assumed. RHO is not reported here. 
Horizon (k) 
1 Month 3 Month 6 Month 
a b CHISQ a b CHISQ a b CHISQ 
AVE —0.015 —0.011 49.42 —0.017 0.137 9.60 — 0.002 0.220 5.49 
(0.002) (0.035) 0.000 (0.005) — (0.050) 0.000 (0.009) (0.066) 0.002 
BAN — 0.014 — 0.008 62.85 ` —0.019 0.087 6.75 —0.011 0.134 2.06 
(0.001) (0.044) 0.000 (0.005) — (0.056) 0.003 (0.009) (0.077) 0.139 
SEC —0.011 —0.058 8.05 —0.011 0.149 2.46 0.006 0.224 1.69 
(0.003) (0.061) 0.001 (0.005) — (0.108) 0.097 (0.009) — (0.141) 0.195 
TRA — 0.020 —0.029 69.67 —0.027 0.067 21.42 —0.011 0.194 2.41 
(0.002) (0.068) 0.000 (0.004) (0.096) 0.000 (0.006) (0.120) 0.101 
EXP — 0.009 0.061 18.82 — 0.004 0.168 3.19 0.016 0.304 6.77 
(0.001) (0.039) 0.000 (0.004) — (0.068) 0.050 (0.010) — (0.095) 0.003 
INS — 0.018 0.015 17.20 — 0.027 0.237 18.42 —0.001 0.376 5.79 
(0.003) (0.067) 0.000 (0.005) — (0.068) 0.000 (0.010) (0.111) 0.006 
IMP —0.018 —0.134 28.43 —0.019 0.285 6.95 — 0.008 0.288 3.60 
(0.003) (0.075) 0.000 (0.006) — (0.108) 0.002 (0.008) — (0.110) 0.035 
Number of Cases in Micro Data . 
Horizon 1 Month 3 Month 6 Month 
b >> O sig. 3 8 10 
b> 0 insig. 18 29 31 
b < 0 insig. 20 7 3 
b « 0 sig. 3 0 0 





in expectation. Consider estimating the fol- 
lowing k-month (Kk =1,3,6) expectation for- 


is almost unanimously observed. Hence, we 
may draw a conclusion, similar to that of 


Frankel and Froot (1987b), that the long- mations: 
term expectation is more stabilizing than the 
short-term expectation. (12) s*(t,k) 2d, * (1 a)s(t) 


It is easy to show that so long as the 
extrapolative expectation with one lag is as- 
sumed, a twist, that is, an appreciation in the 


+ b,s(t —1)  eus(t —2) uu (1), 


short run and a depreciation in the long run, 
in expectation is impossible. Put differently, 
the assumed formulation is not rich enough 
for the observed twist to be consistent. 


C. Consistency Tests 
Next, we adopt a distributed lag expecta- 


tion formulation with more than two lags, a 
formulation rich enough to produce a twist 


where u,(!) are independent, random vari- 
ables representing observation errors. After 
substitution, using the iterated projection (see 
Froot and Ito, 1989), the consistency restric- 
tions as cross-equation constraints are de- 
rived (see Table 6). 

Each of two sets of cross-equation restric- 
tions, one-month vs. three-month, and 
three-month vs. six-month, is tested sepa- 
rately, and the results are reported in Table 
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TABLE 6— CONSISTENCY TESTS, ONE-MONTH VS. THREE-MONTH EXPECTATIONS 


55 (5,1) s(1) = d, + a,s(t) + bs(t - 1) as(t—2) 
5$(63) — s(t) = d3 + ag3(t)+ bs(t —1) + ess(t —2) 


H: Consistency restrictions: d4 — (2+ a, + b,+((1+ a)?))d, 
43 =¢,-1+(2(1+ a,)b,)+ (0. a,)) 


b = (1+ ayy) + (27) - (5 (. 
& = (a(l a)? Y+ (ba) 


tay) 





Estimates and (Standard Errors) 


1 Month (OLS) 
di a, b cy 
AVE — 0.0261 0.0003 0.0000  —0.0003 
(0.0050) (0.0001) (0.0002) (0.0001) 


In the Group data, H is rejected in all 6 groups. 
In Micro data, H is rejected in 42 out of 44 individuals. 


Three-Month vs. Twelve-Month Expectations 

s; (1,3)— s(t) = dy + a4s(t) + bs(t — 3) + c3s(t — 6) 
57 (1,6) — s(t) = dg + ags(t)+ bes(t — 3) + egs(t — 6) 
dg = (2+ a3); 

ag 7 (1+ a)? - b-1 
b; = (1+ a4)54 + c3 

cs = (1+ a3) cy 


H: Consistency Restrictions: 


3 Month (OLS) 
d a b c 
AVE 0.0218 — 0.0009 0.0004 0.0002 
(0.0220) — (0.0003) (0.0003) (0.0002) 


In Group data, H is rejected for all 6 groups. 
In Micro data, H is rejected for 42 out of 44 individuals. 


6. The consistency is overwhelmingly re- 
jected in this formulation, too. 


D. Discussion: Inconsistency 


I hasten to add a caveat. If we misspecify 
the expectation formation, then the results in 
this section are not valid. For example, if a 
policy switch, such as a monetary tightening, 
is expected to occur around the second 
month from the point of forecasting, it is 
“consistent” to have a twist, although the 
test in this paper would not capture it. 

One might think that people use different 
economic variables for forecasting the future 
spot rate with different horizons. For exam- 
ple, chart (technical) analysis, which is a 
special case of (univariate) distributed lag 





3 Month (GMM) H 
d4 a4 by C3 CHISQ 

—0.0254  —0.0008 0.0005 0.0000 21821 

(0.0071) — (0.0071) (0.0001) (0.0001) 0.000 
6 Month (GMM) H 

D. A B C CHISQ 
0.0508 — 0.0019 0.0006 0.0008 570.1 

(0.0285) (0.0003) — (0.0002) (0.0001) 0.000 


expectation formations, is used for the 
Short-term horizon, but other factors come 
into consideration for the long-term horizon. 
A list of other factors includes trade bal- 
ances, inflation rate differentials, interest rate 
differentials, fiscal deficits, and policy 
switches. However, if these factors are rele- 
vant in the long run, they should be relevant 
in the short run, although the effect may be 
small in the short-run.? l 


suppose that uncovered interest parity (no risk 
premium) holds. An interest rate differential of 6 per- 
cent implies that the exchange rate changes by approxi- 
mately 3 percent in six months, a significant and easily 
detectable change. However, it predicts only a 0.5 per- 
cent change in one month, a change that is small and 
may escape detection. 
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V. Concluding Remarks 


In this paper, newly available survey data 
on the expected exchange rate in the Tokyo 
market were used to test several hypotheses 
regarding expectation formations. The JCIF 
data set is better than the data sets previ- 
ously used by Frankel and Froot (1987a,b) 
in that the survey includes the expectations 
of different industries, not only of banks and 
financial institutions but also of exporters 
and importers. Moreover, individual re- 
sponses can be used to avoid the aggregation 
problem altogether. 

Following are the major findings of this 
paper: First, market participants are hetero- 
geneous, with constant-term biases in their 
expectation formations. Second, “wishful 
expectations” were found: exporters (im- 
porters) are biased toward yen depreciation 
(appreciation) relative to others. Third, when 
the usual rationality tests were applied, 
among different groups, the unbiasedness of 
expectation was rejected in a few instances 
for shorter horizons and unanimously re- 
jected in the six-month horizon. Orthogonal- 
ity was soundly rejected. We may conclude 
that we have strong evidence against rational 
expectation formation in the Tokyo foreign 
exchange market. Fourth, consistency is 
overwhelmingly rejected given that the ex- 
pectation formation is a distributed lag 
structure with two lags. 

The present paper suggests that it is im- 
portant to consider a model with heteroge- 
neous agents for the international financial 
market. I hope that this paper stimulates the 
research in this direction. 
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A Non-Parametric Analysis of Productivity: The Case of 
U.S. and Japanese Manufacturing 


By JEAN-PAUL CHAVAS AND THOMAS L. Cox* 


A non-parametric analysis of technology, technical change, and productivity is 
presented in the context of cost minimizing behavior. A number of non-parametric 
tests concerning the existence and nature of technical change are applied to U.S. 
and Japanese manufacturing data. Non-parametric measures of technical change 


are presented. (JEL 226) 


The issues of productivity measurement 
and technical change have received consider- 
able attention in the literature (for exaniple, 
Dale W. Jorgenson and Zvi Griliches, 1967; 
Ryuzo Sato, 1970; Hans Peter Binswanger, 
1974; Rodney E. Stevenson, 1980; John R. 
Norsworthy and David H. Malmquist, 1983). 
Using a primal approach, technical progress 
is typically measured in terms of changes in 
output that are not attributable to changes 
in inputs. Alternatively, dual approaches 
measure technical progress in terms of the 
contributions to changes in cost (profit) not 
attributable to changes in input prices and 
output levels (output prices). This suggests a 
need to distinguish between the contribu- 
tions of technical progress and those of re- 
turns to scale (for example, George J. Stigler, 
1961) and input prices (for example, Zvi 

` Griliches, 1958). 

However, in the absence of a priori hy- 
potheses concerning the structure of techni- 
cal change, Peter Diamond et al. (1978) have 
shown the non-identifiability of the elastici- 
ties of substitution and the bias in technical 
change.! This suggests that a set of produc- 


*Respectively Professor and Assistant Professor of 
Agricultural Economics, University of Wisconsin-Madi- 
son, WI 53706. The seniority of authorship is equally 
shared. We wish to thank two anonymous reviewers for 
very helpful comments. Remaining errors are the joint 
responsibility of the authors. 

1Technical change is said to be Hicks neutral if the 
marginal rate of substitution between inputs is not 
affected by the change (for example, see Charles Black- 
orby et al. 1976). Non-neutral technical change is said 
to be biased. For example, biased technical change is 
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tion data can be generated by more than one 
combination of technology and technical 


change and raises serious identification and > - 


measurement issues. Depending on the data 
as well as maintained hypotheses concerning 
the structure of technology, the measure- 
ment of technical change may be exactly 
identified, identified up to a range of indeter- 
minacy, or not identified at all (Diamond 
et al). This identification problem suggests 
that traditional parametric analysis of tech- 
nology and technical change may give results 
that are sensitive to the particular parametric 
specification utilized.? In this context, analy- 
ses of technology change that are less depen- 
dent on the parametric specification of the 
model could be insightful. 

The objective of this paper is to present a 
non-parametric analysis of technology, tech- 
nical change and productivity in the context 
of cost minimizing behavior. The paper first 
extends the non-parametric analysis of pro- 
duction decisions (Sydney N. Afriat, 1972; 
Giora Hanoch and Michael Rothschild, 
1972; Hal R. Varian, 1984) in the context of 
technical change using an augmentation hy- 


labor saving (capital using) if the marginal product of 


capital rises relative to the marginal product of labor, 
ceteris paribus. 

?Note that, in general, the use of flexible functional 
forms (for example. translog or generalized Leontief for 
a production, cost, or profit function) does not help 
solve the identification of technical change problem. 
What is needed to solve the identification problem is 
a priori information about technology or the nature of 
technical change (see Diamond et al.). 
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pothesis, that is, where technical progress is 
assumed to increase the effectiveness of in- 
puts in the production of output (Section I). 
In particular, assuming cost minimization, a 
series of non-parametric tests for the exis- 
tence and nature of technical change are 
derived (Section IT). Empirical implementa- 
tion of these non-parametric tests is dis- 
cussed in Section III. An application of the 
proposed methodology to U.S. and Japanese 
manufacturing data from Norsworthy and 
Malmquist is presented in Section IV. 

This analysis provides useful insights on 
the nature of technology and technical 
change. First, it allows an empirical investi- 
gation of various separability hypotheses 
concerning the production function. Second, 
non-parametric tests of Hicks neutral techni- 
cal change are presented. Third, non-para- 
metric measures of productivity changes 
based on the augmentation hypothesis are 
obtained. Although our approach does not 
solve the identification problem raised by 
Diamond et al, our results do not depend 
on a particular parametric specification of 
production relationships. Such results are 
contrasted with the corresponding para- 
metric results obtained by Norsworthy and 
Malmquist using the same data. Finally, un- 
der the augmentation hypothesis, non-para- 
metric measures of technical change can be 
calculated. The measures obtained for U.S. 
and Japanese manufacturing appear reason- 
able and illustrate the empirical usefulness 
of the proposed methodology. 


I. Non-Parametric Tests of 
Production Decisions 


In order to develop non-parametric tests 
of production, consider a cost-minimizing 
firm facing the following problem: 


(1) Clp, y, A) = min ( ps: fva) 
< g(x, A), x20), 


where x is an (n X1) input vector, p is the 
(n X 1) vector of corresponding prices, y de- 
notes output, and f(y, A) x g(x, A) repre- 
sents technology, 4 being a vector of 
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technology indices. The function g(x, A) is 
assumed to be strictly increasing and con- 
cave in x throughout the paper. Also, we 
assume the existence of a point x = 0 satisfy- 
ing f(y, A) « g(x, A) (M. Slater's 1951 con- 
dition). Equation (1) characterizes the pro- 
duction decisions of a firm that behaves in a 
way consistent with the cost minimization 
hypothesis, C( p, y, A) being the indirect cost 
function. 

Assume that the firm is observed choosing 
x T times, x,,..., Xy, each observation being 
a feasible point (ie., satisfying f(y, 4,) < 
g(x,, A,), x, 2 0) associated with a situation 
t characterized by input prices p,, output y, 
and technology A, t —1,..., T. It is of inter- 
est here to investigate under what conditions 
the decision set Q= (x,,..., xz) is consis- 
tent with cost minimization as stated in (1). 
Throughout the paper, we take cost mini- 
mization as a maintained hypothesis. In this 
context, we present non-parametric tests of 
production decisions. These non-parametric 
tests consist in checking the consistency of 
the actual decisions Q= (x,,..., xp} with 
the cost minimization problem (1) without a 
parametric specification of the -production 
technology. 

A basis for these non-parametric tests is 
presented in the following proposition (see 
the proof in the Appendix). 


PROPOSITION 1: Given a set of feasible 
decisions Q= (xy,...,xp), each x, corre- 
sponding with, a situation (p, A, yj), t= 
1,..., T, then: 


a) If x, solves min( pix: f(y, A,) < 06 A,), 
xz 0), g(x, A) being a strictly increasing 
and concave function of x, then there exist 
À, such that 


Pi 
(2a) FOY A,) E g(x,. A) + x Gs 7x) 
: i t 
>0 
(2b) A,>0 
s,t=1,...,T. 


b) if (2a) and (2b) are satisfied, then there 
exists a function G(x, A) such that x, 
solves min( p/x:f( y, Aj) x GG, Aj), x € 
Q}. 


f ] 
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Equations (2) represent a set of necessary 
and sufficient conditions for the decisions 
Q = (x,..., Xr} to be consistent with the 
optimization problem (1) for some produc- 
tion technology. Testing for consistency con- 
sists in checking whether there exists a so- 
lution to the set of inequalities in (2). 
Although this non-parametric test is not a 
statistical test (with associated probability 
statements), it can provide useful informa- 
tion on the nature of technology. More im- 
portantly, by allowing for technical change 
the above results extend the non-parametric 
analysis of production decisions proposed by 
Hanoch and Rothschild or Varian (1984) 
(see equations (5), (6), and (7) below). 

The nature of technical change assumed in 
(1) is quite general since it does not specify 
a priori how the technology indexes shift the 
production function. However, in order to 
implement the non-parametric test given in 
(2), some knowledge of the nature of techno- 
logical change (more specifically, some 
knowledge of the functions f(y, A) and 
g(x, A)) must be hypothesized. These hy- 
potheses must be precise enough to make 
(2a) empirically tractable, but without re- 
quiring a complete parametric specification 
of technology (otherwise we would be back 
in a more traditional parametric analysis of 
technology). 

In this context, the "augmentation hy- 
pothesis" appears particularly attractive for 
modeling technical change: that is, technical 
progress increases the effectiveness of inputs 
in the production of output. This can be 
expressed by rewriting the function g(x, A) 
as 


g(h(x, A))= gh; (xi A,), ha( Xz, 42); 
h,(x,4,)], 


where A;(x;,4,) can be interpreted as a 
measure of the effectiveness of the ith input, 
i 1,..., n. Defining Y= f(y, Ap), the cost 
minimization problem (1) then becomes 


(3a) C(p.Y. A) 
= min { px: Y s g(h(x, A)), 


x20). 
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We assume here that the A;(x;,A;) are 
strictly increasing functions of x; i=1,..., n. 
Given the augmented input function X, — 
h,{x, A,), denote its inverse function by 
x; = H,(X,, Aj), i=1,...,n. The above cost 
minimization problem can then be alterna- 
tively expressed as 


(35) C(p,Y, A) l 
= min { p'H(X, A): Y< g( X), 
H(X, A) 20), 


where X—(X,..., XY, A-(A,..., Any, 
and H —(H,,..., H,. 

Given this alternative formulation of (1) 
under the augmentation hypothesis, expres- 
sions (2) become 


(4) Y,-Y,*(p/X) 
x[H(X, A) - H(X, A,)] 20 
(4b) d,>0 
s,t=1,...,T, 


which represent necessary and sufficient con- 
ditions for production decisions to be consis- 
tent with cost-minimizing behavior. Equa- 
tions . (4) provide a non-parametric test 
of production decisions in the sense that 
a priori specification of the functional form 
g(X) in the characterization of the produc- 
tion technologv is not required. However, 
checking the existence of a solution to the 
set of inequalities in (4) requires a priori 
information on the functions Y= f(y, Ao) 
and X;=h,(x;, 4j), i—1,...,n. 

Two simple alternative specifications for 
the function f(y, A) and h,(X;, A;) are the 
scaling hypothesis corresponding to the mul- 
tiplicative specification Y= y/Ao, X; = A; 
x; and the translating hypothesis corre- 
sponding to the additive specification Y= 
y — Ag, X; = A; + x; Assuming output trans- 
lating, then Y, — Y, in (4a) becomes Y, — Y, = 


X07 Ao, — Ys + Aos- Similarly, under input 


translating, we have H(X, A) = X,— A,— 
X, À,— A, Substituting these expressions 


into (4a) yields empirically tractable non- 
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parametric tests of production decisions. 
Under the maintained hypotheses of cost 
minimization with additive input. and/or 
output augmenting technical change, this will 
provide a basis for the empirical analysis of 
productivity presented below.’ 

Note that the input augmentation hy- 
potheses (either alone or with output aug- 
mentation) allow for biased technical change 
while output augmentation (without input 
augmentation) generates a Hicks neutral 
specification (where the marginal rate of 
substitution between inputs is unaffected by 
technical change), The next section presents 
non-parametric tests of production decisions 
and their interpretation under the additive 
augmentation hypothesis. 


If. Non-Parametric Analysis of Technology 
and Productivity 


Taking cost minimization as a maintained 


hypothesis, the above results generate .non- . 


parametric tests which can be used to inves- 
tigate the nature of technology and of tech- 
nical change. 


A. Weak Separability Test 


A production function is weakly separable 
in x if it takes the form g[G(x), A,.], where 
G(x) is an aggregator function of x, g is 
strictly monotonic in G, and “.” represents 
other arguments (besides x and A) of the 
production function. In this case, the choice 
of x for a competitive firm must be made 


"Alternatively, the scaling hypothesis corresponding 
to multiplicative input/output augmenting technical 
change could be used. Under the scaling hypothesis 
output scaling implies Y= y/Ag with Y, — Y, = y,/Ao; 
— y, /Ags, while input scaling implies ( X; = A;-x;) with 

H;(X,, Aj) = X/A; = x; A;/A;. Substituting these ex- 
pressions into (4) “yields non-parametric tests of pro- 
duction decisions. However, in the presence of input 
augmentations and scaling, the inequalities (4) are non- 
linear in the technology indices A. The nonlinearities 
make the scaling hypothesis less empirically convenient 
than the translating hypothesis in the analysis of techni- 
cal change (see Section III below). As a result, most of 
the results presented in this paper focus on the translat- 
ing hypothesis. Selected results from the scaling hypoth- 
esis are briefly discussed in footnotes. 
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such that x, is a solution to min{ p/x: f, € 
G(x), x = 0) for some f, which is a special 
case of (1). Then, equations (2) become 


(5) Aft Ypi Xs- 
A,>0 s,t-l,..,T, 


x,) z 0, 


where y,=1/A,. This is the test for weak 
separability presented by Varian (1984, 
p. 588). If the vector x is a subset of inputs, 
then (5) allows non-parametric testing of the 
separability of selected inputs in the produc- 
tion function. 

On the other hand, if the vector x includes 
all of the inputs, then (5) amounts to a test 
of the weak separability of all inputs from 
the technology indices A. But this weak sep- 
arability is equivalent to stating that the 
marginal rate of substitution between any 
two inputs is independent of technical 


‘change, which is the definition of Hicks neu- 


tral technical change in its most general form 
(Blackorby et al.’s extended Hicks neutral- 
ity). Thus, condition (5) allows non-paramet- 
ric testing of Hicks neutral technical change 
without an ad hoc specification of fechas: 
ogy or its change. 


B. Testing for the Existence of 
Technical Change 


In the-absénce of technical change, rewrite 
the cost minimization problem (1) as 
Min( p'x: y x g(x), x z a Then, equations 
(2) take the form 


(6) y 3 3010 7 x) 2:0, 
4670, s,¢=1,...,T, 
` where y, =1/A,. This is Varian's strong ax- 


iom of cost minimization (see Varian, 1984, 
p. 583). While the strong axiom of cost mini- 
mization (6) holds for any concave produc- 
tion function, it takes a more restrictive form 
under the additional assumption of constant 


return to scale (CRTS) technology since the 


marginal cost of output and average cost are 
constant and equal to each other. Noting 
that the multiplier A in (6) is the marginal 
cost of output and average cost is p'x/y, it 
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TABLE 1 —NON-PARAMETRIC TESTS OF COST MINIMIZATION UNDER ADDITIVE 
AUGMENTATION (TRANSLATING) HYPOTHESES 


Translating and Cost Minimization 
— Output Translating (Y = y — 49): 


(8a) Ao, = y+ Ao, + Y Dt x, 
where y,=1/A, 

— Input Translating 5 X= x, +A): 

(8b) A7 Ye) + Y Piel Xis + Ais — X, 
i-1 
— Input and Output Translating: 
n 

(8) ACh 7 dor Ya + Aos) + Y Du + Ais 7 Xu — 


i=l 


UC 


—x,)20,A,>0, 


A4) 20, A, >0 


Ay) 20, A, > 0 


Translating and Cost Minimization Under Constant Return to Scale 








— Output Translating (Y= y — Ag): 
Pi Xs 
OD FE Aer ht Aon = 0 
— Input Translating ( X; = x; + 4,): 
n 
(9b) 2 Pit(Xig + Ais — 


i-1 
—Input and Output Translating: 


PiX 
Y Pil Xs + Aig — Au)— 


i-1 


(9c) 


t 


then follows that y, =1/A,= (y)/( pix). 
Hence, under CRTS (6) becomes 


J 
(7) Mp7 Ys + = -pil x50) z 0. 
Pix 


t7 


This can be alternatively . expressed as 
Y ( PiXs/PiX;) = Ys which is Varian’s non- 
parametric test for constant return to scale 
(Varian, 1984, p. 585). Thus, for a particular 
data set, finding that the inequalities in (7) 
are satisfied would imply the existence of a 
stable linear homogeneous production func- 
tion that rationalizes the data in a cost mini- 
mization framework. If the inequalities in (7) 
are not satisfied then either the production 
function is not linear homogeneous, or the 
production function is not stable, or cost 
minimization is violated (or some combina- 
tion of the above). In the case where linear 
homogeneity and cost minimization are as- 
sumed as maintained hypotheses, violations 
to the inequalities in (7) would provide non- 
parametric evidence that the production 
function is not stable, that is, that technolog- 
ical change has taken place. 


E 


t 


» 
Aj) pix A 20 
t tt y, 


J& 7 Ags + Aor) 20 


C. Testing for the Nature of 
Technical Change 


We have argued above that the weak sepa- 
rability test (5) can be used to investigate 
non-parametrically whether technical change 
is Hicks neutral or biased. However, it is 
often of interest to have more precise results 
on the nature of technical change. We now 
pursue this in the context of the translating 
hypothesis discussed in Section I. The main 
results are summarized in Table 1. 

Under cost minimization and the translat- 
ing hypothesis (where Y — y — A, for output 
translating and X; — A; + x, for input trans- 
lating), inequalities (4) then yield equations 
(8) in Table 1. Under the additional assump- 
tion of constant return to scale, noting that 
the marginal cost of output is A,, it follows 
that A,= p/x,/y, Cost minimization and 
constant return to scale then yield equations 
(9) in Table 1. Under cost minimization and 
the translating hypothesis, the inequalities 
(8) (or (9) under CRTS) therefore provide a 
basis for a non-parametric analysis of pro- 
ductivity and technical change. In particular, 
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note that in the absence of input translating 
(A, = 0, i —1,..., n), then output translating 
(the A's) characterizes Hicks neutral techni- 
cal change. 

A special case of (9c) will be of interest in 
this paper (see below). This involves the case 
of additive separability where the production 
function takes the form y=g(x)+ Ag 
f(Z), where Z= (Zp... Z4) Zi- A;tz, 
and x and z are inputs. This implies that the 
production function is strongly separable in 
x and z, and that technical change takes 
place in an output translating fashion as well 
as an input translating fashion but only with 
' respect to z. Under the translating hypothe- 
sis, this is the form of the production func- 
tion that is required to justify the real value 
added measurement of productivity (see 
Michael Denny and J. Douglas May, 1978). 
Given this form of the production function, 
cost-minimizing behavior implies 


min { w'x:g x g(x), x 20) 
and 
min ( v'z: f < Ag 


*f(z 4 z+ Ag,..-), 220}, 

where w and v are the price vectors for x 
and z, respectively, and y= g+ f. Under 
the assumption of constant return to scale 
(where A, — (w/x, + 0/2,)/y,) then, from (6) 
and (8c), the non-parametric results associ- 
ated with this additive separability case are 


Yı 
, , 
WX, + UZ, 


(10a) 8,— 8, * 


ewe (Xs x) 20 


, , 
W/X, t Uz, 


(10b) (fi Ag fs t Aos) 
A t 
+ Y vy ( Zi, + Ais 7 Zu 44) z0 
fet 
(10c) We Bethe. 
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Under constant return to scale, the inequali- - 
ties (10) generalize Varian’s test of additive 
separability (see Varian, 1984, p. 590). by 
allowing for technical change in the context 
of the translating hypothesis. 


D. Goodness-of-Fit Measures 


Note that each of the non-parametric tests - 
presented above corresponds to a particular 
null hypothesis. If the stated inequalities are 
satisfied for a given set of data, then the 
corresponding null hypothesis would be ac- 
cepted. However, if there is evidence against 
the null hypothesis, that is, if the stated 
inequalities cannot be satisfied, then it would 
be useful to have some measure on the 
strength of the evidence against the null 
hypothesis. Here, we propose to rely on a 
goodness-of-fit measure originally developed 
in the consumer context by Afriat (1967) 
and discussed in Varian (1988). 

To illustrate the approach, consider the 
case of cost minimization under CRTS 
as reflected by the non-parametric test (7). 
Define the goodness-of-fit measures* 
(e*,...,e#) as the solutions of the following 
problem 


(11) cr 


E 





"Pi (xs = €x) 20, 


, 
ove 


If. (1+ prx,/ prx, — Y/Y) 29, $=1,...,T, 
note that (11) implies that ie = Min(1, 
Es DiX s / PiX — Vs/ 5h) If e, i 
1,..., T, in (11), then the inequalities i in (7) 
are , satisfied and the corresponding null hy- 
pothesis would be accepted. Alternatively if 
e* «1 for some t, then the inequalities in (7) 
would not be satisfied. In this case, (1— e*) 
can be interpreted as the smallest propor- 
tional reduction in cost of production that 


“In Afriat’s terminology, e* is called an efficiency 
index. 


456 THE AMERICAN ECONOMIC REVIEW 


would have to take place before the input 
bundle x, would be consistent with the non- 
parametric test (7). In other words, (1— e*) 
can be interpreted as the relative difference 
(measured in terms of relative cost of pro- 
duction) between the data point ¢ and the 
non-parametric hypothesis (7). Thus, e* in 
(11) can provide useful information by mea- 
suring how close the observed choices are 
from satisfying the null hypothesis, that is, 
by measuring the strength of rejection of this 
hypothesis. While these arguments were pre- 
sented in the context of expression (7), simi- 
lar results can be developed in the context of 
(5), (6), (8), (9), and (10). As will be illus- 
trated in Section IV, this can provide useful 
goodness-of-fit measures associated with 
non-parametric tests. 


IIl. Empirical Implementation 


From Proposition 1, the inequalities pre- 
sented in equation (5) through (10) are nec- 
essary and sufficient for the existence of a 
production function that would rationalize a 
particular set of production data under alter- 
native hypotheses. Non-parametric testing 
thus involves checking the existence of a 
solution to these inequalities. 

The empirical implementation of (7) is 
straightforward as the inequalities in (7) in- 
volve only observable variables, p, x, y. In 
this case it is a simple matter to check 
whether the inequalities in (7) are satisfied 
for all observations. However, the inequali- 
ties (5), (6), (8), (9), and (10), involve vari- 
ables that are not directly observable (for 
example, y, and f, in (5); Ag, and/or A; in 
(9); Aon Aj, Ze and f, in (10)). In these 
cases, the non-parametric tests consist in 
finding whether there exists a set of values 
taken by the unobserved variables which 
would satisfy the corresponding inequalities: 
Since the inequalities (5), (6), (8), (9), and 
(10) are linear in the unobserved variables, 
checking the existence of a solution to these 
inequalities can be conveniently formulated 
as a linear programming problem, as dis- 
cussed below.? 


ÓAs mentioned in fn. 3, hypotheses involving input 
sealing vield non-narametric test restrictions which are 
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Considering the inequalities (5), (6), (8), 
(9), or (10), denote by q the vector of unob- 
served variables in these expressions. Since q 
shows up in linear form in these inequalities, 
they can be written as B'q > c, given appro- 
priate definitions of the matrix B and the 
vector c. Then consider the linear program- 
ming problem 


(12) Min{b’q; B'qx c, qe€C), 
q 


where the vector b and the feasible set C, a 
cone, are appropriately defined so that prob- 
lem (12) is necessarily bounded. It follows 
that either problem (12) has a solution, or if 
it does not, it must be infeasible. In other 
words, the inequalities B’g >c have a solu- 
tion if and only. if problem (12) has a feasi- 
ble solution. In this context, checking the 
existence of a solution to the non-parametric 
inequalities can be done by checking the 
existence of a feasible solution to the linear 
programming problem (12) (for example, us- 
ing the simplex method). By choosing appro- 
priate values for the b's (the coefficients of 
the objective function in (12)), this can yield 
useful information concerning the rates of 
input and/or output augmentations that are 
consistent with the data (see below). 

Note that, even for a moderate number of 
observations T, the number of constraints in 
the linear programming (12) will typically 
exceed the number of activities. In this case, 
it will be computationally convenient to con- 
sider the linear programming problem, dual 
to (12) 


(13) Max(c'g:b - Bqe C*, 420), 
ğ 


where C* is the polar cone of C. It is well 
known that (13) has an optimal solution if 


nonlinear in the unobserved variables. Note that, say 


for T>10, the number of these inequalities can be 
large. Since solving a large system of nonlinear inequali- 
ties can be computationally quite difficult, this suggests 
that the non-parametric testing of input scaling will in 
general be a difficult task. In contrast, the translating 
augmentation hypotheses discussed here yield non-para- 
metric tests involving /inear inequalities which are par- 
ticularly convenient for non-parametric empirical work. 
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and only if (12) has an optimal solution (for 
example, David G. Luenberger, 1984; V. A. 
Sposito). Alternatively, if problem (12) is 
infeasible, then (13) is either unbounded or 
infeasible. 

Here, whenever the unobserved variables 
show up in linear form in the non-paramet- 
ric inequalities, we propose to solve the dual 
formulation (13) (using the simplex method). 
If (13) has an optimal solution for particular 
production data, then the data are consistent 
with the associated hypothesis. Alternatively, 
if (13) is either infeasible or unbounded, 
then we would conclude the corresponding 
inequalities cannot be satisfied and that the 
data are not consistent with the associated 
hypothesis. In this case, it will be of interest 
to obtain the goodness-of-fit measures such 
as the ones proposed in (11). Again, this can 
be done in the context of a standard linear 
programming problem. 

To illustrate, consider for example the case 
of cost minimization under input and output 
translating (8c). Under CRTS (where A,= 
P:X,/Y,), (8c) generalizes into (9c) with 
goodness-of-fit measures e,, as 





pix 
(14a) : * (y 7 Ag Ys + Aos) 
Y: 
+ > fiul Xis T Ai, — OX T Ai) = 0, 
i=l 
(14b) 0xe,x1. 


Let q be the vector of unobserved variables 
in (14) (i.e., the Aor s, the A;,’s and the e,’s). 
Then the set of linear inequalities in (14) can 
be written as B'4 7 c, and the solutions to 
these linear inequalities can be investigated 
in the context of the linear programming 
problem (12) (or (13)) given appropriate 
choices for the vector b. Here, we chose the 
b’s associated with the A,’s (the input trans- 
lating factors) equal to M?, the b's associ- 
ated with the A,’s (the output translating 
factors) equal to M, and the b's associated 
with the e's (the goodness-of-fit measures) 
equal to — M", where M is a large positive 
scalar. In doing so, the linear programming 
solution to (13) gives the largest goodness- 


of-fit measures e* <1 that are consistent 
with the data Note that in the race where 
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ef —1, t —1,..., T, the (14) becomes equiva- 
lent to the non-parametric test (9c). In this 
case, given this choice of b’s, the solution to 
(13) gives the smallest A,’s, that is, the 
smallest biases in technical change, that are 
consistent with the data. Given the A,’s, the 
solution to (13) also gives the smallest A,’s, 
that is, the smallest productivity changes, 
that are consistent with the data. Thus the 
non-parametric approach can provide useful 
information in the analysis of productivity. 
Finally, not that while this example has fo- 
cused on the empirical implementation of 
the non-parametric test (9c), the approach 
can be easily modified to handle as well the 
non-parametric tests discussed above. 


IV. Application to U.S. and Japanese 
Manufacturing 


In this section, we present an application 
of our non-parametric methodology to the 
analysis of productivity in U.S. and Japanese 
manufacturing. We apply selected results 
from Section II using the procedures de- 
scribed in Section III. The data for U.S. and 
Japanese manufacturing are from Norswor- 
thy and Malmquist and include four inputs: 
capital labor, energy, and material. These 
data cover the periods 1958—77 for the 
United States and 1965-78 for Japan. Im- 
plicit quantity indexes were created for out- 
put and inputs by dividing current dollar 
(U.S.) or yen (Japan) values by the associ- 
ated price indexes. These indexes were then 
adjusted so that the quantity indexes are 
equal to 1.00 in the base year (1967 for the 
United States and 1972 for Japan). Note 
that these price and implicit quantity in- 
dexes satisfy Fisher's weak factor reversal 
test (Ralph C. Allen and W. Erwin Diewert, 
1981). 

We evaluate data consistency with various 
non-parametric hypotheses over the 1958-77 
period for the U.S. and 1965-78 period for 
Japan. Given that these data exhibit zero 
profit, all of the analyses presented below, 
except for the separability tests via (5), as- 
sume constant return to scale (CRTS) as a 
maintained hypothesis (see Norsworthy and 
Malmquist). In the cases where a non-para- 


metric hypothesis is found to be inconsistent 
with the data the teste are recamnnted after 
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TABLE 2— RESULTS OF NON-PARAMETRIC TESTS FOR THE WEAK SEPARABILITY 
OF INPUTS VIA EQUATION (5): U.S. (1958-77) AND ! 
JAPANESE (1965-78) MANUFACTURING DATA 


Specification/Test 


Capital and Labor: f(K, L) 
Materials and Energy: f(M, E) 
Capital, Materials, and 

Energy: f(K, M, E) 
Capital, Labor, Materials, 

and Energy: f(K, L, M, E) 

(Technical Separability or 

Extended Hicks Neutrality) 


Source: Computations by the authors. 


US. Japan 
Accept Accept 
Accept Accept 
Accept Accept 
Accept Accept 


Note: Accept indicates the existence of a feasible solution to the dual LP system; hence, 
acceptance of the non-parametric hypothesis test. 


including the goodness-of-fit measures as 
discussed above. 

Table 2 summarizes the results of the vari- 
ous inputs weak separability tests. First, we 
non-parametrically evaluate the weak sepa- 
rability of capital and labor (K, L), materi- 
als and energy ( M, E), and capital, materi- 
als, and energy (K, M, E) via equation (5). 
We then evaluate the hypothesis of general- 
ized Hicks neutrality as the weak separabil- 
ity of all inputs (K, L, M, E) from output 
and the technology indexes (A) via equation 
(5). All input aggregates and the extended 
Hicks neutrality hypotheses are found to be 
data consistent over the periods analyzed for 
both the United States and Japanese manu- 
facturing data. It is insightful to contrast 
these results with parametric results of Nor- 
sworthy. and Malmquist obtained with the 
same data in the context of translog produc- 
tion and unit cost functions. Our non-para- 
metric results indicate the existence of a 
production function with extended Hicks 
neutral technical change that is consistent 
with the United States as well as Japa- 
nese data. In contrast, Norsworthy and 
Malmquist reject Hicks neutral technical 
change in both their estimated translog pro- 
duction and cost function for the Japanese 
data. These differences in results likely re- 
flect the Diamond et al. identification issues. 


For U.S. data, Norsworthy and Malmquist (p. 953) 
also reject Hicks neutrality using a cost function ap- 
proach, although they accept the hypothesis of Hicks 
neutral technical change based on a production func- 


tian annrnach 


Norsworthy and Malmquist similarly test 
for the weak separability of capital and labor 
using a translog production function as well 
as a unit cost function under alternative 
maintained hypotheses.” They reject this 
weak separability for both U.S. and Japa- 
nese production functions under all main- 
tained hypotheses. They also obtain similar 
results based on a cost function analysis of 
Japanese manufacturing (see Norsworthy ` 
and Malmquist, p. 953). 

For U.S. as well as Japanese manufactur- 
ing, our non-parametric results indicate the 
existence of a production function which 
rationalizes these data and is weakly separa- 
ble in (K, L) (as well as (M, E), (K, M, E), 
and (K, L, M, E)). These results suggest that 
Norsworthy and Malmquist's rejections of 
the weak separability of (K, L) using these 
same data are sensitive to the parametric 
specification of their model. 

Tables 3, 4, and 5 summarize the results of 
the non-parametric tests for alternative spec- 
ifications of technical change under the 
maintained hypotheses of cost minimization 
and CRTS.? As indicated in Table 3, the 


"Their separability test is Michael Denny anc Melvyn 
Fuss's (1978) local separability test (rather than the 
more restrictive weak global separability of Ernst Berndt 
and Laurits R. Christensen, 1973). 

The goodness-of-fit measures e* are reported in 
Table 4 only for those hypotheses which were found to 
be inconsistent with the data (i.e., had unbounded solu- 
tions to the dual linear programming formulaticn of the 
non-parametric test). In the case where we found data 
consistency, the associated goodness-of-fit measures are 
simnly &e* —1 for all vearc 
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TABLE 3——RESULTS OF NON-PARAMETRIC TESTS FOR ALTERNATIVE SPECIFICATIONS 
OF TECHNICAL CHANGE UNDER COST MINIMIZATION AND CONSTANT RETURN 
TO SCALE (CRTS): U.S. (1958-77) AND JAPANESE (1965-78) 

MANUFACTURING DATA i 


Specification/Test U.S. Japan 


Strong Axiom of Cost Min 
with CRTS: Reject Reject 
(No Technical Change) 

CRTS Cost Min: Reject Reject 
Output Translating 
(Hicks Neutrality) 

CRTS Cost Min: Accept Accept 
Input Translating 

CRTS Cost Min: Accept Accept 
Output & Input Translating ` 

CRTS Cost Min: é Reject Reject 
Output & Input Translating 


& Additivity 


Source: Computations by the authors. 

Note: Accept indicates the existence of a feasible solution to the dual LP system; hence, 
acceptance of the non-parametric hypothesis test. Reject indicates an unbounded 
solution to the dual LP problem, hence non-parametric rejection of the hypothesis. 


TABLE 4— NON-PARAMETRIC GOODNESS-OF-FIT MEASURES FOR ALTERNATIVE TECHNICAL CHANGE SPECIFICATIONS 
UNDER CRTS Cost MINIMIZATION AND/OR ADDITIVE AUGMENTATION: NORSWORTHY 
AND MALMQUIST U.S. AND JAPANESE MANUFACTURING DATA 


US. Japan 
Strong Additivity Strong ` Additivity 
Axiom Input Axiom Input 
of and of and 

: Cost Output Output Cost Output Output 
Year Min Augments Augments Min Augments Augments 
1958 0.6490 1.0000 0.9931 - - - 
1959 0.7324 1.0000 0.9976 - - - 
1960 0.7399 1.0000 0.9978 - f - - 
1961 0.7375 ` 1.0000 0.9960 - - - 
1962 0.7996 1.0000 0.9980 - - - 
1963 0.8539 1.0000 0.9998 - - - 
1964 0.8971 1.0000 1.0000 - - - 
1965 0.9268 1.0000 0.9997 0.5490 1.0000 0.9593 
1966 0.9385 0.9981 0.9983 0.6561 1.0000 0.9833 
1967 0.9203 0.9981 0.9989 . 0.8129 0.9999 1.0000 
1968 0.9389 1.0000 0.9991 0.9233 1.0000 1.0003 
1969 0.9365 1.0000 0.9984 0.9794 1.0000 0.9994 
1970 0.9086 10000 - 0.9990 0.9867 1.0000 0.9984 
1971 0.9335 1.0000 0.9994 0.9606 1.0000 0.9995 
1972 0.9729 0.9998 0.9964 0.9880 1.0000 0.9993 
1973 0.9860 0.9965  - 0.9928 0.9036 1.0000 0.9916 
1974 0.9378 1.0000 0.9974 0.8501 1.0000 0.9939 
1975 0.9380 1.0000 1.0000 0.9482 1.0000 1.0007 
1976 0.9822 1.0000 1.0000 1.0000 1.0000 0.9985 
1977 1.0000 1.0000 1.0000 0.9628 1.0000 0.9979 
1978 - - - 0.9752 1.0000 0.9898 





Source: Computations by the authors using SAS's LP procedure. 
Note: These goodness-of-fit measures are reported only for those hypotheses which were found to be inconsistent with 
these data (i.e., had unbounded solutions to the dual LP formulation of the associated non-parametric inequalities). 
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strong axiom of cost minimization without 
technical change via (7) is rejected for both 
U.S. and Japanese manufacturing. The asso- 
ciated goodness-of-fit measures from Table 4 
(Strong Axiom of Cost Min.) indicate that 
there is rather strong evidence against the 
null hypothesis of no technical change. These 
results suggest that technical change has sig- 
nificantly affected U.S. and Japanese manu- 
facturing production technology over the pe- 
riods analyzed. 

Under the maintained hypotheses of CRTS 
cost minimization, the nature of the techni- 
cal change is further analyzed in the context 
of the translating hypotheses discussed in 
Section II. Under the Hicks neutral output 
translating hypothesis (additive output aug- 
mentation, equation (9a)), consistency is also 
rejected for both U.S. and Japanese manu- 
facturing for the periods analyzed.’ The as- 
sociated goodness-of-fit measures from Table 
4, however, suggest that this nonparametric 
rejection is not strong as only 4 years of the 
United States (1966, 1967, 1972, 1973) and 
one year of the Japanese (1967) data re- 
quired adjustment to achieve data consis- 
tency. Given that the magnitude of the good- 
ness-of-fit measures are all very close to 1 for 
the U.S. as well as Japanese data (see Table 
4), the evidence against the null hypothesis 
appears to be quite weak. For example, mea- 
surement errors in the data used for the 
analysis may be sufficient to explain these 
goodness-of-fit results. Thus, our results are 
interpreted here as additional evidence sup- 
porting the hypothesis of Hicks neutral tech- 
nical change. 

The input translating hypothesis (additive 
input augmentations, equation (9b)), is found 
to be consistent with the data for both coun- 
tries. This suggests evidence that biased 
technical change could also rationalize these 
data. Given this result, it is therefore not 


surprising that non-parametric evaluation of : 


joint input and output translating hypothe- 


?Under output scaling (where Y= y/Ag) and con- 
stant return to scale hypotheses, the corresponding 
non-parametric test was also conducted. Data consis- 


tency for both U.S. and Japanese manufacturing was. 


also rejected under these hypotheses. 
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ses via (9c) is consistent with these data for 
both countries (see Table 3). 

As discussed in Section III, given an ap- 
propriate choice of b in (12), the linear 
programming results associated with the joint 
input and output translating hypotheses yield 
measures of the minimum output augmenta- 
tions (Ap) in equation (9c). These measures 
are summarized in Table 5.!! On average, 
the output augmentations increase faster for 
Japanese than U.S. manufacturing. This is 
consistent with the productivity growth re- 
ported by Norsworthy and Malmquist. The 
Ao, for the United States reveal an upward 
trend from 1958-73 with a major downward 
year-to-year change in 1970 and several 
smaller downward year-to-year changes (i.e., 
1961, 1963, and 1967). The Japanese Ao, 


reveal an upward trend from 1965-1972 with 


a downward year to year change in 1971. 
Both the U.S. and Japanese Ap, reveal a 
major decrease in output augmentation 
around the 1973 oil price. shock. The 
Japanese Ay, show a major decrease starting 
in 1973 and 1974. In contrast, the U.S. Ao, 
show a major decrease for one year in 1973. 
These results may reflect differences in the 
degree of oil import dependence between the 
United States and Japan where heavier im- 
port dependence is reflected by larger and 
longer adjustments in output augmentation. 
Also note that the Japanese Ao, reflect an- 
other year-to-year decrease in 1977 which 
corresponds to the second oil price shock of 
the 1970's. The relative magnitudes and du- 
ration of the year-to-year decrease, however, 
is much smaller than in 1973. 


The results presented below correspond to M= 
1000. The findings were found to be insensitive to 
choosing a larger value for M. 

Note that our formulation allows for “regressive” 
technical change in the sense that Ag, < Ag,.1 is per- 
mitted (see Table 5). The period following the oil price , 
Shocks of the.1970s demonstrates this possibility quite 
well. Similar patterns of technical change for U.S. and 
Japanese manufacturing 1965—74 are reported by Dale 
W. Jorgenson and Mieko.Nishimizu (1978). This sug- 
gests that large changes in relative prices may adversely 


- affect technological change reflecting the adjustments 
‘associated with induced shifts to alternative production 


processes better adapted to the new economic context. 
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TABLE 5—ESTIMATED MINIMUM RATES OF OUTPUT AUGMENTATION FROM ADDITIVE 
OUTPUT AND INPUT AUGMENTATION SPECIFICATION OF TECHNICAL CHANGE 
UNDER COST MINIMIZATION AND CRTS: NORSWORTHY AND MALMQUIST, 

' U.S. AND JAPANESE MANUFACTURING DATA 








U.S. 
Output 
Augments 
Year (Aor) 
1958 0.000 
1959 0.042 
1960 0.047 
1961 . 0.039 
1962 0.071 
1963 0.066 
1964 0.105 
1965 ` 0.176 
1966 0.191 
1967 0.179 
1968 0.199 
1969 0.198 
1970 0.120 
1971 . 0.198 
1972 0.244 
1973 0.262 
1974 0.210 
1975 0.218 
1976 ` 0251 
` 1977 i ` 0.263 


1978 - 


Source: Computations by the authors. 





Japan 
Chg in ` Output Chg in 
Output Augments Output 
Augments (Aor) Augments 
0.042 
0.005 
— 0.008 
0.032 
— 0.005 
0.040 
0.071 0.000 E 
0.015 0.020 0.020 
—0.013 0.063 0.043 
0.020 0.110 0.046 
—0.001 0.147 0.037 
—0.078 0.157 0.010 
0.078 0.133 —0.024 
0.045 0.171 0.038 
0.019 0.060 -0J11 
— 0.053 0.053 — 0.007 
0.008 0.145 0.092 
0.033 0.184 0.039 
0.012 0.152 — 0.033 
- 0.162 0.010 


Note: These output augmentation values are consistent with equations (9c) under the 


input and output translating hypothesis. 


Note that the U.S. Ao, indicate a strong 
rebound in output augmentation in 1976, 
that 1s, after the 1974—75 U.S. recession. Not 
until 1977, however, did output augmenta- 
tion surpass the 1973 (prior highest) levels. 
In contrast, the Japanese Ay, indicate a 
strong rebound in output augmentation in 
1975. By 1976, output augmentation sur- 
passed the 1972 (prior highest) levels. These 
results appear reasonable and illustrate the 
usefulness of the non-parametric approach 
to the measurement of productivity change. 

Last, we non-parametrically: evaluate the 
hypothesis of strong (additive) separability 
of capital and labor in the context of trans- 
lating. More specifically, we test for the exis- 
tence of a production function y= g(x)+ 
Ag+ f(Z) via equations (10), where x de- 
notes material and energy while Z denotes 
(translated) capital and labor. Note that this 


specification allows additive output (or real 
value added) augmentation (via Ay) and bi- 
ased technical change in capital and labor 
where Z, = z; + A; A, being the correspond- 
ing additive input augmentation. This 
specification would have to hold for the 
widespread double-deflation method of cal- 
culating (value-added) GNP to be valid 
(Denny and May). 

Table 3 indicates that these data are not 
consistent with this hypothesis over the time 
periods analyzed for both countries. This 
is consistent with the Norsworthy and 
Malmquist translog findings that a value 
added aggregate requiring additive separabil- 
ity. of (K, L) is inappropriate for either the 
United States or Japan over the periods ana- 
lyzed. The associated goodness-of-fit mea- 
sures in Table 4, however, suggest that the 
evidence against data consistency is rather 
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weak. That is, despite the numerous years in 
both series that have goodness-of-fit mea- 
sures lower than one, the lowest measure is 
found to be 0.9931 for the United States 
(1958) and 0.9593 for Japan (1965). These 
small discrepancies could possibly be ex- 
plained by data measurement errors alone. If 


so, Norsworthy and Malmquist's evidence - 


against the value added measure of produc- 
tivity would be sensitive to parametric speet: 
fication. 


V. Concluding Remarks 


This paper presents a non-parametric 
analysis of technology and technical change 
in the context of cost minimizing behavior. 
The non-parametric results of Hanoch and 
Rothschild, and Varian (1984) are extended 
to incorporate output augmenting (Hicks 
neutral) and input augmenting (biased) tech- 
nical change. In addition, goodness-of-fit 
measures following Afriat (1967) and Varian 
(1988) are incorporated to allow further 
evaluation of the evidence against null hy- 
potheses in non-parametric tests. 

The non-parametric approach allows the 
testing of several hypotheses concerning the 
existence and nature of technical change 
which are less dependent on parametric spec- 
ification of production relationships. For ex- 
ample, while Norsworthy and Malmquist 
found evidence against the hypothesis of 
Hicks neutral technical change, our non- 
parametric results indicate the existence of a 
production function exhibiting Hicks neutral 
technical change that is consistent with both 
the U.S. and Japanese data. While we as- 
sume cost minimization (i.e., the separability 
tests of Table 2) as well as constant returns 
to scale and additive output augmentation 
(i.e., the output and input translating results 
of Tables 3 and 4) as maintained hypotheses, 
our results do not require further parametric 
specification of the cost or production func- 
tion. This suggests that Norsworthy and 
Malmquist's analysis of Hicks neutral tech- 
nical change is sensitive to their parametric 
specification. | 

Alternatively, under CRTS cost minimiza- 
tion and additive augmentation (translating) 
hypotheses, our non-parametric results found 
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only weak evidence against the additive sep- 
arability of capital and labor from energy 
and material in U.S. and Japanese data. This 
additive separability is required to justify a 
value-added measurement of productiv- 
ity (Denny and May). Norsworthy and 
Malmquist found parametric evidence 
against this additive separability and argued 
that productivity measurements should be 
made based on gross output measures (rather 
than on value added)..Again, our results 
suggest that Norsworthy and Malmquist’s 
results may be sensitive to their parametric 
specification. 

Given the identification and measure- 
ment difficulties confronting parametric ap- 
proaches to technical change analysis 
(Diamond et al.), the proposed extensions of 
the non-parametric approach to include 
technical change hypotheses provide a pow- 
erful heuristic to complement the more tra- 
ditional parametric approaches. Empirical 
implementation of these extended non-para- 


. metric results can be conveniently performed 


using standard linear programming algo- 
rithms when the resulting inequalities are 
linear. By illustrating the usefulness of the 
approach, it is hoped that this paper will 
stimulate additional research on the non- 
parametric analysis of economic decisions. 


APPENDIX 


Proof of Proposition 1: 
Consider the saddle-point problem x*z 0, A*20 
such that 


(Al) L(x,A*, p, A, y) 


= L(x*,A*, p,A, y) 
zL(x*,A,p,A, y), Xx x0, A20, 


where L(x, A, p, A, y) = p'x + ALfCy, A)— g(x, 4). 

If g(x, A) is strictly increasing and concave in x, 
then the saddle-point criterion (A1) is a necessary and 
sufficient condition for x* to be a global solution to the 
optimization problem Min( p’x: f(y, A) < g(x, A), x x 
0} (see V. A. Sposito, 1975; Samuel Karlin, 1957). 

Denote the saddle-point of L(x,A, p,, Ay, 9) in 
(A1) by A, — A*( pj, Ay, y) and x, = x*( p, Ay, x). The 
saddle-point characterization (Al) implies that A,- 
[Or A5) — gC, 4,)] — 0, and 


L(x,, M, Po År ») zL(x,M, Po A, z) 


P t 
Bpx. 
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This yields 
Pixs + he LL At) — 2605: A = P%, M20, 


Given that g(x, A) is strictly increasing in x, it 
follows that A, > 0, f(y, 4,) = go A) and 


(2) foe 4)- g(x 4) ts =x) 


20, A,>9. 


This proves (a). 
To prove (b), define the function 


(A3) G(x,, A,) = 8(%5,45) + Mia [ fOr) 


Pi 
= g(x 4) +t æT x) 


For the case where s=1f note that (A2) implies that 
Ae[f n 4) 8(% AN z 0. If x, is feasible, this 
yields Ap C/O 4.) 8 A0] 20, or fO A) = 
g(x,, A) given À, 7 0. Also, in this context, we have 
G(x,, A) 2 fO A, ) from (A3) (by definition) while 
G(x,, As) 2 8(X,, A,) from (A2). Given fO As) = 
g(x,, A,), it follows that G(x,, A,) = g(x,, A,), that is, 
G(x,, As) gives a representation of g(x,,A,) for all 
data points s =1,..., T. 

From (A2) and (A3) and given Q= {x,, x;,..., xr], 
we have: 


Min ( pix: f(x. A) SG(x, As) xEQ} 


> min{ pix: / Qu 4) € g(x, A,) 
+ fs A) 7 865 Ay) 
ld 
EE Qe), xea} 
Às 
= min ( pix: pix, S pix, x EQ} = pix, 
x 


It follows that x, solves Min{ p/x: f(y,, As) < G(x, A,), 
xe&). 
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Herd Behavior and Investment 


By Davin S. SCHARFSTEIN AND JEREMY C. STEIN* 


This paper examines some of the forces that can lead to herd behavior in 
investment. Under certain circumstances, managers simply mimic the investment 
decisions of other managers, ignoring substantive private information. Although 
this behavior is inefficient from a social standpoint, it can be rational from the 
perspective of managers who are concerned about their reputations in the labor 
market. We discuss applications of the model to corporate investment, the stock 
market, and decision making within firms. (JEL 026, 522) 


A basic tenet of classical economic theory 
is that investment decisions reflect agents’ 
rationally formed expectations; decisions are 
made using all available information in an 
efficient manner. A contrasting view is that 
investment is also driven by group psychol- 
ogy, which weakens the link between infor- 
mation and market outcomes. In The Gen- 
eral Theory, John Maynard Keynes (1936, 
pp. 157-58) expresses skepticism about the 
ability and inclination of “long-term in- 
vestors” to buck market trends and ensure 
efficient investment. In his view, investors 
may be reluctant to act according to their 
own information and beliefs, fearing that 
their contrarian behavior will damage their 
reputations as sensible decision makers: 


...it is the long-term investor, he who 
most promotes the public interest, who 
will in practice come in for most criti- 
cism, wherever investment funds are 
managed by committees or boards or 
banks. For it is in the essence of his 
behavior that he should be eccentric, 
unconventional, and rash in the eyes of 


*Sloan School of Management, Massachusetts Insti- 
tute of Technology, Cambridge, MA 02139 and Har- 
vard Business School, Boston, MA 02163. We are grate- 
ful for helpful comments from Robert Gertner, Bob 
Gibbons, Bengt Holmstrom, Chi-Fu Huang, Robert 
Merton, Jim Poterba, Julio Rotemberg, Garth Saloner, 
Andrei Shleifer, Larry Summers, Robert Vishny, Mark 
Wolfson, and two anonymous referees, and participants 
in seminars at the NBER Summer Institute, Harvard, 
Rochester, Columbia, and Northwestern. 
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average opinion. If he is successful, 
that will only confirm the general belief 
in his rashness; and if in the short-run 
he is unsuccessful, which is very like- 
ly, he will not receive much mercy. 
Worldly wisdom teaches that it is bet- 
ter for reputation to fail conventionally 
than to succeed unconventionally. 


Thus Keynes suggests that professional man- 
agers will “follow the herd” if they are con- 
cerned about how others will assess their 
ability to make sound judgments. There are 
a number of settings in which this kind of 
herd behavior might have important implica- 
tions. One example is the stock market, for 
which the following explanation of the pre- 
October 1987 bull market is often repeated: 
The consensus among professional money 
managers was that price levels were too high 
—the market was, in their opinion, more 
likely to go down rather than up. However, 
few money managers were eager to sell their 
equity holdings. If the market did continue 
to go up, they were afraid of being perceived 
as lone fools for missing out on the ride. On 
the other hand, in the more likely event of a 
market decline, there would be comfort in 
numbers—how bad could they look if every- 
body else had suffered the same fate? 

The same principle can apply to corporate 
investment, when a number of companies 
are investing in similar assets. In Selling 
Money, Samuel Gwynne (1986, p. 58) docu- 
ments problems of herd behavior in banks' 
lending policies toward LDCs. Discussing 
the incentives facing a credit analyst, he 
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writes: 


Part of Herrick's job—an extremely 
important part as far as the bank was 
concerned— was to retrieve informa- 
tion about the countries in which the 
bank did business. But this function 
collided head on with what Herrick 
was actually doing out there... His job 
would never be measured by how cor- 
rect his country risk analysis was. At 
the very least, Herrick was simply do- 
ing what hundreds of other larger in- 
ternational banks had already done, 
and any ultimate blame for poor fore- 
casting would be shared by tens of 
thousands of bankers around the 
world; this was one of the curious ben- 
efits of following the herd. 


The aim of this paper is to develop a clear- 


er understanding of some of the forces that . 


can lead to herd behavior. We find that, 
under certain circumstances, managers sim- 
ply mimic the investment decisions of other 
managers, ignoring substantive private infor- 
mation. Although this behavior is inefficient 
from a social standpoint, it can be rational 
from the perspective of managers who are 
concerned about their reputations in the la- 
bor market. 

Our model is a “learning” model, similar 
in spirit to one studied by Bengt Holmstrom 
(19822). Like us, he considers a situation in 
which managers use investment decisions to 
manipulate the labor market's inferences re- 
garding their ability, where ability represents 
an aptitude for making decisions. This defi- 
nition of ability contrasts with one where 
ability adds to physical productivity, as in 
Holmstrom and Joan Ricart i Costa (1986) 
as well as in another part of Holmstrom 
(1982a). The key difference between our 
model and these others is that ours is most 
interesting when there is more than one 
manager, whereas theirs are single-manager 
models.! i 


lIn Holmstrom's (1982a) example where talent is 
related to the ability to make good decisions, there are 
inefficiencies with only one manager. This follows from 
his assumption that the outcome of a potential invest- 
ment project is unobservable when the investment is not 
undertaken. Our model differs in that the state of the 
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We assume that there are two types of 
managers: "smart" ones, who receive infor- 
mative signals about the value of an invest- 
ment, and “dumb” ones, who receive purely 
noisy signals. Initially, neither the managers 
themselves nor the labor market can identify 
the types. However, after the managers have 
made an investment decision, the labor mar- 
ket can update its beliefs, based on two 
pieces of evidence: 1) whether the manager 
made a profitáble investment; 2) whether the 
manager's behavior was similar to or differ- 
ent from that of other managers. - 

If there are systematically unpredictable 
components cf investment value, the first 
piece of evidence will not be used exclu- 
sively, since on any given draw, all smart 
managers could get unlucky and receive mis- 
leading signals. Hence the second piece of 
evidence is important as well. Holding the 
absolute profitability of the investment 
choice fixed, managers will be more favor- 
ably evaluated if they follow the decisions of 
others than if they behave in a contrarian 
fashion. Thus an unprofitable decision is 
not as bad for reputation when others make 
the same mistake—they can share the 
blame if there are systematically unpre- 
dictable shocks. 

This “sharing-the-blame” effect arises be- 
cause smart managers tend to receive corre- 
lated signals (since they are all observing a 
piece of the same “ truth"), while dumb ones 
do not (they simply observe uncorrelated 
noise). Consequently, if one manager mimics 
the behavior of others, this suggests to the 
labor market that he has received a signal 
that is correlated with theirs, and is more 
likely to be smart. In contrast, a manager 
who takes a contrarian position is perceived 
as more likely to be dumb, all else being 
equal. Thus even if a manager's private in- | 
formation tells him that an investment has a 
negative expected value, he may pursue it if 
others before him have. Conversely, he may 
refuse investments that he perceives as hav- 
ing positive expected value if others before 
him have also done so. 


world that determines investment profitability is always 


observable. In the case that we study, there are thus no 
inefficiencies in a single-manager setting. 
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These points are further developed in the 
five sections following this one. In Section I, 
we present the structure and assumptions of 
the model. The basic results on the existence 
of herding equilibria are summarized in three 
propositions in Section II. Section III looks 
more closely at some countervailing forces 
that may offset herding tendencies. In Sec- 
tion IV, we elaborate on the implications of 
the model for corporate investment, the stock 
market, and decision making within firms. 
Finally, Section V contains concluding re- 
marks. 


I. The Model 


The model that is developed in this sec- 
tion applies more literally to the example of 
corporate investment discussed above than it 
does to the stock market. We assume that 
the investments under consideration are 
available in perfectly elastic supply at a given 
price. This allows us to avoid explicitly con- 
sidering the feedback from investment de- 
mand to prices, thereby simplifying the anal- 
ysis considerably. In Section IV, we will 
discuss at greater length how we think our 
results carry over to the stock market, where 
this assumption is clearly not appropriate. 
For the moment, however, it may help for 
the reader interested in concreteness to bear 
in mind the following story: In our model, 
managers are in charge of capital investment 
at industrial firms, and they each are consid- 
ering investing in a cost-saving technology. 
The value of the technology will be realized 
in the future, and each manager receives a 
signal that gives him some information about 
the future state; The question we address is 
the following: How well does the aggregate 
level of investment reflect all of the available 
information? 


A. Timing and Information Structure 


The economy consists of two firms, firm A 
and firm B, run by managers we call A and 
B, respectively. These managers invest se- 
quentially, with A moving first. At date 1, A 
decides whether or not to make the invest- 
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ment. There are two possible outcomes at . 
date 3: either the “high” state, in which case 
the investment yields a profit (net of invest- 
ment expense and discounting) of xy > 0; or 
the “low” state, in which case the net profit 
is x, <0. The prior probabilities of these 
two states are a and (l— a), respectively. 
The outcome is publicly observable, even if 
neither manager decides to invest. 

In making his decision, A has access to a 
signal, which can take on one of two values: 
Sg (a “good” signal); or sẹ (a “bad” signal). 
Interpreting this signal is a bit complicated, 
because the manager does not know if he is 
“smart” or “dumb.” If he is smart, which 
occurs with prior probability 6, the signal is 
informative—that is, a good signal is more 
likely to occur prior to the high state than 
the low state. Formally, we have: 


(1) Prob(sg|x,,,smart) = p; 
(2) Prob(s,|x,,smart) =q < p. 


If. the manager is dumb, however, which 
occurs with probability (1— 8), he receives 
completely uninformative signals—he is as 
likely to receive sç prior to the high state as 
prior to the low state; 


(3) Prob(sg|x,,,dumb) 
= Prob(s¢|x,,dumb) = z. 


We make the assumption that the ex ante 
distribution of signals is the same for both 
smart and dumb managers— both are equally 
likely to receive sc, so that the actual signal 
received does not communicate any informa- 
tion about the manager's type. This amounts 
to assuming that: Prob(s,|smart) = 
Prob(sg;| dumb), or: 


(4) z ap t (1— a)q. 


Given that the manager does not know if 
he is smart or dumb, a straightforward appli- 
cation of Bayes’ law allows us to calculate 
the probabilities he attaches to the high state 
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after receiving the good and bad signals: 
(5) Prob(xylsc) = be 


[0p +(1—8)z] 


Z 


3 


(6) Prob(xyl|sg) 


., ,Da-p*a-690-21. 
Ps (1- z) ^ 





In order to make the investment problem 
interesting, we assume that the investment is 
attractive if a good signal has been received, 
but. not if a bad signal has been received: 


(7) BeXy t (L—- Be) x1 
»0»pgxy * (17 Hy) Xz. 


After A has made his investment decision 
at date 1, B makes his decision at date 2. 
Manager B also has access to a private sig- 
nal. In addition, he can observe whether or 
not firm A has decided to invest. This is 
valuable information; even in a world with- 
out reputational concerns, firm B’s invest- 
ment decision should be partially influenced 
by what firm A does. Our main point is that 
with reputational concerns, firm B’s man- 
ager pays too much attention to what firm A 
has done, and too little to his private signal.” 

Like manager A, firm B’s manager can be 
either smart or dumb. If one manager is 
smart and the other is dumb, their signals 
are drawn independently from the binomial 


^An analogy to noisy rational expectations models of 
the stock market (such as Martin Hellwig, 1980), where 
all the players move simultaneously, may be helpful. In 
these models, traders also put some weight on informa- 
tion drawn from the investment decisions of others 
(which are reflected in the stock price) and some weight 
on their private information. The question we address 
here is whether too little weight is put on private 
information in making decisions. Note, however, that in 
a noiseless stock market, the price reveals all informa- 
tion, so that ignoring private signals is actually optimal. 
See, for example, Paul Milgrom and Nancy Stokey 
(1982). 
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distributions given in equations (1)-(3). Sim- 
ilarly, if both are dumb, their signals are 
drawn independently— so that, for example, 
the probability is z? that two dumb man- 
agers both observe the good signal.? 
However, if both managers are smart, they 
are assumed to observe exactly the same 
signal. Thus the probability that two smart 
managers both observe sg when the true 
state is x, is p (as opposed to p? if smart 
managers received independent draws from 
the distributions described in (1) and (2)). 
This feature is crucial to our analysis. It can 
be generalized somewhat to allow the draws 
to be imperfectly correlated. However, if the 
signals of smart managers are drawn inde- 
pendently from the distributions, our results . 
concerning herd behavior fail to go through. 
Heuristicallv, herd behavior requires smart 
managers’ prediction errors to be at least 
partially correlated with each other. Al- 
though this feature may seem slightly unnat- 
ural given the current setup and notation, it 
amounts to nothing more than saying that 
there are systematically unpredictable factors 
affecting the future state that nobody can know 
anything about.4 For example, we might 
model the outcome of the state draw as 
being driven by the sum of two random 
variables, u and v. If u+v>0, then x, 
obtains. If u+v<0, then x, obtains. Our 
assumption is equivalent to allowing smart 


Our assumption that all dumb managers receive 
independent signais may seem extreme. Many investors 
who respond to the same stimuli—for example, the 
predictions of technical stock market forecasters—are 
often thought to be "dumb" for doing so. However, 
care must be taken in interpreting this fact with respect 
to our model. We only assume that dumb agents’ pri- 
vate signals are indeoendent— that is, these agents make 
independent errors when trying to interpret data on 
their own. 'The fact that people rely on common outside 
sources for forecasts in a suboptimal fashion is not at 
all inconsistent with our approach. Indeed, it is essen- 
tially our main prediction: privately formed opinions 
will be disregarded in favor of publicly available fore- 
casts. 

^Empirical evidence suggest that the assumption of 
systematically unpzedictable components is a reason- 
able one. For example, Patricia O'Brien (1988) finds 
that individual security analysts' prediction errors con- 
tain common components. 
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managers to observe u but not v. (As sug- 
gested above, we could generalize to allow 
each manager to observe u perturbed by 
independent noise, so long as we retained 
the unobservability of the common compo- 
nent v.) 

The importance of a common component 
to the prediction error for smart managers 
follows logic similar to that seen in effort- 
based models of relative performance such 
as in Holmstrom (1982b) and Barry Nale- 
buff and Joseph Stiglitz (1983). If prediction 
errors are independent, the labor market can, 
in our model, efficiently update on ability 
using only individual performance—that is, 
whether the manager picked a successful in- 
vestment. Analogously, in models of relative 
performance, if the agents face independent 
shocks to output, optimal contracts evaluate 
the agent based only on individual absolute 
performance. However, in these models, if 
errors are correlated, there is an informa- 
tional gain from comparing agents. In our 
model, one will not wish to evaluate too 
harshly a manager who picks a bad invest- 
ment, if his colleague’s similar choice sug- 
gests that they were both victims of a com- 
pletely unpredictable factor. This is the 
“sharing the blame” effect touched on ear- 
lier. 

Thus it is the common component to pre- 
diction errors that gives this model its bite, 
by causing some inferential weight to be 
placed on the similarity of managers’ deci- 
sions. Perversely, the existence of this extra 
channel of inference actually leads to ex ante 
reductions in efficiency—just the opposite of 
the result seen in the literature on tourna- 
ments. This is because here, managers at- 
tempt to actively manipulate their invest- 
ment decisions in such a way as to bias the 
inference process in their favor. Even if the 
market recognizes that they will be engaging 
in this manipulation, it will continue to exist 
in equilibrium.’ 


*The basic idea about manipulation of the learning 
process was first developed by Holmstrom (1982a). Drew 
Fudenberg and Jean Tirole (1986) apply the concept 
and refer to it as "signal jamming." 
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B. Managerial Objectives 


Our next step is to specify the managers' 
objective functions. In a first-best world, the 
managers would seek only to maximize the 
expected returns on investment, and would 
invest anytime their information (either from 
their private signal or from observing the 
other managers) indicated that investing had 
positive expected value. 

In our model, managers' investment deci- 
sions enable the labor market to update its 
beliefs about their ability. We denote by ê 
the market’s revised assessment of the prob- 
ability that a manager is smart. 

In order to establish a simple relationship 
between managers’ objectives and 6, we 
make several simplifying assumptions. Fol- 
lowing Holmstrom and Ricart i Costa (1986), 
we assume that: 1) the investment game is 
replayed once more after date 3; 2) at this 
point there is no further reason to build a 
reputation, so managers invest efficiently; 3) 
competition leads managers’ spot market 
wages to be set to the economic value of 
their ability. 

It is straightforward to demonstrate that, 
for a wide range of parameter values in our 
model, the expected return on the invest- 
ment opportunity is linear in the manager’s 
ability (as measured by Ê) if the manager 
invests efficiently. Hence the spot market 
wages referred to above will be proportional 
to 6.6 

We do not explicitly analyze contracting 
behavior in what follows. Rather, we assume 
(as do Holmstrom, 1982a, and others) that 
managers cannot be bound to their firms 
against their will ex post. This means that 
any long-term contract that would pay some 
types less than spot market wages in the 
second round of the investment game is in- 
feasible. N 

Since their future wages are linear in @, 
managers have some incentive to generate 


SThis will be the case if the manager’s investment 
decision in the second go-round of the investment does 
not depend on 8; the manager invests if and only if he 
observes the good signal. This condition is analogous to 
that posited for the first go-round in inequality (7). 
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high values of à, rather than to invest effi- 
ciently in the first round. Of course, it is still 
possible that short-term incentive contracts 
could serve at least partially to align man- 
agerial and firm interests, by specifying a 
profit-contingent wage in the first round of 
the investment game. Thus, in principle, it 
seems reasonable to believe that managers 
would be induced to act so as to maximize a 
weighted average of expected profits and 
their future compensation. However, this 
more general formulation leads to the same 
basic conclusions that obtain if managers 
care only about reputation—although natu- 
rally, more weight on expected profits will 
tend to attenuate the inefficiencies. For the 
sake of starkness and notational simplicity, 
we leave expected profits out of the manage- 
rial objective function. Later, in discussing 
our results we briefly touch on how they 
would be altered if managers cared about 
expected profits." 

The last assumption we make is that man- 
agers are risk neutral, so that their objective 
function simplifies to maximizing expected 
wages. This is equivalent to maximizing the 
expected value of 8. 

It should be noted that managers care 
only about their absolute ability assessment 
—not about whether they are judged to be 
more or less able than other managers. We 
touch on the importance of relative ability in 
Section III. 


II. Herding Equilibria 


A. Comparison with Efficient 
Investment Decisions 


In order to economize on notation, we set 
p —1- q. We also set a=}. Taken together, 
these simplifications imply that z=} from 
equation (4). For now, we leave the sign of 


"Our simplified formulation would be most literally 
applicable to situations where the state is publicly ob- 
servable, but cannot be verified by the courts, so that 
profit-contingent contracts are not feasible. (For more 
discussion of this point, see Oliver Hart and Holm- 
strom, 1987.) This may be a reasonable assumption for 
certain types of jobs where there are no easily describ- 
able performance measures, but it is not valid in other 
cases, for example, portfolio management. 
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(xg xj) unspecified, so that the invest- 
ment can have an ex ante expected value 
that is either positive or negative. 

As a benchmark, we first derive the opti- 
mal decision rules in a first-best world with 
no reputational concerns. Manager A would 
invest if and only if he observed s; —this is 
a consequence of the assumption in equation 
(7). Thus manager B can infer manager A’s 
signal from his investment decision. 

If manager B observes s, after firm A has 
invested, he makes his decision based on the 
two-signal information set (sg, sp). Given 
our symmetry assumptions, this implies the 
probability of the high state, prob(x,|5¢, 53) 
— 1, Thus the investment decision hinges on 
the sign of (x,,+x,)—if this quantity is 
positive, manager B will invest, and if not, 
he will not. 

Similarly, if firm A does not invest, and 
manager B observes sç, the investment deci- 
sion turns on the same criterion of whether 
(xg * x) 7 0. Clearly, in the first best, the 
order in which the information arrives is 
irrelevant to manager B's decision. If one 
manager observes sç and the other sees sp, 
manager B's decision will be the same re- 
gardless of whether the s; signal was re- 
ceived by him or by manager A. 

With reputational considerations, the deci-. 
sion rules are different. When manager A 
observes sç and invests, manager B will also 
invest, regardless of his signal and the sign 
of (x, + xz). Hence if this signal is sẹ and 
Xz, x, <0, the investment will be ineffi- 
cieat. Conversely, if firm A does not invest, 
firm B never will either, which is inefficient 
when manager B observes s; and xg t x,» 
0. Now the order in which information ar- 
rives is important to firm B's decision—the 
same aggregate information of (s5, 55) can 
lead it to invest or not to invest, depending 
on whether the signal sç is received by the 
first mover firm A or not. 


B. Equilibria with Reputational Concerns 


We now examine the equilibria that exist 
when managers seek to maximize the ex- 
pected value of 8. For now, we focus on the 
decision rules of manager B—in all the 
*continuation" equilibria that we look at the 
manager A behaves efficiently, by investing 


VOL. 80 NO. 3 


if and only if he observes sç. Later, we will 
establish that this efficient behavior by firm 
A’s manager is part of an bo ae of the 
overall game. . > 

We develop our results through a series of 
propositions. 


PROPOSITION 1: There does not exist any 
continuation equilibrium in which manager B's 
investment decision depends on the signal he 
observes. Thus the only possible equilibria are 
those where manager .B mimics manager A 
regardless of the signal, or where manager B 
does the opposite of manager A regardless of 
the signal. 


The proof will be by contradiction. We 
start by conjecturing the existence of the 
“separating” equilibrium described above.® 
We then determine the updating rules the 
labor market would use to calculate 0 in 
such an equilibrium. Finally, we show that 
given these updating rules, rational man- 
agers will not wish to behave as posited in 
the equilibrium. 

The revised ability assessments will be a 
function of the labor market's conjectures 
about the signals observed by the managers 
as well as the realized state of the world. Of 
course, only the managers observe their sig- 
nals, but in the putative separating equilib- 
rium, there is a one-to-one mapping from 
signals to actions. Thus for example, sup- 
pose the separating equilibrium calls for each 
manager to invest if and only if he observes 
sg. Then if manager A does not invest and 
manager B does, the market believes that 
manager A observed s, and manager B ob- 
served s,, and it can do its updating based 
on these beliefs. 

. As noted above, the main focus of our 
analysis is manager B. Continuing with the 
above example, suppose the high state was 
realized. How would the market revise its 
prior about the managers ability? Let 
(Sp, Sg. Xy) denote this event, and let 


"It is important to keep in mind that the only private 
information of the manager is about the signal he 
observes, not about his ability. Hence the "separation" 
is with respect to this signal. 
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(sy, Sg» Xg) be the revised prior. By Bayes’ 
rule, one can show that: . 


(8) é( Sp, 3g; Xn) 


B 7p4(1- 0) 
~ ipé(1- 6) +4(1— p)6(1- 8) + 11-6) 





=26p/(1+8).. 


The. explanation for this result is as fol- 
lows. There are three possible configurations 
of managerial ability that could give rise to 
this event: (dumb, smart); (smart, dumb); 
and (dumb, dumb). Note that (smart, smart) 
is not possible since if this were the case, 
both managers would have received the same 
signal, by virtue of our assumption that smart 
managers’ signals are perfectly correlated. 

We wish to know the probability of (dumb, 
smart) conditional on the event (55, sc, xj). 
If the configuration of talent is (dumb, 
smart), which occurs with ex ante probabil- 
ity 6(1— 9), the probability of (5,,5¢) in 
state xy is jp. This explains the numerator 
in (8). The denominator gives in addition the 
probabilities of (55, 5g|x,,) if the configura- 
tion is (smart, dumb) ang (dumb, dumb). 
These are 1(1— p) and 4, respectively. By 
symmetry, it is straightforward to show that 
O(Sc 55, XL) =20p/(1+ 0) also. 

The derivation of the other updating rules 
follow along similar lines. They are listed 
below: 


(9) 6(s5.5¢, xr) 
= Î( sc» Sp; Xu) 


=20(1- p)/(1+ 8); 


(10) 6( sy, 55, xy) l 
= (sc. sg. x1) 
. -20(1— p)+6) | 
— 460— p)+(1-6)?’ 
(11) 6( 5, 5p, x1) 


= (sc, se, Xn) 
20p (14-0) 
40p (1-0) 
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Now, in order for our posited equilibrium 
to actually hold together, it must be that 
managers find it in their interest to behave as 
assumed. For example, suppose that man- 
ager A has observed s, and has not in- 
vested. Given the updating rules above, can 
it ever be rational for manager B to invest 
upon observing sg, but not invest upon ob- 
serving sẹ? Or will one type of manager 
“break” the equilibrium by deviating and 
attempting to fool the market into thinking 
that he has received a different signal? 

To answer these questions, the following 
probabilities must be calculated: 


(12) Prob(xy|s5, sz) =43 
(13) Prob(xjlss, sg) 


| 40(1— p) (1-0) 
|». 460420-0y 


We can now check the rationality condi- 
tions that must hold for the equilibrium to 
be viable. One of these i is 


(14) Biss: Sn x )Prob( Xgl55; $5) 
+ Ê( sp, sc, Xz) Prob( x;|sp, sg) 
> 6( sp, Sp, xj )Prob(xg|sg, sg) 


+ 6( Sp: Sp» xy )Prob( x,|sg, sg) . 


Inequality (14) represents the requirement 
that if manager B receives signal sç, he 
prefers to invest (and identify himself as 
someone who had observed s,), rather than 
not invest (and masquerade as someone who 
had received signal s,). Direct substitution 
from equations (8)-(12) establishes that the 
inequality is violated —if manager B receives 
sg, he will wish to deviate by mimicking firm 
A and not investing. A symmetric argument 
establishes that if firm A has invested, there 
is also no separating equilibrium. In this 
case, if manager B observes sp, he will devi- 
ate by mimicking firm A and also investing. 

There are also potentially “perverse” sep- 
arating equilibria, where manager B invests 
if and only if he observes sj, rather than s. 
It can be easily demonstrated using the same 
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lines of reasoning that such equilibria are 
also not viable. This completes the proof of 
Proposition 1. 

Itis worth examining the updating rules in 
equations (8)—(11) to gain some intuition for 
the forces that break the efficient equilib- 
rium. Two main points emerge from these 
equations: 

First, (ss, SG; XH) > (ss, sG, XL) and 
6( 86, se xg) é(ss, Sg x). Holding Te in- 
vestment decision of manager 4 fixed, man- 
ager B is indeed compensated for making ` 
*absolutely" good decisions—for investing 
prior to a realization of xj, as opposed to 
investing prior to a realization of xz. 

Second, however, the investment decision 
of manager A does have an important exter- 
nality effect. Holding the correctness of the 
investment decision fixed, there is a higher 
payoff to manager B for imitating manager 
4. That is, 8(56, sc, XH) > (sg, Sg, Xz); and 
lsc Sq» XL) > (Sp S, Xr) 

Proposition 1 is a direct consequence of 
this second effect. Because of the payoff to 
imitation, even if the new information makes 
it more likely that contradicting manager 4 
is the economically correct decision, man- 
ager B prefers to mimic A. As a result, 
decisions cannot be made contingent on sig- 
nals, and there cannot be an equilibrium 
where manager B takes advantage of his 
private information. 

As was emphasized in the previous sec- 
tion, the result depends on our assumption 
that prediction errors are correlated across 
smart managers. If the signals of smart man- 
agers are independent, Proposition 1 no 
longer holds. This can be demonstrated by 
calculating the updating rules that would 
prevail if signals were independent. Denot- 
ing these rules by 4 ), and using the same 
Bayesian logic as before, we can derive 


(15) s, S6. Xy) 
X Ó' (sc, sc, Xu) 
= (sp, $5, Xi) 
sd (sc, 2 

208p 
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(16) (sp, Sc xz) 
= Ó'(sc, 86. x,) 
= 8'(s5, 5, xy) 
(sc, ss xy) 


26(1- p) 
~ 28- p)* (1-0): 


According to equations (15) and (16), in 
the case of independent signals, the labor 
market's assessment of firm B's manager is 
unrelated to the investment decision of firm 
A. All that matters is the profitability of the 
investment—investing before state x,, leads 
to a more favorable ability assessment than 
not investing before state xy. As a result, the 
inequality in (14) is satisfied (with equality) 
and it is possible to sustain the efficient 
equilibrium in which manager B's invest- 
ment decisions generally depend on his pri- 
vate signal. 

If the independence assumption is relaxed, 
the updating rules become a function of firm 
A's investment decision, inequality (14) is 
violated, and Proposition 1 holds. Thus per- 
fect correlation of smart manager signals is 
not necessary for our results—all that is 
needed is some correlation of prediction er- 
rors. 

Having established that continuation equi- 
libria with signal-contingent decisions by 
manager B do not exist, we now turn our 
attention to the equilibria that can be sup- 
ported in our model. 


PROPOSITION 2: There exists a continua- 
tion equilibrium in which manager B always 
mimics manager A, investing if and only if A 
does. This herding equilibrium is supported by 
the following “reasonable” out of equilibrium 
beliefs: i) if manager B deviates by investing 
when A has not, the labor market believes that 
he observed sç, and conversely, ii) if manager 
B deviates by not investing when A has, the 
labor market believes that he observed Sp. 


In order to prove Proposition 2, it is nec- 
essary to show that manager B will always 
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find it optimal to behave as prescribed, given 
the beliefs posited. Let us consider only the 
case where firm A has already not invested; 
the other case works exactly the same way. 

If manager B follows firm A by also not 
investing, his revised ability assessment is 
simply equal to 6 —there is no revision from 
the prior because the equilibrium is a “ pool- 
ing" one, with both s; and sj recipients 
choosing the same action. 

In order for manager B who observes sp, 
not to deviate, it must be the case that: 


(17) 082 Ó6(sg, sg, xg) Prob(xg|sg, 53) 
és, Sç» xz) Prob( x,|sp, sg). 


Inequality (17) is the requirement that the 
payoff to a manager who observes sp, pools, 
and receives 0, exceeds his payoff from devi- 
ating and investing, given that out of equi- 
librium beliefs are such that he will be viewed 
as having observed s; if he deviates. Direct 
substitution from (8), (9), and (13) verifies 
that the inequality is satisfied. 

In order for manager B observing sg not 
to deviate, the following must hold: 


(18) 06 x Ó(sg, sc, xg) Prob( xgsg, sg) 
+ (sg, Sg» Xr ) Prob( x;|35, SG). 


Comparison of (17) and (18) shows that it 
is relatively more tempting for manager B to 
deviate by investing after observing sg, as 
opposed to sj. (It is in this sense that the 
out-of-equilibrium conjecture that a deviator 
has seen sg is “reasonable.”) Nonetheless, 
(18) reduces to. 


(18) 


which is strictly satisfied. Thus Proposition 2 
is proved, and we have established the exis- 
tence of a herding continuation equilibrium. 

It should be pointed out that there is 
another, perverse continuation equilibrium 
in which the decisions of manager B do not 
depend on his signal. In this equilibrium, 
manager B always contradicts manager A, 
investing if and only if A has not. This 
equilibrium can only be supported by the 


6>6/(1+ 0), 
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following "unreasonable" beliefs off the 
equilibrium path; if manager B deviates by 
investing when the equilibrium calls for him 
not to, it is because he has observed s,; and 


if he deviates by not investing when the 


equilibrium requires investment, it is because 
he has observed sg. 

The multiplicity of equilibria stems from 
our assumption that investment decisions do 
not directly affect the manager’s utility, but 
are nothing more than a means of conveying 
information to the labor market about the 
signal that the manager has observed. If the 
labor market (perversely) interprets invest- 
ment to mean that the manager has observed 
Sp, then the manager may well invest if he 
wishes to convince the labor market that he 
has seen sj. It follows that in the current 
formulation of our model, we cannot pin 
down exactly what actions will be taken. 
However, we can pin down how much infor- 
mation is revealed in equilibrium: In both 
equilibria, managers' actions do not depend 
on their private signals and hence convey no 
information.’ 

This reasoning suggests that the model 
can be altered slightly so as to leave the 
herding equilibrium as the unique outcome. 
Suppose that managers do not invest di- 
rectly; instead, they report their signals to 
the “owners” of the firm, who thena make 
investment decisions to maximize profits. 
Proposition 1 then can be interpreted as 
saying that there is no equilibrium where 


?The structure of our model is similar in some re- 
spects to Vincent Crawford and Joel Sobel’s (1982) 
“cheap talk” model of strategic information transmis- 
sion. In their framework, an informed party (the 
*sender") transmits a message to another party (the 
“receiver”), who then takes an action that affects the 
utility of both parties. Similarly, in our model, one can 
view the investment decision as a message sent by the 
informed manager and the wage paid by the outside 
labor market as the response of the receiver. (However, 
our model differs from theirs in that the message itself 
—the investment decision—has real economic conse- 
quences to a third party, namely, the manager's firm.) 
What matters in these models is not the actual wording 
of the messages, but rather how they are interpreted by 
the receiver. Hence, it is impossible to determine exactly 
what words will be sent, though it is possible to deter- 
mine how much information is contained in these words. 
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manager B can be relied on to make infor- 
mative reports. Given that he cannot learn 
anything from his manager, the owner of 
firm B will then have to rely on the only 
available information, the action of firm A. 
The unique profit-maximizing decision for 
the owner of firm B is thus always to mimic 
firm A. 

In sum, then, the contradiction equilib- 
rium is probably not a sensible one. If one 
dismisses it, the herding equilibrium is left as 
the unique continuation equilibrium of the 
game. It remains only to establish that the 
efficient behavior on the part of manager A 
that has been assumed to this point is part of 
an overall equilibrium. 


PROPOSITION 3: There exists an equilib- 
rium for the overall game where manager A 
invests if and only if he receives sz, and where 
manager B alviays mimics manager A regard- 
less of B’s signal. 


In the proposed equilibrium, there is no 
information inherent in manager B’s ac- 
tions, since he always does the same thing, 
regardless of his signal. Thus manager A can 
only be evaluated absolutely—his revised 
ability 64 is a function of only his action 
and the realized state. The updating rules are 
therefore identical to those given for the 
two-manager, independent signal case in 
equations (15) and (16). That is 


(9). 6^(sc, xg) 
= ^ (sy, x1) 
= 20p/(20p ^ (1— 0)); and 
Q0) 64(sg, xz) 
= 64( 55,4) 
= 20(1— p)/Q9(1- p) 
-(1- 0)). 
In order for manager A to be willing to 


invest after observing s; and not invest after 
observing sp, the following two conditions 
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must hold: 


(21) 64( sg, xy) bgt ó^(ss, x,)(- He) 
> 64 (5,, xy) ug 0^ (5 x1) (1— ng); 
(22) 6^(sg, xy) Hp +Â (sp, x,)(1— ug) 
= O4( sc, xy) My + O4( sg, x,)(1— n5). 


From equations (5) and (6), we can obtain 
the simplification: pg — (1— ng) = 0p + 
3(1—6)>4. The inequalities can then be 
verified, which proves that manager A’s be- 
havior is part of an equilibrium.!? 

Finally, it should be emphasized that 
herding equilibria exist in a model with any 
number of managers. Consider a third man- 
ager C, who moves after A and B, but 
before the state of the economy is realized. 
Since we have established that in a two- 
manager herding equilibrium, manager B's 
actions are independent of his signal, man- 
ager C learns nothing from observing what 
firm B has done. He learns only from ob- 
serving firm A, whose actions do depend on 
the signal received. Thus manager C is in 
exactly the same position as manager B be- 
fore him, and the same arguments can be 
used to show that he too cannot make his 
actions contingent on his private informa- 
tion. This line of reasoning can be applied 
repeatedly to any number of subsequent 
managers. Note, however, that other equilib- 
ria may also exist with more than two man- 
agers. For example, with three managers, the 
second may deviate from the first if he con- 
jectures that the third manager will join him 
in a new herd. 


Tt should be pointed out that with different param- 
eter values, reputational concerns could induce even 
manager A to behave differently. Suppose that xj is 
relatively large. Then efficiency could require manager 
A to invest after seeing sg even if the posterior proba- 
bility of success ug is small. Inspection of (21) reveals 
that this efficient equilibrium cannot be sustained if 
Bg <}; manager A will be unwilling to invest and 
thereby contradict the ex ante wisdom that success is 
very unlikely. 
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IIl. Countervailing Forces 


Up to this point, the model has been 
simplified to focus exclusively on the forces 
that make herd behavior likely. Naturally, 
there are other considerations which may 
offset herding tendencies. We discuss four 
such considerations below: managerial con- 
cern for profits; limited liability; wages that 
depend on relative, rather than absolute tal- 
ent; and alternative definitions of ability. 

The herding equilibrium derived above is 
generally inefficient relative to the first best 
for some configurations of private informa- 
tion. For example, if xj, + x; > 0, then the 
equilibrium is inefficient when manager 4 
observes Sg, manager B observes sg, and 
firm B fails to invest. The expected profits 
lost by firm B due to herding, denoted by 
II, are equal to 1(xj + xz). 

If, however, managers place some weight 
on expected profits in their objective func- 
tions, these inefficiencies can disappear. De- 
note by R the amount by which tbe in- 
centive constraint in (14) is violated. R 
measures the perceived reputational benefit 
to managers from herding, and it is propor- 
tional to 


[6(55, 55, Xy) Tus Ó(55, Sc» xg)] 
* [Ó(55, 55, Xz) — É(sp, s. x;)]. 


Managers who care only about their reputa- 
tions will always herd, since R > 0. But man- 
agers who care about profits will have to 
trade off R against II. For any given weight 
on profits in managers’ objective functions, a 
large enough value of II will restore the 
efficient signal-dependent equilibrium. Put 
differently, as the weight on profits increases, 
the range of parameter values over which 
there is herd behavior shrinks—the more 
egregious inefficiencies associated with herd 
behavior are alleviated, even though herding 
can still occur when II is relatively small. 
This points to a role for short-term incentive 
contracts. Even if managers cannot be bound 
to their firms for life (so that they always 
place some weight on reputation) short-term 
contracts may shift the focus toward ex- 
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pected profits, thereby reducing herding ten- 
dencies. 

Limited liability is another factor that can 
alter the tradeoff between reputation and 
profits. It may be that managers have fixed 
outside labor market opportunities and thus 
never have to accept a wage less than L. If L 
exceeds the wage to a manager with ability 
8(Sp, Sg, Xz) there is a floor on how poorly a 
manager can fare by disagreeing with his 
peers. Thus limited liability lowers R, the 
reputational gain from  herding. Like 
profit-based compensation, limited liability 
shrinks the range of parameter values over 
which herd behavior is observed. A corollary 
is that herding may become more or less of a 
problem as a manager's career progresses. 
On the one hand, there.is apt to be less 
uncertainty about the managers ability, 
which should reduce the incentives for herd 
behavior. On the other hand, later in a suc- 
cessful career, wages are probably higher 
above the outside alternative L. This latter 
effect can increase the propensity to herd. 

Relative ability concerns are a third factor 
that may offset herding tendencies. In our 
model, managers only care about having a 
high absolute ability assessment, 6. In cer- 
tain situations, however, they may also care 
about how their 0 compares with that of 
other managers. For example, there may be 
a "superstars" effect present (see Sherwin 
Rosen, 1981), with top-ranking managers 
getting a disproportionately high wage. If 
this is the case, manager B will be more 
reluctant to mimic manager A, since by do- 
ing so he destroys any possibility of being 
the top-ranked manager. 

A final way to attenuate the model’s pre- 
dictions regarding herd behavior would be to 
introduce a broader definition of ability. As 


' we have defined it, ability is nothing more 


than an aptitude for making precise fore- 
casts about the outcome of a given random 
variable. But ability might in addition. in- 
clude a knack for uncovering new random 
variables to study. Rather than having to 
decide simply whether or not to buy a cer- 
tain asset, managers may also be responsible 
for finding alternatives to that asset. Under 
these circumstances, a desire to earn a repu- 
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tation for “creativity” may deter managers 
from all choosing to buy the same asset. 


IV. Implications of the Model 


The theoretical model developed above has 
implications in a number of different areas. 
In order to give a feeling for some of the 
potential applications, we discuss a few ex- 
amples. 


A. Corporate Investment 


Bank lending to LDCs was mentioned ear- 
lier as an apparent instance of herd behavior 
in corporate investment. A recent paper by 
Randall Morck, Andrei Shleifer, and Robert 
Vishny (1989) suggests that the potential for 
problems may be more widespread. They 
study the effectiveness of boards of directors 
im dealing with poorly managed firms. Their 
principal empirical finding is that top man- 
agement firings are primarily associated with 
poor performance of a firm relative to its 
industry, rather than with industrywide fail- 
ures. They interpret these results as evidence 
that boards have a difficult time assigning 
blame to their managers for mistaken strate- 
gies, when other firms in the industry are 
following similar strategies—at least in this 
segment of the labor market, there seems to 
be support for the “sharing-the-blame” ef- 
fect discussed earlier. And as long as this 
effect is at work, one might expect that man- 
agerial behavior would be distorted in the 
direction of herding. 

Herd behavior in corporate investment 
may have important consequences for the 
adoption of new technologies. Papers by 
Joseph Farrell and Garth Saloner (1985), 
and Michael Katz and Carl Shapiro (1985), 
have shown that compatibility externalities 
can lead to bandwagon effects in technology 
adoption. Our work can be viewed as com- 
plementary to theirs: even when compatibil- 
ity is not an important concern, bandwagons 
can arise. If the manager of one firm adopts 
a particular technology, this creates a repu- 
tational externality, in the sense that other 
managers will tend to be biased toward the 
same technology for reputational reasons. 
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B. The Stock Market 


Herd behavior by money managers could 
provide a partial explanation for excessive 
stock market volatility. By mimicking the 
behavior of others (i.e., buying when others 
are buying, and selling when others are sell- 
ing) rather than responding to their private 
information, members of a herd will tend to 
amplify exogenous stock price shocks. In a 
sense, the ideas developed ‘here can be 
thought of as providing the “microfounda- 
tions" for stock market phenomena that are 
often thought to stem from psychological 
sources such as “groupthink,” mass eupho- 
ria, or panic. 

It should be pointed out, however, that the 
model of this paper does not fit perfectly 
into a stock market setting, due to the as- 
sumption of perfectly elastic supply and the 
consequent lack of a market clearing price. 
Adding pricing considerations would com- 
plicate the formal analysis ` considerably. 
Nonetheless, we think that our basic insights 
do carry over to the stock market. At any 
given level of prices, money managers are 
likely to have an idea about the extent to 
which their competitors are “in” the market. 
If this is the case, there is the possibility that 
money managers will mimic each others' as- 
set allocation strategies—upon observing 
that manager A has 50 percent of his assets 
in stocks and 50 percent in bonds, manager 
B' may aim for a similar portfolio composi- 
tion, even when his private information sug- 
gests that current price levels are too low or 
too high.!! Thus one testable implication of 
our model is that the asset allocation deci- 
sions of professional money managers should 
be more closely correlated over time than the 
decisions of equally active private investors 


"another subtlety that is present in the stock market 
case but not in our model is a continuum of investment 
choices. A portfolio manager whose private information 
tells him that stocks are overpriced can “partially herd” 
by putting 45 percent, rather. 50 percent of his assets in 
Stocks. This may tend to dampen the aggregate effects 
of herd behavior. 
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\ 
who are. unconcerned about their Pu 
tions. 

Robert Shiller and John Pound (1986) pre- 
sent some evidence that can be viewed as 
consistent with the existence of herd behav- 
ior in the stock market. They surveyed insti- 
tutional investors to determine the factors 
that went into their decision to buy a partic- 
ular stock. Purchase of stocks that had re- 
cently had large price run-ups tended to be 
motivated by the advice of others (other 
investment professionals, newsletters, etc.). 
This contrasted with more stable stocks, 
where fundamental research (a systematic 
search procedure for a security with certain 
characteristics) played a more important role. 
This suggests that the comfort inherent in 
following common wisdom can lead profes- 
sional money managers to invest in stocks 
where fundamentals might dictate otherwise. 


C. Decision Making Within Firms 


Recent work on the theory of the firm by 
Raaj Kumar Sah and Joseph Stiglitz (1985, 
1986) has emphasized the role of managers 
as information filters. They argue that firms 
may organize themselves internally in such a 
way as to take maximum advantage of the 
fact that different managers tend to have 
errors of judgment that are not perfectly 
correlated.: Thus there may be benefits to 
having decisions made by. committees, or 
through a vertical chain of command where 
projects can be rejected at various points 
along the way: 

Our model points up certain limitations 
that may be inherent in group decision mak- 
ing, and also offers some new insights about 
how organizational structure can facilitate 
the decision-making process. As a stylized 
example, consider the case of a capital bud- 
geting committee meeting, where the man- 
agers are supposed to vote in turn on a 
proposed investment project. Ideally, the 
point of having several managers vote is to 
gather a wide range of information. How- 
ever, if career concerns are present, this may 
not work well. Once the first manager. has 
voted, the others may simply echo his choice, 
regardless of their private beliefs. Thus a 
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false consensus is achieved, and the informa- 
tion of the other managers is wasted.!? 

One way around this problem is to have 
managers submit their votes simultaneously, 
perhaps in writing. However, this may limit 
valuable exchanges of ideas. An alternative 
approach is to have those with the stronger 
reputational concerns vote first. As we noted 
in the previous section, if the limited liability 
effect is not too important, reputational 
concerns will be strongest among young 
managers, since there is presumably more 
uncertainty about their ability. Thus if the 
committee consists of young and old execu- 
tives, the young ones should be asked to 
voice their opinions before the old ones. 
More generally, this line of reasoning implies 
an advantage to a “bottom up,” rather than 
“top down” organization of information flow 
within a firm. To the extent that new ideas 
can be passed upstream for approval, this 
may result in better decision making than if 
the ideas are originated at a high strategic 
planning level and then are passed down- 
stream for line manager input.’ 


V. Conclusions 


Herd behavior can arise in a variety of 
contexts, as a consequence of rational at- 
tempts by managers to enhance their reputa- 
tions as decision makers. In addition to rep- 
utational concerns, there are other factors 
that influence herding. One of these is the 
extent to which there are commonly unpre- 
dictable components to investment out- 
comes: correlated prediction errors lead to 
the “sharing-the-blame” effect that drives 


“There are other explanations for this “yes-man” 
effect. One is discussed in Solomon Asch’s (1955) classic 
study of the effects of group pressure. He found that 
experimental ‘subjects were reluctant to disagree with 
others in the group (confederates) despite the fact that 
the confederates’ stated opinions were clearly wrong. 
Asch interprets the results as evidence in support of the 
view that individuals have an inherent psychological 
desire to conform to group norms. 

BxXeitaro Hasegawa (1986) discusses at length the 
emphasis placed by Japanese corporation on imple- 


menting a bottom-up flow of information. This form of ` 


organizational design is known there as “ringi.” 
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List oF NOTATION 


Symbol Meaning, 

Investment proceeds in “high” state 
Investment proceeds in "low" state 

Ex arte probability of high state 

Good signal 

Bad signal 

Prob(sg| xy, smart) 

Prob(sg | x; , smart) 

Prob(sg | xy, dumb) = Prob(sg|x,, dumb) 
Ex ente probability that manager is smart 
Posterior probability that manager is smart 
Prob(xg sg) ` 

Prob(xg|5g) 

Profit passed up in herding equilibrium 
Reputational benefit to herding 

Wage that can be earned in outside 


opportunity 


T 


hae F Won uy DOR 


managers to herd. Also important is the na- 
ture of the managerial labor market: herding 
is more likely to be a problem when man- 
agers outside opportunities are relatively 
unattractive, and when: compensation de- 
pends on absolute rather than relative ability ` 
assessment. 
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Perfect Equilibria in a Trade Liberalization Game 


By KIMINORI MATSUYAMA* 


The credibility of temporary protection is examined in a simple infinite horizon, 
perfect information game of timing in which the domestic government uses the 
threat of future liberalization to induce the domestic firm to invest. All pure 
Strategy subgame-perfect equilibria are cyclical and, surprisingly, one of them 
implements optimal temporary protection. However, this equilibrium fails to pass 
another credibility criterion called “Renegotiation-Proof.” The game has a unique 
Stationary subgame-perfect equilibrium in mixed strategies. (JEL 026, 422) 


In trade policy debates, it is often argued 
that domestic industries should receive tem- 
porary protection from import competition. 
Immediate trade liberalization and ensuing 
inflows of foreign products and capital would 
jeopardize domestic firms, while protection 
would allow the domestic industries to intro- 
duce new technologies and products, thereby 
effectively competing with their foreign ri- 
vals, Any such protectionist measure should 
be temporary, because, under permanent 
protection, the lack of competitive pressure 
reduces incentives for domestic firms to ra- 
tionalize their operations and to hold down 
costs. For example, this is the idea underly- 
ing the escape clauses in Article XIX of the 
General Agreement on Tariffs and Trade 
(GATT) and Sections 201 through 203 of the 
U.S. Trade Act of 1974. 

A similar argument arises in the context of 
infant industry protection. Temporary sup- 
port by governments sometimes helps new 
industries grow strong enough to meet inter- 
national competition, but indefinitely im- 


*Department of Economics, Northwestern Univer- 
sity, 2003 Sheridan Road, Evanston, IL 60208. This is a 
. substantially revised and shortened version of Mat- 
suyama (1987). I would like to thank Dilip Abreu, Eric 
Maskin, Daniel Vincent, Michael Whinston, and partic- 
ipants of the Math Center Bag Lunch seminar at North- 
western University, the NBER Summer Institute, and 
the conference on “Designing Policies to Open Trade” 
at the University of California-Davis, for their insight- 
ful discussion and two anonymous referees for helpful 
comments. My special thanks go to one referee whose 
advice helps to improve exposition greatly. All remain- 
ing shortcomings are my own. 
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posed protection often results in perpetual 
industrial stagnation. 

Despite its pervasiveness and (or perhaps 
because of its) persuasiveness, few studies 
have attempted to formalize this line of ar- 
gument.! In a recent paper on infant indus- 
try protection (Matsuyama and Motoshige 
Itoh, 1986), we showed that it can be theo- 
retically justified, using Michael Spence's 
(1979) model of a dynamic oligopoly market. 


!In the standard literature of infant industry protec- 
tion, the argument for temporary protection is based on 
the technological assumption that dynamic external 
economies, which could justify protection policies, dis- 
appear once the industry becomes mature. For example, 
Max Corden (1974, p. 256) discusses why protection 
should be temporary. “The temporary element can en- 
ter in three ways. (1) The learning may itself be tempo- 
rary, being a characteristic of the firm's infancy period. 
(2) The imperfection of information or of the capital 
market, as these apply to the firm concerned, may be 
temporary: as the firm expands and its costs fall it may 
find it easier to finance further investments, whether in 
visible or invisible capital. (3) We may be constrained to 
the use of a tariff as a method of protection (the fiscal 
constraint ruling out direct or indirect export subsidiza- 
tion), so that the tariff could end once imports of the 
product have been completely replaced, and should end 
if the firm has monopoly power and above-normal 
profits are to be avoided." On the other hand, the 
popular argument in trade policy debates puts more 
emphasis on the incentive effect of temporary protec- 
tion. For example, the study by the Organization for 
Economic Cooperation and Development (1985, p. 22) 
argues that "[p]rotection itself becomes less effective in 
promoting adjustment when—as a result of the re- 
peated renewal of protectionist measures—the firms 
being protected have no reason to expect that they will 
even be exposed to the full challenge of international 
competition." 
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In this model, temporary protection works 
for two reasons. First, it provides the domes- 
tic firm with an opportunity to accumulate 
its capital stock. Second, the anticipation of 
future removal of the barrier (the temporary 
nature of protection) gives an incentive for 
the domestic firm to do so. We also showed 
that such a temporary protection policy can 
be optimal from the national welfare view- 
point. 

However, we did not advocate adopting 
such temporary protection, because the ef- 
fectiveness of temporary protection crucially 
rests on the presumption that the govern- 
ment can make a credible commitment on 

: the future removal of protectionist measures, 
which we doubt is necessarily the case.” Af- 
ter all, there have been many protection poli- 
cies, which were said to be temporary and 
turned out to be permanent. Indeed, this 
danger of prolonged protection is widely rec- 
ognized. For example, Rachel McCulloch 
warns that: 


In principle, [temporary protection] 
policies provide breathing room for the 
affected industry, time in which to im- 
prove its competitive position or, in 
some cases, to phase out domestic pro- 
duction of goods where comparative 
advantage has shifted unambiguously 
abroad. In practice, however, neither 
adjustment to a status of full competi- 
tiveness nor phasing out of domestic 
production will necessarily occur dur- 
ing the limited period of import relief, 
so that the same industries return again 
and again for additional “temporary” 
relief. [1985, p. 154] 


In this paper, I formalize this pitfall of 
optimal temporary. protection: dynamic in- 
consistency.’ To do so, I construct a simple 


"This is not the only reason why we did not advocate 
temporary protection. See the concluding section of 
Matsuyama and Itoh for other reasons. 

The seminal paper on the issue of dynamic inconsis- 
tency of optimal economic policies is Finn Kydland and 
Edward Prescott (1977). Most of recent developments 
center on monetary and fiscal policies. A few studies 
have addressed this issue in trade policies: see Maskin 
and David Newbery (1986), Robert Staiger and Guido 
Tabellini (1987) and Aaron Tornell (1987).. 
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infinite horizon, perfect information game of 
timing in which the domestic government 
and the domestic firm move alternately. The 
government prefers liberalizing the domestic 
market, given the firm’s investment position. 
However, it is also willing to wait for an- 
other period if this could induce the firm to 
invest. The firm, if it believes that future 
liberalization is inevitable, prefers investing 
right before the liberalization in order to 
prepare for competition with its foreign ri- 
val. However, it may choose not to invest, 
hoping that this could induce the govern- 
ment to postpone the liberalization. 

This game is meant to encompass a vari- 
ety of situations. First, it could be the case 
of “infant” industries in developing coun- 
tries whose governments decide whether to 
remove the inherited protection immediately 
or to wait until an announced future date. 
Second, it could be the case of “injured” 
industries in developed countries, where the 
governments consider granting temporary re- 
lief in each period. Third, it could be the 
case of “declining” industries whose protec- 
tion is intended to ease pains of adjustment 
by allowing for gradual contraction of the 
industries. In the second and third cases, 
“liberalize” should be read as “refuse to 
grant import relief” and, in the third case, 
“invest” should be read as “reallocate.” 
What is crucial in terms of formal analysis is 
that each player’s action (but not its an- 
nouncement) has irreversible consequences, 
which requires a game of timing framework, 
instead of a repeated game framework.* 


“In a simple game of timing, each player’s only 
choice is when and whether to take a single pre-speci- 
fied action, and the game ends once one player has 
moved. Two kinds of timing games are extensively 
discussed in the literature (Ken Hendricks and Charles 
Wilson, 1986, and Hendricks, Andrew Weiss, and Wil- 
son, 1987). The first is the “war of attrition” developed 
in theoretical biology. Two animals fight over a terri- 
tory. Fighting costs. Once one animal quits, its oppo- 
nent gains the territory. This game has been applied 
to the problem of exit. See, for example, Pankaj 
Ghemawat and Barry Nalebuff (1985). In a war of 
attrition, each player prefers that the other moves first, 
but, if it has to move first, it prefers to do so sooner. On 
the other hand, in the second type of game of timing, 
"preemption game," each player prefers to move first, 
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The analysis presented below is partly mo- 
tivated by the recent renewed interest in 
temporary protection as a way of diverting 
political pressure of protectionism: see, for 
example, Gary Hufbauer and Howard 
Rosen’s (1986) and Robert Lawrence and 
Robert Litan’s (1986) proposal of “refurb- 
ishing escape clauses.” It is not at all clear 
what sort of political economic model they 
have in mind, but it appears quite difficult to 
deny any theoretical possibility of optimal 
temporary protection. The main theme of 
this paper is that adopting temporary protec- 
tion may be a bad idea, even if it is optimal, 
due to the lack of credibility. And the cen- 
tral question asked is in what sense “op- 
timal” policy lacks credibility. For this 
purpose, I intentionally do not describe situ- 
ations in detail, which could give rise to an 
optimal policy of temporary protection. This 
is not to say that I believe temporary protec- 
tion is generally optimal from the national 
welfare viewpoint. In fact, the government's 
objective is not necessarily identified as the 
national welfare below. 

After presenting the game in Section I, the 
pure strategy Nash and subgame-perfect 
equilibria are discussed in Section II. Sub- 
game perfection helps to eliminate all but a 
finite number of Nash equilibria, but, rather 
surprisingly, optimal temporary protection 
(and all Nash outcomes) can be supported 
by a subgame-perfect: equilibrium, which 
suggests the inadequate power of the sub- 
game-perfect restriction in this game. It is 


but, it prefers to do so later. See Drew Fudenberg and 
Jean Tirole (1985), The two firms plan to introduce a 
new product to the market that can profitably support 
only one firm, and it would be cheaper to introduce the 
product later. - 

One can think of the game in this paper as a hybrid 
of a war of attrition and a preemption game; The 
government’s payoff has the same property as in a war 
of attrition, while the firm’s payoff has the same as in a 
preemption game. That is, the government prefers that 
the firm moves first (invest before liberalization), but if 
it has to move first (liberalize before investment), it 
prefers to do so sooner. On the other hand, the firm 
prefers to move first (invest before liberalization), but it 
prefers to do so later. 
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also shown that all subgame-perfect equilib- 
ria are cyclical. 

In Section HI, I turn to the siationary 
subgame-perfect equilibrium, which exists 
uniquely in mixed strategies. I will calculate 
the probability with which the optimal tem- 
porary protection succeeds and the expected 
length of the protection period, and show 
that the success rate is higher as the govern- 
ment is more impatient and as the firm is 
more patient and that, as both players are 
more patient, protection would last longer. 
The section ends by discussing an alternative 
interpretation of mixed strategies. 

In Section IV, I employ another credibility 
criterion called “renegotiation-proof,” re- 
cently developed by Joseph Farrell and Eric 
Maskin (1987) and David Pearce (1987). It 
will be shown that the subgame-perfect equi- 
librium that supports optimal temporary 
protection is not renegotiation-proof. Sec- 
tion V concludes the paper. 


Y. The Liberalization Game 


Imagine the following scenatio.? Initially, 
the domestic monopoly firm (player 2) earns 
its maximum profit in the protected domes- 
tic market. Then, a new government (player 
1) takes office and the game starts. At the 
beginning of period 1, the government de- 
cides whether it liberalizes the market (L) or 
not (NL). If it chooses L, the foreign firm 
enters the market and both firms play the 
post-entry game from period 1 on. The liber- 
alization game ends. If the government 
chooses NL, the domestic firm decides 
whether it invests (J) or not (NI). If it 
chooses J, the domestic firm earns less profit 
in period 1. The government will liberalize 
the market at the beginning of period 2 and 
the foreign firm will enter and both firms 
will play the post-entry game from period 2 
on, with the domestic firm having the 
first-mover advantage. The liberalization 
game ends. If the domestic firm chooses NI, 


5This is only for fixing the idea. Other interpretations 
can easily be made, as suggested in the introduction. 
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FIGURE 1. THE TRADE LIBERALIZATION GAME IN 
K. MATSUYAMA, “PERFECT EQUILIBRIA IN A 
TRADE LIBERALIZATION GAME” 


then it earns its maximum profit in period 1. 
In this case, the government must decide 
between L and NL at the beginning of 
period 2 in the same situation as in period 1. 
This process continues until either the gov- 
ernment chooses L or the domestic firm 
chooses J. 

Figure 1 gives an extensive form represen- 
tation of this game. Some explanations of 
notations are in order: 


M: The government does not liberalize (NL) 
and the firm does not invest (NI). This is 
the status quo situation. Let M, and M, 
be the government's one-period payoff 
and the domestic firm's one-period profit 
in M, respectively. 

N: The government liberalizes (L) before 
the firm invests. The firm plays the post- 
entry game with its foreign rival without 
the first-mover advantage. Let N, and 
N, be the government's and the domes- 
tic firm's one-period payoffs in N. 

P: The government does not liberalize (NL) 
and the firm invests (J). Define P, and 
P, analogously. 

Q: The government liberalizes ( L) after the 
firm invests.. The firm plays the post- 
entry game with its foreign rival, with its 
first-mover advantage. Define Q, and 
Q, analogously. 


Let 0 < d, <1 and 0 < d, <1 be the discount 
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factors of the government and the firm. In 
Figure 1, payoffs of both players are given at 
each terminal node. For example, if the gov- 
ernment chooses NL in period 1, the firm 
responds by NI, and the government chooses 
L in period 2, then the state of the market is 
M in period 1 and N from period 2 on. 
Thus, the payoff of the government is 


M, + d, N, + d2N, + d?N, + 
= M,+ 4N/ü rs di) 


and the payoff of the firm is M,+d,N,/ 
(1— dy). 

To define the payoff functions more for- 
mally, let g= (g(1), 9(2),...} represent a 
strategy of the government, where g(t) is 
either L or NL, its move in period ¢ if the 
game has not ended. Likewise, let f be a 
strategy of the firm, with f(t)=J or NI. 
(Only pure strategies are considered until 
Section III.) Define m(g) as the smallest £t 
satisfying g(t) = L and n( f) as the smallest 
t satisfying f(t) = I. (Let m(g) oo if g(t) 
= NL for all ¢ and n(f)=œ if f(t) * NI 
for all t.) Then, the payoff functions of the 
two players are given by 


U,(g. f) 
X,(n(f)) 
if n(f) « m(g), 
- (A 1,2) 
Y,(m(g)-1) 
if m(g) <n(f), 
where, l 


X,(q) = [1- a£] M,/(1— d,) 
+ df [P, - d,0,/(1— d,)], 
[i= dj} M,/(1- d,) 
djN,/(1— dj) 
= [1- 4$] M,/(1- 4,) 
t di ?[M,-* d,N,/(1— d,)]. 


Y,(¢) = 
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Note that the outcome of the game, that is, 
the terminal node to be reached, depends 
solely on m(g) if m(g) xn(f) and solely 
on n(f) if m(g)>n(f).° When m(g) =1, 
the outcome is immediate liberalization and 
the payoffs are given by Y,(0) = N, /(1— d,). 
When m(g)=q+1<n(f), it is an unsuc- 
cessful q-period protection and the payoffs are 
Y,(q). When n(f)=q<m(g), it is a suc- 
cessful q-period protection and the payoffs are 
given by X,(4).- 

I make the following assumptions on the 
government's and the firm's one-period pay- 
offs. First, given the investment position of 
the domestic firm, the government always 
prefers liberalization and the firm always 
prefers protection: (Al) M, < N,, (A2) P,< 
Q (A3) M, > N}, and (A4) P,» Q,. Note 
that (A2) is consistent with the assumption 
implicit above that, once the firm invests, the 
government liberalizes the market in the next 
period. Second, the firm has no incentive to 
invest without threat of future liberalization: 
(A5) M, > P,. But, investment before liber- 
alization gives the firm advantage in the 
post-entry game with the foreign firm: (A6) 
Q> N,. Note that assumptions (A4)-(A6) 
imply (A3). Finally, the government prefers 
the domestic firm having advantage over the 
foreign firm: (A7) Q, > N,. Matsuyama and 
Itoh provide an example satisfying these as- 
sumptions, when the government’s goal is 
identified as the national welfare.’ More- 
over, these assumptions appear consistent 
with the view of the world held by the advo- 
cates of escape clause protection: see Huf- 
bauer and Rosen, and Lawrence and Litan. 

I also assume that both players are suffi- 
ciently patient, since this game would be 
fairly trivial if either player discounts its 


S'This does not mean that one can redefine the strat- 
egy sets so that a player's strategy is simply when it 
!chooses to end the liberalization game. This is because 
the credibility of both the government and the firm’s 
actions cannot be discussed without referring to the 
belief on what would occur in case one of the players 
fails to carry out its prescribed action. 

"This example in Matsuyama and Itoh also satisfies 
that P, > Mj: the domestic firm under-invests. How- 
ever, I do not use this assumption below. 
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future payoff heavily. For example, an impa- 
tient government might be unwilling to wait 
even for one period in order to induce the 
firm to invest. Then, the government can 
always reach its first best outcome by liberal- 
izing immediately. To rule out this case, I 
assume 


(A8) Y,(0) T N,/(1 = dı) 
« P,* diQ/(1- dj) = X (1). 


Given (A1), (A8) is also the condition for 
successful one-period protection to be the 
optimal outcome for the government.® 

On the other hand, when the firm dis- 
counts its future payoff heavily, the govern- 
ment cannot induce the firm to invest. The 
threat of future liberalization would not 
work. Knowing this, the government would 
choose to liberalize the market in period 1. 
The following assumption helps to rule out 
this case.? 


(A9) X,(1) =P, + d,0,/(1- dz) 
> M,+d,N,/(1-d,) = Y, (1). 


From (A1), (A3), (A8), and (A9), X,(q) 
and Y,(q) can be shown to satisfy: 


(P1) X,(q) is decreasing in q, 
(P2)  X,(4)and ¥,(q) 


are increasing in q, 


(P3) X,(4) > Y (a -1), 
(P4) X(4) > ¥(q), 
(P5) ¥,(0) > X,(0). 


*From (A2) and (A7), (A8) is equivalent to (N, — 
Pj)/(Qj— Pj) «d, «1. Thus, the government might 
need to be patient to be interested in temporary protec- 
tion. Note that N, may be smaller than P,, in which 
case (A8) imposes no restriction. Matsuyama (1987, 
Proposition 1) shows that, when (A8) does not hold, any 


' strategy with g(1)= L is a dominant strategy and im- 


mediate liberalization is the dominant strategy outcome. 

?From (AS)-(A6), (A9) is equivalent to (Mj— 
P,/(M, — P) + (Q2 ~ Nj) < d3 «1. Matsuyama 
(1987, Propositions 2 and 3) show that, when (A9) does 
not hold, immediate liberalization is the iteratve domi- 
nant outcome. 
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With (P1)-(P5), we are now ready to charac- 
terize pure strategy equilibria.!? 


I. Subgame-Perfect Equilibria 


Let us first look at Nash equilibria. To 
do so, note that, from (P1), (P3), and (P5), 
there exists a unique positive integer g* sat- 
isfying X,(q*) > Y,(0) > X,(q*+1). This is 
the maximal number of periods for which 
the government is willing to wait for the firm 
to invest. To simplify exposition, I assume 
away the nongeneric case X,(q*) = Y,(0).? 
Then, 


(P6) X,(4*) > Y,(0) > X (g** 1). 


PROPOSITION 1: The pure strategy Nash 
equilibria are characterized by either 


(1.1) m(g) 21, and n(f)>q*+1, 
or 


02) 1xn(f)-m(sg)-1xq*, 


which suggests that only immediate liberaliza- 
tion and successful q-period protection, where 
1<q<q*, are pure strategy Nash outcomes. 


PROOF: 
From the definition of qg*, the govern- 
ment's best response is m(g)=1 if n(f)» 


. To characterize the equilibria, all I need is As- 
sumptions (A1), (A3), (A8), and (A9). However, other 
assumptions are necessary to interpret (A8) and (A9) as 
the, need for patience of the players. 

NA pair of strategies (g, f) is called a Nash equilib- 
rium if g maximizes the government's payoff given f 
and f maximizes the firm's payoff given g (the mutual 
best response property). A Nash outcome is a terminal 
node to be reached at a Nash equilibrium. 

See Matsuyama (1987) for the case of X,(q*) = 
¥,(1). It can be shown, from (AL), (A2), and (A7), that 
there exists a strictly increasing sequence d(q) satisfy- 
ing d(1) = Max(0,(N, — P,)/(Qy — P,)}, d(oo) —1, and 
that d(q*) < dj < d(q* +1) is equivalent to (P6). This 
implies that a more patient government is willing to 
wait longer. It also means that, for any large positive 
integer q, there exists a number d € (d(q), 1), such that 
the government whose discount factor is equal to d is 
willing to wait for more than q periods. 
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q* +1; m(g) > n(f)if n(f) x q*. The firm's 
best response is n(f) z m(g).if m(g)=1, 
since, when m(g)=1, its payoff does not 
depend on n(f); n(f) ^ m(g)-1 if m(g) 
22, since X,(q) is increasing in q and 
X,(m(g)— 1) > ¥,(m(g)—1) from (P4). 
Therefore, (1.1) and (1.2) are the only com- 
binations of m(g) and n(f) which satisfy 
the mutual best response property. o 


COROLLARY 1: The optimal temporary 
protection can be supported by a Nash equilib- 
rium with m(g)=2 and n( f) =1. 


PROOF: 
It is immediate from setting n(f)=1 in 
(1.2). 


At the equilibrium given in the corollary, the 
government is willing to wait in period 1 if 
the firm invests in period 1. The firm will 
invest in period 1 if it believes that the 
government will liberalize in period 2, no 
matter what its investment position is; the 
threat of liberalization works, if believed. 
Moreover, since the firm invests in period 1, 
the government incurs no cost by commit- 
ting to the liberalization in period 2. 

It is often argued, however, that the Nash. 
equilibrium concept requires very weak ra- 
tionality on the part of players, and there- 
fore allows too many outcomes, some of 
which are unreasonable on intuitive grounds. 
In the context of the liberalization game, is it 
reasonable for the firm to believe that the 
liberalization in period 2 is inevitable? Is the 
government’s commitment credible? If the 
government had to choose between L and 
NL in period 2 after the firm chose NI in 
period 1—which would never happen when 
the prescribed strategies m(g) = 2, n(f) =1, 
are followed — does the government still have 
an incentive to stick to the prescribed move 
g(2)=L? 

To examine whether the Nash equilibria 
possess this sort of credibility, I will look at 
subgame-perfect equilibria, following Rein- 
hard Selten (1975). Subgame perfection 


DPA pair of strategies (g,f) is a subgame-perfect 
equilibrium of the game if its restriction to any subgame 
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means that no player can influence the out- 
come of the game by trying to make a threat 
which it would not carry out if called upon 
to do so. It turns out that subgame perfec- 
tion helps to eliminate all but a finite num- 
ber of Nash equilibria but it is not powerful 
enough to eliminate any of the Nash out- 
comes. 


PROPOSITION 2: There are q* 4*1 ue 
strategy subgame perfect equilibria, (g* if ky 
{k =0,1,...,q*}, defined by, 


eG) te ,  ift#k+1(modg*+1), 
NL, otherwise, 


I, if t s k (mod q*+1), 
f*(t) = l if i ( q ) 
. ANI, otherwise, 


which suggests that only immediate liberaliza- 
tion and successful q-period protection, where 
1<q<q*, are pure strategy subgame-perfect 
outcomes. 


PROOF: 

First, note that, for any subgame-perfect 
equilibrium (g, f), (P2) implies that f(t) = 
I—>gt+1)=L and (P4) implies that 
gt *1)—- L2 fal 

Now, suppose that m(g)=1. From 
Proposition 1, n(f)zq*-1 or f(t) - NI 
for t=1...q*. Then, from (P4), g(t) - NL 
for t=2...g*+1. This in turn implies 
f(q*+1) I because, by applying Propo- 
sition 1 to the subgame starting at t= 2, 
f(q* * 1) = NI would imply g(2) = L, a con- 
tradiction. From (P2), f(q*+1) = I implies 
g(q* +2) = L, which also guarantees that the 
restriction of (g, f) to the subgame starting 
atr (3xtx q* 41) is a Nash equilibrium of 
the subgame. By repeating this process for 
the subgame starting at ¢=1+(q*+1)z for 
any integer z, one can show that g(t) ^ L if 
and only if t =1+(q*+1)z and f(t) — I if 
and only if t=(q*+1)z. In other words, 


Mt En M NOR Ru MEE UR rtr 
of the original game is a Nash equilibrium of the 
subgame. An outcome of the game is subgame perfect if 
there exists a subgame-perfect equilibrium that leads to 
this outcome. 
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(g°,-f°) is a subgame-perfect equilibrium 
and the only one that satisfies m(g) =1. 
One can show that (g^, f*) is the only 
subgame-perfect equilibrium that 1 2 
m(g) = k+1 and n(f)=k, for k=1.. 
in a similar manner. Finally, there is z 
Nash, and therefore, no subgame-perfect 
equilibrium with m(g) x q*=2, from 
Proposition 1. Oo 


Proposition 2 shows that every pure strategy 
subgame-perfect equilibrium exhibits cycles 
with period qg*+1. Intuition behind the 
cyclicity should be clear. If the firm believes 
that the government will liberalize, it will 
invest. But, if the government believes that 
the firm will invest within g* periods, it will 
wait. But, if the firm believes that the gov- 
ernment will wait, it will not invest. But, if 
the firm will not invest within g* periods, 
the government will not wait, ad infinitum. 
The proposition also states that there are 
q* +1 subgame-perfect equilibria. This is due 
to the infinite, recursive nature of the game. 
Subgame perfection would not impose any 
restriction on the phase of the cycle. 


Remark 1: Although the infinite nature of 
the game is crucial for the multiplicity of 
equilibria, the mechanism which generates 
the multiplicity in this game is different from 
that in infinitely repeated games. In an in- 
finitely repeated game, the multiplicity arises 
only when each player’s strategy is history- 
dependent. The equilibria shown in Proposi- 
tion 2 are history-independent: neither play- 
er conditions its action on the past action 
by the opponent, which is payoff irrelevant. 
To see why the multiplicity arises despite 
history-independence, consider a T-period 
truncation of this game, I',, where the play- 
ers are barred from moving after period T. 
Then, there is the unique cyclical subgame- 
perfect equilibrium. (The government liber- 
alizes in period T, which determines the 
phase of the cycle.) Obviously, the original 
game can be considered as T, the limit of 
any subsequence of (T7... Let or denote 
the unique equilibrium of I. From a theo- 
rem on limit games, such as Fudenberg and 
David Levine's (1983), the limit of equilibria 


VOL. 80 NO. 3- 


of a sequence of games is an equilibrium 
of the limit of the games. The cyclicity im- 
plies that, although a sequence {o7}%_, 
does not converge, its subsequences, 
(orina z}z=1 {kK =0,1,2...q*} con- 
verge to different equilibria, (g^ A k) (k= 
0,1,2... q*), all of which are equilibria of the 
original di Ie since it is the limits of 
{Ikie} {k= 0,1,2...q*}. . This 
shows that the cyclicity, not the history- 
dependence, is the source of ee aes 


The next proposition is a. direct coroll 
of Proposition 2.. , THAM 


PROPOSITION 3: The optimal temporary _ 


protection can be su supported by a subgame-per- 
fect equilibrium (g, f*) defined by g (1) - L 
if and only if t = 2 (mod q* +1) and f (t) = I 
if and only if t=1 (mod q*-- 1). This is the 
only subgame-perfect equilibrium that supports 
optimal temporary proteron: l 


Proposition 3 states that Liberalization i in pe- 
riod 2 can be made credible through con- 
structing a cyclical form of strategies after 
period 2. It also demonstrates that the only 
one of the Nash equilibria shown in the 
corollary of Proposition 1 is a subgame-per- 
fect equilibrium. 

The question is then: Are these equilibria 
"reasonable"? My answer is “Probably not." 
First, these equilibria have a “bootstrap” 


nature. The government changes its behavior . 


periodically only because the firm: does so, 
and the.firm does so only because the gov- 
ernment does so. Second, optimal temporary 


protection can be made credible only if im- ` 


mediate liberalization is credible. But, in the 
subgame-perfect equilibrium shown. above, 
immediate liberalization can be. made credi- 


ble only because, if it would fail to liberalize, ` 


the government would punish itself by mak- 
ing a commitment not to liberalize for some 
time to come. This “self-punishing” property 
makes this equilibrium hardly convincing. 
Third, the cyclical nature of the strategies 
seems at odds with the following intuitive, 
although somewhat ad hoc, argument. If the 
firm does not invest in period 1, the situation 
the government faces i in period: 2 is exactly 
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the same as in eve 1; given the recursive 
structure of the game. If the government did 
not liberalize in period 1, how can one be- 
lieve that it does not do the same in period 
2?: These considerations suggest a stronger 
restriction on equilibria: that is, each player 
must choose the same move when it faces 


.the same situation. In the next section, I will 
` turn to a stationary subgame-perfect equilib- 


rium, which satisfies this requirement. 
Finally, the equilibria in Proposition 2 are 

vulnerable to the possibility of renegotiation. 

I will turn to the problem of renegotiation- 


_ proofness in Section IV. 


IIl. Stationary Subgame-Perfect Equilibrium 


The stationarity requires that strategies 
should be time-independent. Once this prop- 
erty is imposed, one cannot hope that a 
subgame-perfect equilibrium exists in. pure 
strategy space. In this section, I allow ran- 
domized strategies. Both players are as- 
sumed to be risk-neutral. Let V = (V,,V,) 


. represent the value of (the remainder of) the 


game for both players. In each round, the 
government chooses L with probability u 
and the firm chooses I with probability v. 
Let (u*, v*) denote equilibrium mixed strate- 
gies. From the argument. above, 0 < u* <1, 
0 < v* <1. This implies that both players are 


. indifferent between their alternatives: that is, 


(1) Py+ 4,0,/(1 nt dj) =M, + dV, 


0) N/ü- d JEPUA +d,Q,/(1~ 4) 


t a —v*)(M, r 4i). 


Éuriheiore; from the onmin of V= 
(Vy, V), 


O W-N/ü-4). 
(4). V, w*N,/(1- dj) ... 
| +(1>u*) 


XB * 40,/0 - 4). 
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Eliminating V, and V, from (1)-(4) gives. 


(5) N/ü-4) 
=v*{ Py + 4,0,/(1- 4) 
*(1— v*)( M; * d, N,/(1— 4,)} f 


(6) P,*4,0,/(1— 42) 
=M, + d,[u*N,/(1— d) 
+(1-u*) l 
x (P, 4,9,/0.- 4d;))]. 


There is a unique 0 « u* «1 which satisfies 
(6) and a unique 0 « v* «1 which satisfies 
(5). Furthermore, u* is a function of d, but 
not of d;, and v* is a function of d, but not 
of d, which allows us to denote u* = u(d;) 
and v* = v(d,). One can also show that these 
functions have the following properties: 
u(d.) is decreasing in d, u(d,) >1 as d, > 
d =(M,— P,)/[(M,— Pj) * (Q; — N;)), and 
u(d;) 0 as d,—1; v(dj) is decreasing 
in dy, v(d)) ^1 as d, d(1) = (N, + P,)/ 
(Q, — P,), and v(d,) 0 as Aah In other 
words, 


PROPOSITION 4: There exists a unique 
mixed strategy stationary subgame-perfect 
equilibrium, in which the government chooses 
L with probability u(d,) and the firm chooses 
I with probability v(d,), where both u: (d,1) 
(0,1) and v:(4(1),1) > (0,1) are decreas- 
ing, one-to-one functions.* 


It is easy to see why functions u and v have 
these properties. In order to make the choice 
between L:and NL indifferent, a more impa- 
tient government (a low d,) needs to be 
convinced.that it is likelier that the firm 
would invest before liberalization, if another 
chance is given (a high v*). Likewise, in 
order to be indifferent between 7 and NI, a 
more myopic firm (a lower d,) needs to be 
convinced that future liberalization is like- 
lier, even if it does not invest (a high u*). 
From (3), the value of the game to the 
government is N, /(1— d,). It fails to achieve 


Tt remains an open question whether a nonstation- 
ary mixed strategy subgame-perfect equilibrium exists. 
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its maximum payoff due to its inability to 
make a credible commitment on the second- 
period liberalization: 


(90) V,=¥,(0) =™,/(—d,) 
< P,+4,Q,/(1—d,) = X,(1). 
On the other hand, from (1) and (A9), 


(8 ¥,(0) =",/(1-d,) «V, 
< P, +4,Q,/(1~ d;) = X,(]1). 
Thus, . this equilibrium weakly Pareto- 


dominates immediate liberalization, while it 
is strictly Pareto-dominated by. one-period 
successful protection. 

The probability with which optimal tem- 
porary protection succeeds is i 


[1- u(d;)] v(d;). 


The success rate is higher as the government 
is more impatient and as the firm is more 
patient. Another measure of “success” would 
be the probability with which the firm will 
eventually invest and the market ends Ep. in 
Q. This is given by 


(1-u*)e*/[1-(1- v) - v9]; 


which is also decreasing in d, and increasing 
in d,. One can also calculate the expected 
length of protection. The market is liberal- 
ized in period 1 with probability u*. The 
sécond-period liberalization: has probability. — 
equal to (1— u*jv* -(1— u(1— vSu* = | 
(1— u*){1 —(1 — u*1— »*), In general, 
protection lasts for T' periods with probabil- 
ity (1— 90 - 0 - v - v*)( - w*) 
(1— v*)}"~1. From this, the Ese length 
of protection is 


- yü- (1- aiee 


which increases with d, and d}. Therefore, 
if both players are more patient, protection 
would last longer. 

To some economists, mixed strategies seem 
odd and less appealing, but they would ap- 
pear more reasonable upon closer inspec- 
tion. Certainly, it is hard to imagine the 


—X 
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chairperson of the International Trade Com- 
mission (ITC) or the president casting dice 
to decide whether or not to protect the do- 
mestic industries not ready to face import 
competition. However, this aspect of the 
equilibrium is attributable to the fact that 
there is no chance move in the model. To see 
this, note that randomizing matters, not be- 
cause it would affect the distribution of the 
player's payoff directly, but because it would 
keep the opponent uncertain about the 
player's choice. Therefore, a random dis- 
turbance to the player's payoff, which is 
unknown when the opponent moves but 
known when the player moves would do.! 
For example, suppose that, in each period, 
uncertainty about the domestic demand re- 
solves after the government's move and be- 
fore the firm's. The firm can make a choice 
deterministically contingent on the market 
condition, but the government knows only 
the probability distribution of the firm's 
choice. Likewise, the government's decision 
in the next period would reflect new devel- 
opments on economic and political situa- 
tions after the firm's investment decision in 
this period. Then, even if the government 
uses pure strategies, the firm is uncertain 
about the government's move. 

In reality, it is highly uncertain whether a 
domestic industry would receive temporary 
protection. For example, Lawrence and Litan 
(1986, Table 3-2) reports the U.S. experience 
of escape clause relief. It shows that, be- 
tween 1975 and 1985, the presidents have 
granted domestic industries import relief in 
only fourteen of the 33 cases which the ITC 
affirmed import injury. It is not hard to 
imagine that the presidents made their deci- 
sions taking into account many factors which 
did not exist when these industries had 
brought the cases before the ITC. 


This also explains why the probability distribution 
of the player’s choice depends on the opponent’s dis- 
count rate, but not on the player's. 

John Harsanyi (1973) proves that, in a static game 
context, a mixed strategy equilibrium in a game with 
deterministic payoffs can be approximated as a pure 
strategy equilibrium in a game with randomly disturbed 
payoffs. 
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IV. Renegotiation-Proof 


'This section examines the credibility of 
optimal temporary protection by invoking 
* renegotiation-proof," the concepts recently 
proposed by, among. others, Farrell and 
Maskin, and Pearce. Although they devel- 
oped their theories in the context of in- 
finitely repeated games, one can easily apply 
these concepts to the liberalization game due 
to its recursive structure. 

A noncooperative solution like a subgame- 
perfect equilibrium is sometimes interpreted 
as a “self-enforcing agreement.” Imagine the 
situation where players can freely discuss 
their strategies, but cannot make binding 
commitments. Then any viable agreement 
must be self-enforcing: that is, once players 
agree to play equilibrium strategies, no player 
has an incentive to renege. In games with 
many equilibria as the liberalization game, 
this interpretation is appealing as a way of 
explaining how players know which equilib- 
rium is to be played. Moreover, in the liber- 
alization game, if the government and the 
firm were barred from communicating with 
each other, they could hardly coordinate their 
strategy choices so that one of the cyclical 
equilibria given in Proposition 2 would 
emerge. However, this ability to communi- 
cate itself would make these equilibria un- 
tenable. To see why, consider a subgame- 
perfect equilibrium supporting immediate 
liberalization. From (Pl) (P2) (P4) and 
(P6), 


(9) X(4) >¥,(0), 
X,(q) > Y,(0) for alll <q < q*; 


that is, immediate liberalization is Pareto- 
dominated by other subgame-perfect out- 
comes. Therefore, it is hard to imagine that 
the government and the firm agree upon this 
outcome. But, once they recognize this, other 
subgame-perfect equilibria would become 
also untenable. More specifically, consider 
optimal temporary protection in Proposition 
3. If the firm reneges and does not invest in 
period 1, the government is supposed to 
liberalize in period 2. But, at the beginning 
of period 2, the firm could make a proposal 
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of moving to an alternative subgame-perfect 
outcome which is more attractive to both the 
government and the firm than immediate 
liberalization. If the firm realized that it could 
renegotiate with the government in this way, 
it might find not to invest advantageous. 
Therefore, optimal temporary protection 
equilibria would collapse. Likewise, all equi- 
libria in Proposition 2 have this problem. 
Farrell and Maskin propose a credibility cri- 
terion to rule out this sort of equilibria.!’ 

In objecting to this one might argue that 
the firm cannot effectively negotiate with the 
government when it failed to invest, for the 
following reason: For example, consider the 
case q* =1. There are only two pure strategy 
subgame-perfect outcomes: immediate liber- 
alization and optimal protection. To ask the 
government to wait for one more period, the 
firm needs to argue that optimal protection 
is a credible outcome. But this requires that 
immediate liberalization also be credible, 
since the credibility of optimal protection 
depends on that of immediate liberalization. 
In other words, in order to renegotiate away 
from immediate liberalization, the.firm has 
to argue that it is not possible to renegotiate 
away from it. 

However, the firm can negotiate to aban- 
don the immediate liberalization equilibrium 
in favor of the stationary subgame-perfect 
equilibrium discussed in Section III. From 
(7) and (8), this equilibrium (weakly) Pareto 
dominates immediate liberalization and its 
credibility does not rely on that of immedi- 
ate liberalization. If the firm can renegotiate 
away from liberalization this way, then all 
equilibria in Proposition 2, and therefore 
optimal protection, are not credible. This is 
in the spirit of Pearce's concept of renegotia- 
tion-proof. 

In summary, optimal temporary protec- 
tion is not renegotiation-proof in the sense 
of Farrell and Maskin. It is not renegotia- 
tion-proof in the sense of Pearce when mixed 
strategies are allowed for. 


In the terminology of Farrell and Maskin, all pure 
strategy equilibria of this game are not “weakly renego- 
tiation-proof.” 
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V. Concluding Remarks 


In this paper, I have formalized dynamic 
inconsistency of optimal temporary protec- 
tion, by constructing a simple, infinite hori- 
zon, perfect information game of timing in 
which the domestic government and the do- 
mestic firm move alternately. I: examined 
subgame-perfect equilibria under the as- 
sumptions which guarantee that optimal 
temporary protection can be supported by a 
Nash equilibrium. Subgame perfection helps 
to eliminate all but a finite number of equi- 
libria, but, rather surprisingly, optimal tem- 
porary protection can be supported by a 
subgame-perfect equilibrium. This is because 
the threat of period 2 liberalization can be 
made credible through constructing a partic- 
ular cyclical form of strategies after period 2. 
I do not find this equilibrium compelling. 
One reason is that it is not renegotiation- , 
proof. There exists a unique stationary sub- 
game-perfect equilibrium in mixed strategies. 
In this equilibrium, the government’s payoff 
is smaller than in the first best outcome, due 
to its inability to make a credible commit- 
ment. 

However, I do not mean to say that the 
government cannot make a credible commit- 
ment to the future liberalization in reality. 
There are several possibilities which the lib- 
eralization game fails to take into account. 
First, the government might be aware of 
“the demonstration effect” of liberalization. 
If infinitely many industries ask for import 
relief sequentially, the government can and 
has incentive to build a reputation to be 
tough through a similar mechanism as the 
Folk Theorem in the repeated Prisoner’s 
Dilemma game. Such a reputational mecha- 
nism can be designed in a renegotiation- 
proof way. Second, the government might be 
able to sign a contract with a third party 
(perhaps, the GATT) to make the cost of 
postponement of liberalization (or renewal 
of temporary protection) prohibitively high. 
Third, the domestic government might want 
to ask a foreign government to exert diplo- 
matic pressure to liberalize the domestic 
market (as some observers suspect that 
Japanese Ministry of International Trade and 
Industry (MITT) has done with the United 
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States), Of course, these mechanisms should 
be analyzed more formally by explicitly 
modifying the game between the government 
and the firm, which I will leave on the agenda 
for future research. 

Throughout the paper, I have asked 
whether a particular equilibrium is reason- 
able by investigating its credibility. An alter- 
native criterion of “reasonableness” would 
be its robustness. If a set of possible equilib- 
ria disappears when the game is slightly per- 
turbed in a natural way, one cannot rely on 
such equilibria in prediction. For one can 
never be sure of the exact nature of the 
games governments and firms play in reality. 
Thus, it is highly desirable to examine the 
robustness of equilibria by, say, introducing 
informational asymmetry into the game. For 
example, one can consider the case where, 
with some probability, the firm’s investment 
effort might fail to materialize and the gov- 
ernment cannot observe the firm’s action. 

Another natural extension would be to 
allow the firm to accumulate capital gradu- 
ally. However, such a modification would 
make the model intractable, since one can- 
not invoke recursivity when each stage game 
depends on the level of capital stock. One 
way of getting around this difficulty is to 
assume that the number of periods is finite: 
see Tornell (1987). I find, however, that the 
finite period assumption is at least as unreal- 
istic as the binary restriction of the firm’s 
action space. 

Perhaps the most problematic feature of 
the model is its treatment of the domestic 
government as a unified, coherent body of 
decision makers. In reality, any economic 
policy is a product of complex interactions 
among different parts of the government, 
each of which has its own objectives. In fact, 
a number of recent studies deal with the 
political economy of protectionism, as exem- 
plified by Robert Baldwin (1985). The ap- 
proach adopted here should be regarded as a 
complement of, not a substitute for, this 
growing body of literature. 

Finally, some mention should be made 
regarding two other possible solutions to 
avoid permanent protection. First, some ob- 
servers suggest that temporary protection 
should be granted to domestic industries only 
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when they commit to adjustment plans. This 
view is reflected in a number of recent con- 
gressional proposals to reform the U.S. es- 
cape clause. In the context of the liberaliza- 
tion game, this amounts to enforcing the 
domestic firm to invest during the protection 
period. This approach, however, requires the 
government to participate actively in the re- 
tooling process of the industries. The danger 
of such government intervention in practice 
might be substantial and it is well docu- 
mented.!® Second, one might hope that the 
danger of prolonged protection can be re- 
duced by designing appropriate institutional 
arrangements. In fact, it is often argued that 
the escape clause itself is “a useful safety 
valve for protectionist pressures (Kenneth 
Dam, 1970, p. 106).” However, this ap- 
proach is, at best, an imperfect solution to 
the problem. As long as temptations to pro- 
vide further protection remain, there exists a 
tendency to circumvent the institutional ar- 
rangements, as suggested by the recent drift 
of U.S. practice away from the escape clauses 
in negotiating orderly market arrangements 
and voluntary export restraints. 


See U.S. Council of Economic Advisors (1984, pp. 
108-9) and Lawrence and Litan (1986, pp. 84-96). 
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Differential Payments Within a Bidder Coalition 
and the Shapley Value 


By DANIEL A. GRAHAM, ROBERT C. MARSHALL, 
AND JEAN-FRANCOIS RICHARD* 


Bidder coalitions at English auctions frequently distribute collusive gains among 
members via a secondary auction or "knockout." When coalition members are 
sufficiently heterogeneous, nested coalition structures are observed in which a 
knockout is conducted at each level of nesting. The nested knockout’s characteris- 
tics are investigated. Within many settings we find that the expected payments to 
coalition members via the nested knockout equal the Shapley value. Incentive 
compatibility problems of the nested knockout are also analyzed. 


Our understanding of auction schemes has 
progressed significantly since William Vick- 
rey's (1961) seminal work. Recent research 
has focused on the optimal design of auc- 
tions as well as the strategic behavior of 
bidders and auctioneers within specific envi- 
ronments. A rich set of results has emerged 
from these investigations. However, one fre- 
quently observed characteristic of strategic 
behavior remains relatively unexplored to 
date— bidder collusion. 

Within the independent private values 
model (IPV) it is well known that the domi- 
nant noncooperative strategy for each bidder 
at an oral ascending bid (English) auction is 
to remain active until the bid reaches his or 
her personal valuation for the item being 
auctioned. Similarly, at a second price auc- 
tion it is a dominant noncooperative strategy 
for each bidder to submit a bid equal to his 
personal valuation. Restricting attention to 
the English and second price auctions, it is 
also well known that outcomes attainable 
through collusive behavior strictly dominate 
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those associated with noncooperative’ play. 
Intuitively, a bidder coalition or ring, which 
consists of k of the n bidders attending an 
auction, can expect to gain, relative to non- 
cooperative behavior, by suppressing k —1 
of the competitive bids at the main auction. 
Daniel A. Graham and Robert C. Marshall 
(1987), hereafter GM, show that when the 
members of a bidder coalition are homoge- 
neous it is possible for them to organize 
themselves in such a way that participation 
is mutually advantageous and yet there is no 
advantage to cheating. These facts corre- 
spond well to many of the *stylized facts" of 
ring behavior summarized in Section I. In 
practice, rings allocate items won at a main 
auction through a secondary auction called a 
knockout. If member types are considered to 
be relatively homogeneous, then only a sin- 
gle knockout is observed. In this situation 
the difference between the price paid for the 
item at the knockout and the price paid for 
the item by the coalition at the main auction 
is equally divided among all coalition partici- 
pants. Within the IPV model where bidders 
draw their valuations from the same distri- 
bution, GM have proposed a mechanism 
through which the gains from coalition for- 
mation are equally distributed among all ring 
members. 

In practice, when ring members are het- 
erogeneous, it is common to observe a nest- 
ed coalition structure. This structure facili- 
tates differential payments to ring members. 
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Specifically, a knockout is conducted at each 
level of nesting—the details of the procedure 
are described in Section I. 

The goal of this research is to investigate 
the characteristics of the nested knockout 
and the factors underlying its use by practic- 
ing bidder coalitions. Attention is devoted to 
a single-object English (or second price) auc- 
tion. To facilitate an initial understanding of 
the nested knockout, in Section II we assume 
that the valuations of bidders are commonly 
known by all bidders and that the levels of 
nesting equal the number of bidders within.a 
ring. Representing this complete information 
game in characteristic function form yields 
a remarkable result—the nested knockout 
produces an imputation that not only be- 
longs to the core of the cooperative game but 
also is identical to the Shapley value of the 
game. This particular distribution of ring 
benefits is also explained in terms of Roger 
B. Myerson’s (1980) “cooperation structures” 
as the result of cooperative agreements in 
which the parties to the agreement share 
equally the benefits derived from the agree- 
ment. 

In Section III we not.only adopt a more 
realistic setting in which valuations are pri- 
vate information but we assume, as observed 
in practice, that the knockout is conducted 
after the main auction (i.e, an ex post 
knockout) and without the use of the budget 
breaking ring “center” employed by GM. To 
simplify the analysis it is assumed that the 
ring members draw their valuations from the 
same distribution or, in other words, that 
they are ex ante homogeneous. The knock- 
out is modeled as one in which the low bid is 
divided among all members, the difference 
between the second lowest and the lowest is 
divided among all members save the one 
who submitted the lowest bid, and so forth. 
Obvious incentive compatibility problems 
arise in this context. Specifically, low-valued 
bidders have an incentive to bid in excess of 
their true valuations at the post-auction 
knockout in hopes of enlarging their share of 
the “pie.” We investigate the stategic behav- 
ior of ring participants when side payments 
are determined by the nested knockout in 
both a simultaneous and sequential game 
framework. Surprisingly, in both the simul- 
taneous and sequential versions of the nested 
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knockout, a ring member with valuation v 
receives equal ex ante expected payoffs (for 
any value of v). This result thus extends the 
GM “equal shares” result to mechanisms 
that are far more descriptive of the nested 
knockouts used in practice. 

Some initial results are presented in Sec- 
tion IV regarding the unequal sharing of. 
expected ring benefits among ex ante het- 
erogenous bidders, that is, bidders who draw 
their valuations from different, but com- 
monly known, distributions. These ring 
members adopt an incentive efficient and 
durable mechanism, in the sense of Bengt 
Holmstróm and Roger B. Myerson (1983), in 
which payments differ among the ring mem- 
bers—the payment to a particular ring mem- 
ber depends upon the distribution from 
which he draws his valuation. It is demon- 
strated that these payments yield an imputa- 
tion that is once again identical to the Shap- 
ley value of the characteristic function game 
based upon expectations of payoffs, pro- 
vided that cooperation by ring members en- 
tails agreements to share equally the gains 
resulting from their cooperation. This result 
is also related to that obtained by Myerson 
(1980). 


I. The Stylized Facts of Bidder Coltusion 


The character of a bidder coalition de- 
pends upon the type of object being sold. 
Ralph Cassady, Jr. (1967, ch. 13), discusses 
the many types of rings found throughout 
the world. Below we provide the stylized 
facts of cooperative behavior at an English 
auction where a nondivisible item is being 
sold. This account is not conjecture but is 
based, in fact, upon information provided by 
both auctioneers and ring members. For fur- 
ther details we refer the interested reader to 
Graham and Marshall (1985). GM explains 
Facts 1-6 within the context of a single 
object English auction where bidders' valua- 
tions are independently and privately drawn 
from the same distribution. The present re- 
search focuses on Fact 7. 


l. Rings exist and have a stable form of 
organization over time. 

2. Rings adopt strategies that eliminate 
meaningful competition among members 
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at the main auction and yet ensure that 
no item will be sold to a non-ring bidder 
or be retained by the auctioneer at a price 
below the maximum of the individual ring 
members’ personal valuations. 

3. Rings have open membership policies in 
the sense that bidders who are expected 
to be competitive at the main auction are 
invited to join. 

4. The auctioneer responds strategically to 
the existence of a ring. 

5. Rings attempt to conceal their existence 
from the auctioneer. 

6. The benefits of ring formation are shared 
among the members rather than, for ex- 
ample, accruing entirely to the ring mem- 
ber who ultimately obtains possession of 
the item. 

7. Since the expected benefits to the ring of 
a particular bidder’s membership often 
varies across bidders, the ring frequently 
adopts a procedure by which the benefits 
of ring formation are shared unequally 
among its members. A nested coalition 
structure is frequently observed in which 
the individuals expected to make the 
greatest contribution, ex ante, are typi- 
cally found in the innermost ring, and 
those expected to make the smallest con- 
tribution are found in the outermost ring. 


The details of the nested knockout are 
best described with an example. Suppose 
that a coalition of ten bidders has formed 
and appointed a sole bidder to bid on their 
behalf at the main auction. Suppose further 
that the sole bidder has won the item at the 
main auction for po. These ten bidders be- 
come the members of an initial or outer ring. 
Four levels of nesting develop: within the 
initial ring with ten members there is a ring 
of size eight, within the ring of size eight 
there is a ring of size six, and within the ring 
of size six there is a ring of size four. Ignore 
for the moment the decisions of individuals 
to participate in these rings—this will be 
addressed in Sections II and III—and call 
these rings R,, Ra R, and R,, respec- 
tively. Note that the rings are nested in the 
sense that R;,, C R; Finally, let RNR;,, 
denote those bidders who are members of 
ring R, but not members of ring R;,, 
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The nested coalition structure makes pos- 
sible a series of knockouts 1n which English 
auctions are used to determine the ultimate 
ownership of the item. In the first knockout 
the members of R, bid collusively against 
those in RN Ra. In practice the members of 
R, would appoint a sole bidder who would 
remain active in the bidding up to the high- 
est valuation within the members of R,. The 
members of RN R, would bid competitively. 
Let p, denote the winning bid. The individ- 
ual (or sole bidder) who bid p, wins the 
item in the first knockout and pays p, for it. 
The “surplus” p,— p, is then equally di- 
vided among all members of R,. ( po is paid 
to the sole bidder of the original coalition 
who paid this amount to the auctioneer at 
the main auction for the item.) If the win- 
ning bid came from a member of RNR,, 
then the knockouts are finished. On the other 
hand, if p, were submitted by the sole bid- 
der of R,, then the item becomes the prop- 
erty of R, and another knockout takes place. 
In the second knockout only members of R, 
participate— bidders in R,\ R, have left with 
their side payments by the time this knock- 
out occurs. Now the sole bidder for R, will 
bid against the members of RR. If Pr 
denotes the winning bid then p,— p, is 
equally divided among all members of R,. If 
p, came from a member of R,/R;, then all 
is done—if not, the item becomes the prop- 
erty of R4 and another knockout will be 
conducted. It is possible for the knockouts 
and side payments to continue in this man- 
ner until the members of R, have acquired 
the item. Should this happen, the members 
of R, would then conduct the final knock- 
out, bidding competitively among themselves 
to determine ultimate individual ownership 
of the item. 

Table 1 illustrates the side payments pro- 
duced by the nested knockout for the case in 
which the item reaches the final knockout by 
R,. These payments sum to p, — po. In addi- 
tion, the ultimate winner of the item receives 
the usual surplus: the winner's valuation, v, 
minus p,. Consequently, the winner in ring 4 
receives 


Ring 4 Winner: x — x, (v— p,). 


This illustrates the nested knockout. A gen- 
eral definition will be provided in Section II. 
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TABLE 1—SIDE PAYMENTS IN THE NESTED KNOCKOUTS 
































Knockout RNR: RAR; RAR, R4 
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9 8 8 8 
P7 P2 P37 Pa 
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i n i 3 — PB =P 
Total xf x= X ince: ae d é : X4 yt 2 


The use of the nested knockout by practic- 
ing bidder coalitions leads to a number of 
questions. Does the nested knockout possess 
unique properties that recommend its use 
relative to other surplus sharing mecha- 
nisms? What type of bidding behavior would 
be induced by a nested knockout given that 
high losing bids are advantageous to low-val- 
ued ring members? If incentive compatibility 
cannot be obtained within the nested knock- 
out as described above, then is it possible to 
construct an incentive compatible and eff- 
cient mechanism for the allocation of the 
collusive gain? In the following sections these 
questions are addressed. First, in Section II 
we characterize the payments of the nested 
knockout in an environment that allows us 
to suppress incentive compatibility concerns. 
In Section III we move to an IPV framework 
and investigate bidding behavior for two 
variants of the nested knockout. Finally, in 
Section IV we pose a variant of the nested 
knockout where payments are made ex ante 
by a ring "center" instead of being deter- 
mined ex post via ring members' bids. This 
mechanism is. both incentive efficient and 
durable. 


H. The Complete Information Auction Game 


-~ We initially consider a highly simplified 
knockout game in which all of the ring mem- 
bers at an English auction have valuations 
that are known to one another. This “public 
values" model is extremely unrealistic but (i) 
illustrates an important feature of the nested 
knockout procedure and (ii) will be relaxed 





in the next section to allow for valuations to 
be private information. For notational con- 
venience we assume that the n, initial ring 
members are ordered by their valuations so 
that 


(1) D»QmUQmco EU. 


Suppose that the ring has acquired the 
item at a price equal to pọ. Given the com- 
plete information of members valuations we 
may suppose that po x v, and let n denote 
the number cf ring members whose valua- 
tions exceed pp: 


n = max(k|o, > po}. 


These n bidders will be the only active 
participants of the ring in the complete in- 
formation auction game. The nested knock- 
out imputation for this game is easily de- 
fined. Since valuations are public knowledge, 
perfectly fine nesting is possible. By perfectly 
fine we mean that ring R, will contain 
(L..,n), R, will contain (1,...,n —1), 
etcetera. The kth knockout then pits the 
members of R,,,— (L...,n— k} against 
RNR,QQ,7Í(Kk). The imputation resulting 
from the nested knockout procedure gives 
bidder i i 


n—i - 
U;—U, P 
j j+1 n 0 
x= ) por meno 


jei J 


fori «n, 


U, 7 Po. 
X = 
n n , 
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so that 


mE: 


x fori<n. 


i Xii gea 

Note that the conventional specification of 
the characteristic function would assign the 
maximin value to a coalition S in the game 
played between S and the complementary 
coalition N — S. However, since the max- 
imin value is zero for any ring that is not 
all-inclusive this approach would yield. the 
rather uninteresting characteristic function 
that assigns zero to all coalitions other than 
N. Our specification of the characteristic 
function, on the other hand, is based upon 
the fact that it is a dominant strategy in the 
game between S and N — S for S to remain 
active until the bidding reaches 


max V; 
ies 


and for N— S to remain active until the 
bidding reaches 


max vj. 
jes 


Since the payoff to S under these domi- 
nant strategies is 
2 S) = max, max v; — max v;,0 
(2) w(S) = max{ max v, — max v;,0) 
VS CN, 
we take this to be the relevant characteristic 
function for the complete information game. 


Given the indexing convention (1), the char- 
acteristic function can also be expressed as 


(3) w(S) 


0 iles 


=({v,—0, if1,2,..,i-1eS$ 
and iS 


Recall that an imputation, X, is in the 
core of w iff 


Y xz 


ies 


w(S) WSCN. 
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It follows that . 


THEOREM 1: The nested knockout PENN 
tion is in the core of the complete information 
auction game. 


PROOF: 
See Appendix, Part A. 


The nested knockout imputation, which 
gives larger payments to bidders with larger 
valuations, belongs to the core. An equal 
division of w( N) among the members of the 
ring, on the other hand, does not generally 
belong to the core. Suppose, for example 
that n = 3, the valuations are 


(vi, va, 03) = (12,7,3) 


and pọ = 0. Then the characteristic function 

for this game is 
w({1}) =5 

»((1,2)) «9 


w({2}) =0 
w({1,3}) =5 
»((1,2,3)) =12 


w({3}) =0 
w({2,3}) = 


and the nested knockout imputation is 
(x, X35; x3) = (8,3,1). 


An equal division of w({1,2,3}) would 
give each bidder a payoff of 4. That this is 
not in the core follows from the fact that 
w({1,2}) 7:9 > 4-4. On the other hand, the 

*winner take all" imputation, (12,0,0) in 
this example, is in the core. The nested 
knockout imputation has additional proper- 
ties which distinguish it from this and other 
core imputations. 


VThe set of imputations in the core is defined by the 
inequalities x, > 5, x; > 0 and 12 2 x, + x, > 9 together 
with the identity x, + x4 + x, —12. It follows, in partic- 
ular, that we can have x, > x, or x4 > x; but not both. 
When the valuations are not too far apart, the core 
includes "decreasing" imputations, for example, if 
(01, 05, 04) = (12,9,7) then the imputation (3.5,4,4.5) is 
in the core. 
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It is also useful to compare the nested 
knockout imputation to the Shapley value 
for this game. Let y, denote the payoff of 
bidder i in the Shapley value of game w. 
Surprisingly, perfect nesting under complete 
information yields the Shapley value for the 
game: i 


THEOREM 2: y, x, Vi 


PROOF: 
See Appendix, Part A. 


Two interesting consequences of this theo- 
rem are worth noting. Characteristic func- 
tion games with "conference structures" have 
been examined by Myerson (1980).? A con- 
ference is any set of two or more players 
who might meet together to discuss their 
cooperative plans. A conference structure, 
Q, then is any collection of conferences. Two 
. players are connected in a given conference 
structure if they can be coordinated eitlier 
by meeting together in some conference or 
by meeting in separate conferences that have 
one or more members in common to serve as 
intermediaries or by some longer sequence 
of overlapping conferences in the given con- 
ference structure. An allocation rule is a map 
that sends each possible conference struc- 
ture, Q, into a feasible allocation of payoffs 
to players, x,(Q), i1 n. An allocation 
rule is fair if any two players always share 
equally the gain associated with their form- 
ing a conference. An allocation rule is stable 
if no player is ever harmed by the formation 
of a conference that includes him. Myerson's 
principal result is that there i$ a unique fair 
allocation rule and that this fair allocation 
rule for any given conference structure coin- 
cides with the Shapley value for the game, 
which is altered by requiring that players can 
only coordinate through the conferences in 
the given conference structure.? Myerson also 


?See also Myerson (1977). This paper examines “co- 
operation structures" in characteristic function games 
with transferable utilities. À cooperation structure is a 
special case of a conference structure in which every 
conference has exactly two members. 

Let S/Q denote the partition of S defined by the 
requirement that the subsets are the maximal connected 
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TABLE Z— THE FAIR ALLOCATION RULE 


Q x(Q)  x(Q)  x(Q 


— 
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Uu 
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shows that the fair allocation rule is stable if 
the game is superadditive. 

The connection of Myerson’s results to 
our own is immediate. Since the nested 
knockout imputation and the Shapley value 
of the complete information auction game 
coincide (Theorem 2) and since the charac- 
teristic function of this game is superaddi- 
tive, we may conclude that the nested knock- 
out imputation is consistent with the unique 
fair and stable allocation rule for the com- 
plete information auction game. This unique 
fair allocation rule is illustrated in Table 2 
for the example given earlier.‘ 

A fair allocation rule gives equal benefits 
to the members of each conference— all 
members of any given conference would gain 
equally from its formation or, equivalently, 
would lose equally from its dissolution. The 
gain, for instance, to forming {1,2} if no 
other conference exists is 4 in the example, 
and this is shared equally by players 1 and 2. 
Note that the allocation rule is stable—add- 
ing (1,2), for example, to (1,3) and (2,3) 
helps both 1 and 2 (but hurts 3). 

Overall, although there are other imputa- 
tions in the core, the nested knockout impu- 


coalitions that can be coordinated if the piayers meet 


only in the conferences of Q. Then the altered game is 
defined by 


w(S/Q)= VS c N. 


i, (S) 
S,es/g 

“The last conference structure listed, {1,2,3}, is 
equivalent to ({1,2},{1,3},{2,3}} in the sense that 
S/{{1,2}, (1,3), (2,3)) =S/{1,2,3} VS c N, where 


N/Q= ( ( jli and j are connected by Q)i €N ). 
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tation is the unique fair and stable allocation 
rule and, consequently, the Shapley value as 
well. 


Ill. The Post-Auction Knockout 


Now suppose that we relax the assump- 
tion that the valuations of members are pub- 
lic knowledge within the coalition and adopt 
instead the IPV framework in which v, is 
private information to the pth bidder and 
modeled as an independent observation from 
the distribution function F(v). It proves no- 
tationally convenient to assume that F(v) 
can be characterized by a density function 
f(v). We shall consider successively a si- 
multaneous (single bid) and a sequential 
(multiple bid) version of the knockout. The 
equilibrium bids from these versions will be 
denoted, respectively, with superscripts “s” 
and “m.” Throughout the discussion it is 
assumed that the ring initially acquires the 
item at price po, which is assumed to be less 
than v, the highest valuation among . ring 
members.? 

It will be shown below that ring members 
with valuations less than py may neverthe- 
less find it (ex ante) profitable to participate 
in the knockout as long as their valuations 
exceed a reservation value v, x py to be 
determined. 

Following the seminal equivalence results 
in Myerson (1981), Milton Harris and Artur 
Raviv (1981) and, in particular, John Riley 
and William Samuelson (1981, Proposition 
1) revenue comparisons between auction 
schemes essentially require the evaluation of 
the corresponding reservation values, which 
are defined as the valuations below which it 
is unprofitable to submit a bid. In particular, 
equality of the reservation values associated 
with two distinct auction schemes entails 
their revenue equivalence. Hence, the com- 


This assumption will be justified in Section IV where 
a mechanism will be identified that assures that ring will 
only acquire items at prices that do not exceed the 
highest valuation among its members. Here we focus 
upon the behavior of ring members within the ring—the 
behavior of the ring (and its members) within the larger 
auction is the subject of Section IV. 


GRAHAM ET AL.: PAYMENTS WITHIN A COALITION 499 


parison below of the simultaneous and se- 
quential knockouts are based entirely upon a 
comparison of the reservation values associ- 
ated with each. Determining the relationship 
between these values is (nontrivially) estab- 
lished through a recursion argument. The 
bid functions that emerge from these deriva- 
tions.are of significant interest in their own 


rights. 
A. The Single Bid Knockout 


In this version of the knockout, members 
of the coalition are required to report a 
single bid to a “ring center.” The ring mem- 
bers are then ordered by decreasing bids so 
that 


(4) b,>b,>--: mb, 


The bidder with the largest report wins the 
item, and the total payoff to the ring of 

— Po is allocated as follows. The winner 
receives 


n b,— b. 
(5) X= 0 —bc Y M, 
x p=2 P 


and the rank / losing bidder, 7 1, receives 


n b —b 
(6) ae ln, 


where b, ,, = Po for notational convenience. 

As in Section I, we shall assume that ring 
members who find it unprofitable to partici- 
pate in the ex post knockout abstain from 
submitting bids. Hence, after these members 
have left the room, the » remaining active 
participants share the information that their 
n valuation are bounded below by v,(n, Po), 
the reservation value associated with n and 
Po, and for which we use the shorthand 
notation v,. Let 


Ellos) = b A 
, BEZEW FO) - F(v,)] 
[1- F(v,)]' 


+ 


psisn 
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denote the distribution of the pth largest 
valuation among / valuations drawn inde- 
pendently from the distribution F(-), condi- 
tionally on them all being larger mnt the 
constant v,. Let 


fi Cols) = p(5) 


x [1- F(v)]* ^ [F(v) - F(v)]^* 
[1- F( v,)]' 


x f(v) 


denote the corresponding density. Under 
these circumstances 


THEOREM 23: The (symmetric) Nash equi- 
librium bidding strategy for a bidder with valu- 
ation v in a ring with n active members is to 
submit a bid $3(v; v,) satisfying: 


AOA dt 


(7) Co; va) = To Eo) ' 


U > Ux, 


where the reservation value v, is zero if py < 
$2(0; 0) and is otherwise given by the unique® 
solution of the identity 


(8 po= Si (0s; Ue) = [Cerea 


SThe uniqueness of v, follows from the fact that the 
partial derivative of fy (tlv) w.r.t v, is negative for 
t< t, and positive for 2» tę, where t, > v, is such that 
nF(t,) -(n—2)-2F(uo,). Since the integral of the 
derivative from v, to oo is zero, it follows that the 
right-hand term in (8) is an increasing function of v,. If, 
for example, valuations are uniformly distributed on 
[0,1], then 


n-1 
nl 


H 





ntl n-i CN 
Po "41 <S po otherwise. 
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where py is the price at which the ring ac- 
quired the item. Hence v, € py. The corre- 
sponding ex ante expected payoff to the bidder 
is 


= bs — 
(9) P*(om)- —— 





+f — t) fP H{tlvy) dt, 


where by =£5(Ux, Ux) So that by= pg if 
Ux > 0. : 


The proof of this proposition is available 
on request from the authors. An example is 
provided in the Appendix, Part B. 

In words, the optimal bid for a player with 
valuation v is the expected value of the 
second highest valuation among n indepen- 
dent drawings from the distribution F(-) 
conditionally on (i) the n valuations being 
all larger than v, and (ii) the second highest 
being larger than v. If the ring has acquired 
the item at a price py 2 $2(0, 0), then the 
ex ante expected payoff to a ring member 
with valuation v is given by the expectation 
of the function max(0, v — t), where t is the 
highest valuation among n —1 independent 
drawings from F(-), conditionally on them 
all being larger than v. If instead py < 
§5(0,0), then participating ring members 
share an additional expected payoff given by 
b, — Po- l 

The Nash strategy (7) is monotone in v so 
that the ordering of the bids corresponds 
to that of the valuations and, in particular, 
the ring member with the largest valuation 
always wins the item. Furthermore, if 
Er (v|vx) <1, then 


(10) &(o; vy) > v, 


so that it is optimal for the ring members to 
overbid relative to their own valuations.’ The 


"At a first price auction (PV with homogeneous, risk 
neutral bidders), the symmetric Nash equilibrium bid 
strategy for a bidder with valuation v is equal to the 
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reason for doing so is obvious. A (mono- 
tone) strategy of underbidding cannot possi- 
bly be a Nash equilibrium since it implies 
that each term in the payoff functions (5) 


and (6) is nonnegative, including the term 


v, — b,. Hence it would pay for a ring mem- 
ber to raise his bid at least up to his own 
valuation since by doing so he would in- 
crease his share of the ring total payoff even 
if he turned out to be the winner. Given this 
overbidding, ring members face the possibil- 
ity that v, — b, might be negative. This re- 
strains them from raising their bids arbitrar- 
ily since doing so would increase their chance 
of winning the item and, beyond a certain 
level, would only reduce their expected pay- 
off. 

Similar considerations will play an essen- 
tial role in our analysis of the multiple bids 
knockout, where it will be shown that the 
sequential nature of the game makes it more 
attractive to risk higher bids during the ear- 
lier stages of the game— given the possibility 
of backing out at later stages— but at the 
same time raises the possibility that “loss- 
bidding" might occur at every stage of the 
game. i 

Note finally that decreasing n shifts the 
distribution function F7'(1|v,) to the left and, 
therefore, increases v, as defined in (8). It 
follows that no ring member who has de- 
cided not to participate would change his 
mind on the basis of the corresponding deci- 
sions of the other bidders. 


B. The Multiple Bid Knockout 


We now consider a sequential game con- 
sisting of n — 1 successive knockouts, each of 
which results in the elimination of one player 
from further contention. At each stage mem- 
bers submit bids with the difference between 


expected value of the second highest valuation from n 
given that the highest valuation equals v. The difference 
between this strategy and the one for the simultaneous 
knockout lies in the conditioning set. For the first price 
auction bidders attempt to win the item in order to 
obtain a positive surplus while at the simultaneous 
knockout it is advantageous to be the highest bidding 
loser. 
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the smallest bid in the current stage and the 
smallest bid in the immediately preceding 
stage being divided equally among the mem- 
bers participating at the current stage. In 
addition, all except for the member submit- 
ting the smallest bid advance to the next 
stage. As usual within the context of sequen- 
tial games, solutions are obtained by means 
of backward recursions. It proves notation- 
ally convenient to index the successive 
knockouts in reverse order and, more specif- 
ically, to use as index the number of players 
remaining in contention at the given stage of 
the game so that at stage /, the / remaining 
players are invited to submit bids and the 
one submitting the lowest bid is eliminated. 
During play of the game / runs backward 
from n to 1 and the last stage's winner gains 
possession of the item. Given this indexing 
convention, the payoff functions to the play- 
ers are still given by expressions (5) and (6) 
on the basis of the sequence of bids b 
l:n-2, where b? denotes the lowest bid 
observed during round /. An important issue 
concerns the information made available to 
the remaining participants at the end of each 
round of the knockout. We adopt a stylized 
version of the actual bidding process in which 
inner ring bids are handled by a sole bidder 
and b/ is the only round / bid that is 
revealed to the participants.? 

Let $/"(v) denote the round / (symmetric) 
Nash equilibrium strategy for the / players 
still in contention. Conceptually at least, $7" 
depends on the sequence of previously ob- 
served bids b... b7. It will be shown, 
however, that $/" depends only upon v, is 
monotonically increasing, and thus can be 


*The. bidders are also effectively indexed by / since, 
as we shall demonstrate below, by playing their sequen- 
tial Nash equilibrium strategy, they are eliminated in 
increasing order of their valuations. Note, however, that 
the bids that are generated by the Nash strategies in the 
course of the game need no longer increase. from one 
stage to the next. This is the relevance of “loss-bidding” 
in the present context. l 

?This assumption could be implemented by means of 
a thermometer device, such as that described in GM, 
which stops as soon as the lowest bidder withdraws and 
is reset at zero for each new round of the knockout. 


502 THE AMERICAN ECONOMIC REVIEW 


inverted. Let \7(b) denote the inverse func- 
tion. 

An important distinction emerges now that 
the bidding strategy changes at every round 
of the game. On one hand, the round / 
losing bid is given by 


(11) br —$7(vj) 


On the other hand, the / —-1 players remain- 
ing in contention at the end of round / 
observe b; and thus can base their next 
round decision on the information that their 
valuations (v,,...,¥,_,) are bounded below 
by v, which they can retrieve from the in- 
verse function A7(bj"). Furthermore, since 
the Nash strategies are monotone increasing, 
it follows that the players’ optimal bids at 
round / —1 are bounded below by bf, where 


Il: 2n. 


(12) bž=ġřP (v)  Ek2-nm. 


Note that (12) also applies for / — n -1 with 
0, 4.1 = Va Since at the opening of the knock- 
out the n players have the information that 
the valuations are bounded below by the 
appropriate reservation value. Under these 


circumstances: : 


THEOREM 4: The recursive symmetric Nash 
equilibrium bidding strategy for a bidder with 
valuation v who is active in round l is to 
submit a bid $/"(v) given by 


a) mo»-f "ifi ro) at, 


V È Vipi Ll: 2n, 


with a reservation value v, v, as defined 
in (8). The corresponding expected payoff for 
the rest of the game (including round 1) is 


* Am 
+1 br 


b 
(4) Y (uo) - 


+ f (0-8) A loi) dt, 


Urei 
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where 
bra = aoa) [:2n-1 
b= ir (v), l:2-n-1 
br Po 
bx = bł 


with bx as defined in Theorem 3. 


The proof of this proposition is available. 
on request from the authors. An example is 
provided in the Appendix, Part B. The criti- 
cal fact that the two versions of the knock- 
out share a common reservation value stems 
from the identity 


(15) ba (o) e iv; v). 


In words, the optimal bid for a player with 
valuation v who is active at round / is the 
expected value of the second highest valua- 
tion among / independent drawings from the 
distribution F(-) conditionally on these / 
valuations all being larger than v. The ex- 
pected payoff yet to accrue to the player is 
the expectation of the function max(0, v — 
t), where ¢ is the largest valuation among 
l —1 independent drawings from F(-), con- 
ditionally on them all being larger than v,,,, 
corrected by an equal share of the difference 
between the round / lower bound b, and 
the previous round lowest effective bid. b/5 ,. 

The interpretation of $" in expression 
(13) indicates that 


06) . gr (v) z £i) 
for all v and, in particular, for v = v, so that 
(17) br > b* 


[:2-n. 


Furthermore, since $/" , is monotone increas- 
ing, we also have 


(18) b" >b  bk2onm. 


However, as the example provided in the 
Appendix, Part B, illustrates, it is no longer 
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the case that bj., is necessarily larger than 
bi^, hence the possibility of “loss-bidding.” 

The comparison between the sequential 
game and the single bid game is now 
straightforward. 


COROLLARY 5: Under the conditions of 
Theorems 3 and 4 


(9) Pr(olo,) 
(20) Ff(olo)) > 0 (9) > zo) 


Q1) V; (olvis) > Fr (volor) 
l:22n-1 


-Vr(ww) Yo > 0» 


PROOF: 

(19) follows from Riley-Samuelson (1981, 
Proposition 1) given that the corresponding 
reservation values are equal; compare (7) 
with (14) for / — n. (20) follows from our 
interpretation of the results in expressions 
(7) and (13). (21) follows from the meen 
in expression (17). 


The revenue equivalence (19) implies, but 
is stronger than, the obvious result that un- 
conditionally on v, the ex ante payoffs are 
the same in both versions—a necessary con- 
sequence of the homogeneity of the players. 
Trivially, the latter imputation also corre- 
sponds to the Shapley value for the game 
with characteristic function: 


Q2) w(S) 


= E |max{ max o, - Ein 8) 


VSCN, 


where E denotes the expectation operator 
with respect to all valuations. 

Expression (20) demonstrates that the se- 
quential version of the game generates addi- 
tional overbidding compared to the single 
bid version in the opening round. Essen- 
tially, the possibility of revising one's bid 
during the later rounds makes it more attrac- 
tive to gamble for a larger first-round payoff. 
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Expression (21) indicates, however, that after 
the first round the multiple bid game be- 
comes less attractive to the remaining play- 
ers since they are now confronted at every 
round with the possibility of loss bidding. It 
is this second effect that prevents a player 
from arbitrarily raising his opening bid in 
the sequential game. Our equivalence result 
(19) indicates that the two. effects exactly 
offset each other ex ante relative to the sin- 
gle bid version of the game. Graphs of the 
bid functions for specific examples are found 
in the Appendix, Part B. ; 


IV. The Private Information Auction Game 


To this point we have demonstrated that 
the nested knockout has unique properties in 
a complete information environment (Sec- 
tion II), but that with incomplete informa- 
tion there is systematic overbidding (Section 
III). The latter result was derived under the 
assumption of ex ante bidder homogeneity 
—ring members draw their valuations from 
the same distribution. Given that nesting is 
used in practice when ring members differ 
from one another ex ante —draw valuations 
from different distributions—this leaves an 
important question unanswered. What re- 
sults. hold for heterogeneous bidders under 
incomplete information? In this section we 
propose a mechanism that a coalition of 
ex ante heterogeneous bidders at a second 
price auction can employ that induces ring 
members to report truthfully their valuations 
to the ring, assures that the highest reporting 
ring member will win an item obtained by 
the ring, pays members the Shapley imputa- 
tion, and creates no incentives for cheating. 

Suppose now that we extend the IPV 
model to allow for heterogeneous bidders by 
modeling the valuation of the ith bidder to 
be an observation from F,(v) for i= 
1,2,...,n. These distribution functions are 
assumed to be common: knowledge to. all 
bidders. For expositional convenience we re- 
tain the assumption that the auctioneer sets 
a reserve price of zero. 

As noted in Section III the post-auction 
knockout does not provide incentives for 
ring members to report their valuations 
truthfully. This represents no difficulty for 
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the case of homogeneous bidders since the 
monotonically increasing and symmetric bid 
strategy identified there assures that the bid- 
der with the highest valuation will ultimately 
win the item. Under heterogeneity, however, 
such a knockout gives rise to asymmetric bid 
strategies and positive probabilities of the 
item not being won by the bidder with the 
highest valuation. Such potential inefficien- 
cies reduce the expected payoff to ring for- 
mation and motivate the search for more 
efficient mechanisms. 

Consider a mechanism that requires the 
individual members of the ring to make prior, 
not necessarily truthful, reports regarding 
their private valuations to the ring-center. 
Then for any given vector of reports of pri- 
vate valuations from ring members, the 
mechanism must determine, perhaps’ ran- 
domly: (1) the recommended bid for each 
member to submit at the main auction, (2) 
the ring member that will ultimately receive 
an item that is won by the ring at the main 
auction, (3) the payments to collect from 
(make to) each member of the ring. 

Such a mechanism is called incentive com- 
patible if and only if it is a Nash equilibrium 
in the resulting game for each member to 
participate (the expected payoff to each 
member in this equilibrium is at least as 
great as the payoff that member could expect 
by not participating), to report his private 
valuation truthfully, and to submit the rec- 
ommended bid at the main auction. A given 
mechanism dominates another if the ex- 
pected payoff to each and every ring member 
is at least as great in the former mechanism 
as in the latter one, regardless of the ring 
members’ private valuations. An incentive 
compatible mechanism is called durable if 
the members of the ring would never unani- 
mously approve a change to another mech- 
anism even if they knew more than just 
their own valuations, that is, communication 
had occurred — see Holmstróm and Myerson 
(1983). he 

With these preliminaries in mind consider 
the following mechanism, which we call the 
second price pre-auction knockout or PAKT: 


The "ring center" makes a fixed pay- 
ment, P,, to each of the ring members. 
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prior to the main auction. (The magni- 
tude of these payments will be identi- 
fied shortly). Each of the n members of 
the ring submits a sealed “reported 
bid” to the ring center at this time. The 
member submitting the highest bid is 
selected by the ring center as the sole 
bidder and advised to bid on his own 
behalf at the main auction. Ring mem- 
bers other than the sole bidder are 
advised not to submit bids at the main 
auction. Should the sole bidder win the 
item at the main auction, he would pay 
the auctioneer the contracted price and, 
additionally, would pay the ring center 
the difference between the second high- 
est reported bid from the ring and the 
contracted price provided that this dif- 
ference is positive. 


This scheme assures the winning bidder of 
receiving a guaranteed fixed payment and 
paying a price equal to the second highest of 
all bids and reported bids for the item. Part 
of this price goes to the auctioneer and part 
to the ring center—one requirement for 
identifying the fixed payments will be that 
their sum equals the expected value of that 
portion of the price paid to the ring cen- 
ter. The ring center serves in this context as 
both “mediator” and “budget-breaker”; see 
Holmstrém (1982). Our interest in the sec- 
ond price PAKT stems from the following 
proposition. 


THEOREM 6: Truthful reporting is a domi- 
nant strategy for participants in the second 
price PAKT. 


PROOF: 
See Appendix, Part C. 


One obvicus implication of this result is 
that the bidder with the highest valuation 
from the ring will report this valuation to the 
ring center and subsequently submit this val- 
uation as a bid. Thus no bidder, ring or 
non-ring, will ever pay more for the item 
than his or her valuation. 

Given the dominant strategies of truthful 
reporting to the ring-center by members of S 
and to the auctioneer by members of N — S, 
we can again reasonably employ the defini- 
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tion of the characteristic function given by 
equation (3) with the understanding that the 
expectations operator now accounts for the 
heterogeneous distributions across bidders. 
Let y; equal the expected value of the dif- 
ference between the highest valuation and 
the second highest valuation from the ring 
given that the highest valuation belongs to 
the ith bidder. As before, let ẹ; be the 
Shapley imputation to the ith bidder. Then 
define the fixed payments for the second 
price PAKT as follows: 


(23) P,= 4;— Yi 


With these fixed payments the second price 
PAKT yields an imputation identical to the 
Shapley value. We also note 


THEOREM 7: With fixed payments corre- 
sponding to (23), the second price PAKT is an 
incentive efficient and durable mechanism. 


PROOF: 
See Appendix, Part C. 


Note also that the results of Myerson dis- 
cussed in Section III again imply that the 
allocation resulting from the second price 
PAKT is consistent with the requirements 
for both fair and stable allocation rules.!? 


V. Conclusion and Future Research 


Within the independent private values 
model the total amount available to a bidder 
coalition to divide among its members at 
either a second price or English auction is 
equal to the difference between the maximal 
valuation within the ring and the price paid 
for the item at the main auction, given that 
the difference is positive. The central issue of 
this paper has been to determine the charac- 
teristics of the mechanism used by practicing 
bidder coalitions to disperse this difference 
among its members. 


10-The susceptibility of both second price and English 
auctions to stable collusive behavior has been studied in 
GM. : 
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In order to obtain an initial understanding 
of the nested knockout, several special cases 
have been studied. The decomposition of the 
problem involves three main issues. First, 
payments to ring members could be made 
ex ante (before valuations are drawn by 
members) or ex post. Second, the bidders 
could be homogeneous or heterogeneous in 
terms of their distributional identity. Third, 
with respect to valuations drawn within the 
ring, the auction could be modeled as a 


“game of complete or incomplete informa- 


tion. . 

Graham and Marshall (1987) have already 
studied the ex ante, homogeneous, incom- 
plete information environment. In this study 
we find that the payments to ring members 
described by Graham and Marshall (1987) 
are the ex ante Shapley values of the bid- 
ders. 

In an ex post, heterogeneous, complete 
information auction game, it has been 
demonstrated that the nested knockout re- 
sults in side payments to ring members that 
are not only in the core but exactly equal to 
their Shapley values. Within a more realistic 
setting, the incomplete information auction 
game, it has been shown that homogeneous 
ring members overbid in both the simultane- 
ous and sequential versions of the ex post 
nested knockout. These results demonstrate 
the incentive compatibility problems of the 
nested knockout. Finally, in the incomplete 
information auction game with heteroge- 
neous bidders, an ex ante nested knockout 
allocation mechanism that awards ring mem- 
bers their ex ante Shapley values has been 
proposed. This mechanism also has the de- 
sirable characteristic of being both incentive 
efficient and durable (in the sense of Holm- 
stróm and Myerson, 1983). 

Although we have posed an ex ante mech- 
anism in Section IV that achieves efficien- 
cy subject to incentive compatibility con- 
straints, the most pressing issue for future 
research concerns the ex post use of a nested 
knockout by heterogeneous ring members in 
the incomplete information auction game. 
The main problem is to determine whether 
potential inefficiencies within the ring are 
unavoidable. Specifically, does there exist 
Nash equilibrium behavior that precludes the 
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possibility of a ring member without maxi- 
mal valuation from winning the object? If 
this possibility is not eliminated, then not 
only will the Shapley value be unattainable 
as a characterization of the resulting side 
payments to ring members but the coalition 
will not be maximizing its total payoff. This 
case is of particular interest since it most 
closely parallels the reality of practicing bid- 
der coalitions. 

Another issue for future research involves 


the characterization of the nested knockout : 


within a common values or general symmet- 
ric model setting. Bidder coalitions may not 
make use of ex ante payment schemes be- 
cause of the value of the information that is 
transmitted to members during the bidding 
at the main auction. Specifically, ex ante 
bidding. commitments (i.e., no one bids but 
the highest valued ring member) may not be 
self-enforcing within such a setting. 


APPENDIX . 
A. Results for the Complete Information Auction Game 


PROOF OF THEOREM 1: 
Recall that the valuations are ordered: 


PETERR T 
and the nested knockout imputation for the ring is 


Up 7 Obed 
X15 Xap t ’ 
" 





where y denotes the number of bidders with a valuation 
at least as great as the ith bidder. We need to show 


Y xzw(S) vsScN. 
ies 
Since x; 2 0 Vi and 
0 ifles 
w(S)=( 9-4, if12,.,k-1€8, 
and kéS 


we are done if 1 € S. Suppose then that 1,2,..., k -leS 


' Vt could, alternatively, be shown that the game w is 
convex. Theorem 1 would then follow from this fact and 
Theorem 2. 
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and k € S. Then 
l k-1 
L x= Z x; + L Xi 
ies i=] ies 
Pa i-k 
—pD-U.t Y X, 
. tes 
i>k 
= Vy Ug. 


PROOF OF THEOREM 2: 

Let y? denote the Shapley imputation to player i in 
the all inclusive, complete information auction game 
among players (1,2,..., p) with valuations v, 2 v > 
E, where p —1,2,..., n. We consider the addition ` 
of bidders to this all inclusive game. Consider now the 
recursion from p —1 to p. Bidder p comes in with the 
lowest valuation among the p players. He can enter in 
p different positions in each permutation of the p —1 
other bidders. Only if he is last is his marginal contribu- 
ticn to the coalition positive. This occurs ( p —1)! times, 
of which ( p — 2)! are against bidder i, i —1,2,..., p —1, 
in which case player i's marginal contribution is re- 
duced by v. Henze 











(p-2)! 
=p- L sf 22" 
yf = Vf = p! Up 
i=1,2,...,p—1 
~1)! 
u- V m 
4 p! 
-2 
P 
It follows from recursion that 
jf ud ne NN Ue? E Up 
t i (ilu (i+2)(i+1) p(p-1) 
oo Sis SA iu Fie 
i i4l p 
and thus that 
p-i 
V; — Va - 
y= Bl. j-12,24p-1. 
Jus d p 


This is precisely the knockout imputation when evalu- 
ated at p — n. 


B. The Post Auction Knockout: An Example 


The proofs of the propositions of Section III are some- 
what lengthy and have been omitted. They are, of 
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TABLE 3—-CONDITIONAL DENSITY FUNCTION 


PE)  PrEJ|v) JE) f (ele, E) Jsl, Ej) 
2v, 2(v-— v) 
E 1/3 vu 3v 19,1 a (0, v) Eg ONE (0,0) 
1 : 
E; 1/3 2v((1—-v) 6v(1—v)Ig Tole 51e.» 
2(v, — v) 2(- v) 
1/3 (Q-v 30-0)F, =I ———3 
E; / (=v) (1— vy To, 0oy O Go © 
course, available upon request from the authors. These find that 
results can be illustrated, however, by means of a simple 
example where our focal player with valuation v faces (26) E(x|E, v) =}4(30-1) 


two other players with ordered valuations V, > V,. The 
three valuations are drawn independently from the uni- 
form distribution on [0,1]. It is further assumed that 
Po = 04 = 0. 

Since the expectation of the highest of the three 
valuations is 3/4, it follows that the ex ante Shapley 
value for each of the three players is 1/4. 

Three events are considered in the discussion that 
follows. Let E, denote the event that the focal player is 
ith in the ranking, where i: 1 — 3. Conditionally on E;, 
the density of V and those of V, and V, given that 
V=v are given in Table 3, where 1,, ,, denotes the 
indicator function for the interval ]a, b 


We will now discuss in turn the (sequential) multiple- 


bid and the single-bid versions of the three-player 
knockout. 


The Multiple Bid Game. The first (1-3) and second 
(1 = 2) Nash equilibrium bid functions (13) are respec- 
tively given by 


Q4) iQ) =20.+ 2), 


tr (v) =4(142v). 


Their graphs are given by the two lines in Figure 1. 
For comparison, the curve illustrates the Nash equilib- 
rium strategy for the single bid game. 

Let x 2 ẹ(v, Va, Vp) denote the (random) payoff to 
our focal player with valuation v. Conditionally on Ej, 
i: 13, x is given by 


(25) E:xco-HT(E)-i8 (Vs), 
Ej x= 757 (0) - 887 (V), 
Ej x iB (v). 


On the basis of the density functions of V, and V,, as 
given in Table 3, we can evaluate the expectation of x, 
conditional on E; and v or conditional on E; only, and 


E(x|E) =$ «0.313 
E(x|E;,v) ^ À(To +2) 
E(x|E;) =} = 0.229 
E(x|Ej,0)=¢(v +1) 
E(x| Ej) =$ ~ 0.208. 
Unconditionally on E,, we find that 
Q7) E(x|v) = VP (v10) 
=}(1+20). 


The expectation of the latter expression over V obvi- 
ously coincides with the ex ante Shapley value of 1/4. 

The graphs of the expectations of x conditionally 
and unconditionally on E, are given in Figure 2. The 
unconditional expectations of the 1st, 2nd, and 3rd 
highest valuations are respectively 0.75, 0.50, and 0.25. 
Note that the ordering between the conditional expecta- 
tions E(x|E;, v) critically depends on v and reflects the 
importance of loss-bidding in the second round. A 
player, for example, with valuation v « 2/3 would bene- 
fit from being eliminated in the first round since he can 
only lose in expected payoff by surviving the round.? 


The Single Bid Game. The single bid Nash equilibrium 
strategy (7) is given by 


: 1142»43v 
ume qe. 


VAt v 72/3, E(x|E, v) = E(x| Ey, v). 
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round of 3 










round of 2 


valuations 


@ First round low bid divided among all three bidders - 


W Second round winning bid 
A Second round low bid 


FIGURE 1. EQUILIBRIUM STRATEGIES IN THE 
SINGLE- AND MULTIPLE-BID GAMES 


: The graph of 6C ) i is illustrated in Figure 1. Note 
that à 


(28) £7 (0,0) > £$(0;0) > 12 (o;0) voe (0,1), 


which nicely illustrates our earlier discussion of the two 
games. 

The conditional payoffs are. still given by. equation 
(25) except that the bid functions $/7' and $7" are now 
replaced by $3. A conceptually straightforward though 
somewhat tedious derivation leads to the following ex- 
pressions for the conditional payoffs. 


OE(x|E,v)- a^ aS est s eno), 


1 1+20+3v? 
a a DERE 


1 : 3 
—-—— |6v 424 —1n(142 
96 [se 2 : (1 ». 


1 1420430? 
Bees) eae ^ 

7 27. 
E(x) = zc + Ge m3 = 0.305, 


91 8 B 
E(X|£) =~ 154 03 = 0.253, 


13 


27 
E(x|E,) LB gu 0.193. 
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FIGURE 2. EXPECTED PAYOFF IN THE 
MULTIPLE-BID GAME 
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FIGURE 3. EXPECTED PAYOFFS IN THE 
SINGLE-BID GAME 





Note also that E(x|v) coincides with the expression 
in (27) as established in Corollary 5. The graphs of the 
expectations of x, conditionally and unconditionally on 
E;, are found in Figure 3. 


C. Results for the Private Information Auction Game 


PROOF OF THEOREM 6: 

In the event that the ring wins the item, the member 
submitting the highest reported bid is awarded the item 
and pays a total price equal to the second highest 
bid/reported bid of the N+1 bids (N — K non-ring 
bids at the main auction, K reported bids from the ring 
members and the reserve price bid of the auctioneer). 
The payment to each ring member is a fixed non-contin- 
gent constant and therefore cannot affect incentives. 
Since the second price PAKT requires the winning 
bidder to pay the second price of all bidders and 
payments to ring members do not affect incentives, it 
follows from the logic of Vickrey's (1961) seminal paper 
that it is a Nash equilibrium strategy for each member 


: of the ring to report his or her valuation truthfully to 


the ring-center and to follow his bidding recommenda- 
tions at the main auction. In fact, truthful revelation 
and compliance is not only a Nash strategy, it is a 
dominant strategy as well. 


PROOF OF THEOREM 7: 

Voluntary participation is also advantageous. Ring 
membership entails three possible outcomes for a given 
ring member. First, if the ring does not acquire the item, 
membership is advantageous since the ring member 
receives P, and would have obtained nothing acting 
individually. Second, if the ring wins the item but the 
item is awarded to another member, membership is still 
advantageous since once again the member receives P; 
and would have obtained nothing acting individually. 
Third, if the ring wins the item and it is awarded to the 
ring member then membership is still advantageous 
since the number pays precisely the same price as would 
have been necessary acting individually but again re- 
ceives P. Consequently, voluntary participation is also 
a dominant strategy. 

The equilibrium strategies assure that the ring will 
win the item in every circumstance in which some 
member has a valuation exceeding the cost of acquiring 
the item. Further, in every case in which the ring wins 
the item it is awarded to that member with highest 
valuation. The mechanism, in short, assures the ring 
members of the greatest possible joint payoff in each 
contingency. Consequently, there exists no other “bal- 
anced budget" mechanism, whether incentive compati- 
ble or not, that dominates the second price PAKT. 
Therefore, the second price PAKT is incentive efficient. 

Since the second price PAKT is incentive efficient 
and since participation and truthful revelation of valua- 
tions in the second price PAKT is a dominant strategy 
it follows that the second price PAKT is a durable 
mechanism (by Theorem 2 of Holmstróm and Myerson, 
1983). 
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The Economics of Modern Manufacturing: 
Technology, Strategy, and Organization 


By PAUL MILGROM AND JOHN ROBERTS* 


Manufacturing is undergoing a revolution. The mass production model is being 
replaced by a vision of a flexible multiproduct firm that emphasizes quality and 
speedy response to market conditions while utilizing technologically advanced 
equipment and new forms of organization. Our optimizing model of the firm 
generates many of the observed patterns that mark modern manufacturing. 
Central to our results is a method of handling optimization and comparative 
statics problems that requires neither differentiability nor convexity. (JEL 022) 


In the early twentieth century, Henry Ford 
revolutionized manufacturing with the intro- 
duction of his “transfer line” technology for 
mass production, in which basic inputs are 
processed in a fixed sequence of steps using 
equipment specifically designed to produce a 
single standardized product in extremely 
large quantities for extended periods of time. 
Although the specialization of Ford’s facto- 
ries was extreme—the plants had to be shut 
down and redesigned when production of 
the Model T was ended—the transfer line 
approach influenced generations of industri- 
alists and changed the face of manufacturing 
(see David A. Hounshell, 1984). 

In the late twentieth century, the face of 
manufacturing is changing again.’ First, the 
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Jonathan B. Lovelace, Professor of Economics, Gradu- 
ate School of Business, Stanford University, CA. 94305- 
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nomic Policy Research of Stanford University. We are 
grateful to Tim Bresnahan, Morris Cohen, Xavier de 
Groote, Victor Fuchs, John McMillan, Roger Noll, Mike 
Riordan, Nate Rosenberg, Ed Steinmueller, Hal Varian, 
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helpful discussions, comments, and suggestions. 

Probably no single firm is involved in all the changes 
we will describe. Nevertheless, there is a definite, dis- 
cernable pattern of change in technology, manufac- 
turing, marketing, and organizational strategy that char- 
acterizes successful “modern manufacturing.” For a 
description of the technologies involved, see U.S. 
Congress Office of Technology Assessment, 1984. 


specialized, single-purpose equipment for 
mass production which had characterized 
Ford’s factories is being replaced by flex- 
ible machine tools and programmable, multi- 
task production equipment. Because these 
new machines can be quickly and cheaply 
switched from one task to another, their use 
permits the firm to produce a variety of 
outputs efficiently in very small batches,” 
especially in comparison to the usual image 
of mass production (Nicholas Valery, 1987). 
Kenneth Wright and David Bourne (1988) 
report that in a recent survey of aerospace 


‘and other high precision industries 8.2 per- 
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cent of all batches were of size one and 38 
percent were sixteen or less. An Allen-Brad- 
ley Company plant making electric controls 
is reported to be able to switch production 
among its 725 products and variations with 
an average changeover time for resetting 
equipment of six seconds, enabling it to 
schedule batches of size one with relative 
efficiency (Tracy O'Rourke, 1988). Even in 
the automobile industry, flexible equipment 
has become much more common. Recently, 


?Optimal batch size can be determined via a stan- 
dard Economic Order Quantity model, in which the 
setup costs of switching from making one product to 
making another are traded off against the costs of 
holding the larger average inventories of finished goods 
that go with longer runs and less frequent changeovers. 
Optimal batch size is a decreasing function of setup 
costs and so batch sizes optimally decrease as more 
flexible machines are introduced. 
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General Motors’ engineers were able, for the 
first time in company history, to use a regu- 
lar, producing facility to make pilots of the 
next year's model cars. The engineers set the 
equipment to make 1989 models after work- 
ers left the factory on Friday afternoon, ran 
the equipment to manufacture the new mod- 
els over the weekend, and then reset the 
equipment to produce 1988 model cars so 
that regular production could be resumed on 
Monday morning (Thomas Moore, 1988). In 
contrast, with the older, less flexible tech- 
nologies that have been the norm in the 
industry, changing over to produce the new 
years models typically involved shutting 
down production for weeks. 

Flexible equipment and small batch sizes 
have been accompanied by other changes. 
Smaller batch sizes are directly associated 
with a shortening of production cycles and, 
with reductions in work-in-process and fin- 
ished goods inventories. Shorter product cy- 
cles in turn support speedier responses to 
demand fluctuations and lead to lower back 
orders. Indeed, a general strategic emphasis 
on speeding up all aspects of the firm's oper- 
ations is becoming common (Brian Du- 
maine, 1989). This is manifested in shorter 
product-development times, quicker order- 


processing, and speedier delivery, as well as . 


in producing products faster. Examples 
abound. General Electric has reduced the 
design and production time it takes to fill an 
order for a circuit-breaker box from three 
weeks to three days, in the process reducing 
back orders from sixty days to two (Du- 
maine). The Allen-Bradley plant nientioned 
above fills orders the day after they are 
received, then ships them that same day by 
air express (O’Rourke). Building on early 
development by Toyota, many manufactur- 
ers now plan production jointly with their 
suppliers and maintain constant communica- 
tion with them. This allows the downstream 
firms to replace inventories of components 
and supplies with “just-in-time” deliveries of 
needed inputs (James Abegglen and George 
` Stalk, Jr, 1985; Nicholas Valery, 1987). 
Combining flexible production and low fin- 
ished goods inventories with reliance on 
electronic data communications, Benetton 
maintains inventories of undyed clothing 
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(shirts, scarves, pullovers) and uses nightly 
sales data gathered at its automated distribu- 
tion center from terminals in individual 
stores to determine the colors it should make 
and where to ship its output (Tom Peters, 
1987). Strategies like this one require not 
only flexible equipment to produce the right 
products, but short production cycles so that 
products are available at the right time. The 
extreme of this line of development is pro- 
duction of previously mass-produced items 
on a make-to-order basis: Moore reports a 
widespread rumor that the GM Saturn pro- 
ject will involve cars being custom-built ` 
within days of receipt of customers’ com- 
puter-transmitted individual orders. 

The manufacturing firms that adopt these 
new technologies and methods appear to 
differ from traditional firms in their product 
strategies as well. Many firms are broaden- 
ing product lines, and there is a widespread 
increased emphasis on quality, both through 
frequent product improvements and new 
product introductions, and through reduc- 
tions in defects in manufacturing. Caterpillar 
Corporation’s $1.2 billion “Plant with a Fu- 
ture” modernization program has been ac- 
companied by a doubling of the size of its 
product line (Ronald Henkoff, 1988). Rub- 
bermaid insists that 30 percent of its sales 
should come from the products introduced 
in the preceding five years, and 3M Com- 
pany has a similar 25 percent rule for each 
of its 42 divisions, with 32 percent of its . 
$10.6 bilion in 1988 sales actually coming 
from products less than five years old (Rus- 
sell Mitchell, 1989). Meanwhile, reports of 
order-of-magnitude reductions in percentage 
defects are becoming commonplace. 

New organizational strategies and work- 
force management policies are also part of 
this complex of changes. Ford has adopted a 
parallel, team (rather than sequential) ap- 
proach to design and manufacturing engi- 
neering that, in conjunction with CAD/ 
CAM (Computer Aided Design / Computer | 
Aided Manufacturing) techniques, has cut 
development time on new models by one : 
third (Alex Taylor, III, 1988). AT&T suc- 
cessfully used a similar, multidepartmental 
team approach in developing its new 4200 
cordless phone (Dumaine), as did NCR with 
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its recently introduced 2760 electronic cash 
register (Otis Port, 1989b). Lockheed Corp.’s 
Aeronautical Systems Group has managed 
to reduce the time for designing and manu- 
facturing sheet-metal parts by 96 percent, 
from 52 days to 2; the project manager 
credits organizational changes (including the 
arrangement of workstations, redefinition of 
worker responsibilities, and adoption of team 
approaches) with 80 percent of the produc- 
tivity gain (Port, 1989a, p. 143; Warren 
Hausman, 1988). Motorola’s adoption of a 
pay scheme based on the skills employees 
acquire (rather than on their job assign- 
ments), its elimination of segmented pay cat- 
egories among production workers, and its 
giving workers multiple responsibilities (in- 
cluding having production workers do qual- 
ity inspections) is credited with major im- 
provements in quality (Norm Alster, 1989). 
A further auto industry example comes from 
GM’s massive investments in new technol- 
ogy, which have gone hand-in-hand with new 
supplier relations and more flexible work 
arrangements, as well as a broadened prod- 
uct line (General Motors Corporation, 1988). 

More generally, Michael J. Piore (1986) 
provides survey evidence from firms around 
Route 128 in Boston of wider product lines, 
shorter product life cycles, greater emphasis 
on product quality, increased reliance on 
independent suppliers and subcontractors, 
and a more flexible organization of work 
that is supported by new compensation poli- 
cies. Banri Asanuma, (1988a), has found 
similar trends among Japanese firms, Valery 
provides more anecdotal evidence drawn 
from a wide variety of industries: internation- 
ally, and earlier Piore.and Charles F. Sabel 
(1984) described related developments among 
small businesses in Italy and Austria. 

A striking feature of the discussions of 
flexible manufacturing found in the business 
press is the frequency with which it is as- 
serted that successful moves toward “the 
factory of the future" are not a matter of 
small adjustments made independently at 
each of several margins, but rather have in- 
volved substantial and closely coordinated 
changes in a whole range of the firm's activi- 
ties. Even though these changes are imple- 
mented over time, perhaps beginning with 
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“islands of automation,” the full benefits are 
achieved only by an ultimately radical re- 
structuring. Henkoff (p. 74) noted that one 
of the lessons of Caterpillar’s program was: 
“Don’t just change selected parts of your 
factory, as many manufacturers have done. 
To truly boost efficiency...it’s necessary 
to change the layout of the entire plant." 
The first lesson that Dumaine drew from 
studying successful adoption of speed-based 
strategies was to "start from scratch." In 
discussing the adoption of "computer inte- 
grated manufacturing" (CIM), Valery (p. 15) 
stated that "nothing short of a total over- 
haul of the company's strategy has first to be 
undertaken." And in a parallel fashion, Wal- 
ter Kiechel III (1988, p. 42) noted: “To get 
these benefits (of more timely operations), 
you probably have to totally redesign the 
way you do business, changing everything 
from procurement to quality control." 

This paper seeks to provide a coherent 
framework within which to understand the 
changes that are occurring in modern manu- 
facturing. We ask, Why are these changes 
taking place? Is it mere coincidence that 
these various changes appear to be grouped 
together, or is there instead some necessary 
interconnection between them and common 
driving force behind them? What are the 
implications of the changes in manufactur- 
ing technology for inventory policy, product 
market strategy, and supplier and customer 
relations? What are the implications for the 
* make or buy" and vertical integration deci- 
sions and for the structure of business orga- 
nization more generally? i 

Our approach to these questions is a 
price-theoretic, supply-side one involv- 
ing three elements: exogenous input price 
changes, complementarities among the ele- 
ments of the firm's strategy, and non-convex- 
ities. The first element is the effect of techno- 
logical change in reducing a set of costs. The 
particular ones on which we focus include: 
the costs of collecting, organizing, and com- 
municating data, which have been reduced 
over time by the development of computer 
networks and electronic data transmission 
systems; the cost of product design and de- 
velopment, which have fallen with the emer- 
gence of computer-aided design; and the 
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costs of fiexible manufacturing, which have 
declined with the introduction of robots and 
other programmable production equipment. 
We take these relative price reductions, 
whose existence is well documented, to be 
exogenous. 

The direct effect of any of these price 
changes individually would be. to increase 
use of the corresponding factor: for example, 
the emergence (ie., falling cost) of CAD/ 
CAM encourages its adoption, and the re- 
duced cost of designing and beginning pro- 
duction of new products directly increases 
the attractiveness of expanded product lines 
and frequent product improvements. How- 
ever, with several relative prices falling, there 
are multiple interactions, both among the 
corresponding technological factors and be- 
tween them and marketing and organiza- 
tional variables. These interactions give rise 
to indirect effects that might in principle be 
as large as the direct effects, and opposite in 
sign. Here, the second element-of our analy- 
sis appears: These indirect effects tend, in 
the main, to reinforce the direct effects be- 
cause the corresponding relationships are 
ones of “complementarity.” Here, we use the 
term “complements” not only in its tradi- 
tional sense of a relation between pairs of 
inputs, but- also in a broader sense as a 
relation among groups of activities. The 
defining characteristic of these groups of 
complements is that if the levels of any 
subset of the activities are increased, then 
the marginal return to increases in any or all 
of the remaining activities rises. It then fol- 
lows that if the marginal costs associated 
with some activities fall, it will be optimal to 
increase the level of all of the activities in the 
grouping. 

As an illustration, let us trace some of the 
indirect effects of a fall in the cost of com- 
puter-aided design (CAD) equipment and 
software that leads to the equipment being 
purchased. Some CAD programs prepare ac- 


tual coded instructions that can be used by 


programmable manufacturing equipment, so 
one effect of the adoption of CAD may be to 
reduce the cost of adopting and using pro- 
grammable manufacturing equipment. Since 
the prices of that equipment are also falling, 
the effects of the two price changes on the 
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adoption of that equipment are mutually 
reinforcing. Of course, CAD also makes it 
cheaper for the firm to adopt a broader 
product line and to update its products more 
frequently. If the firm does so, than an indi- 
rect effect is to make it more profitable to 
switch to more flexible manufacturing equip- 
ment that is cheaper to change over. So, this 
indirect effect reinforces the direct effects of 
the: changing input prices. With short pro- 
duction runs, the firm can economize on 
inventory costs (such as interest, storage, 
and obsolescence) by scheduling production 
in a way that is quickly responsive to cus- 
tomer demand. Such a scheduling strategy 
increases the profitability of technologies that 
enable quicker and more accurate order pro- 
cessing, such as modern data communica- 
tions technologies. So, another indirect effect 
of falling CAD prices coincides with the 
falling price of data communication equip- 
ment. Thus, CAD equipment, flexible manu- 
facturing technologies, shorter production 
runs, lower inventories, increased: data com- 
munications, and more frequent product re- 
designs are complementary. However, the 
complementarities do not stop at the level of 
manufacturing; but extend to marketing, en- 
gineering, and organization. 

The marketing side of the analysis in- 
volves two additional elements besides those 
already mentioned. First, more frequent 
setups lower the inventory necessary to sup- 
port a unit of sales and thus also the marginal 
cost of output. This encourages lower prices. 
The second element arises because buyers 
value fast delivery. If most customers have 
good alternative sources of supply and only 
a few are “locked in," then the resulting 
relationship between delivery time and de- 
mand is convex. In that case, reducing or 
eliminating production delays makes it prof- 
itable to reduce other sources of delay as 
well Then, computerized order processing 
and a fast means of delivery are complemen- 
tary to the quick responsiveness of the mod- 
ern factory to new orders. 

On the engineering side, as product life 
cycles become much shorter than the life of 
the production equipment, it becomes in- 
creasingly important to account for the char- 
acteristics of existing equipment in designing 
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new products. At the same time, the emer- 
gence of computer-aided design has made it 
less costly to modify initial designs, to esti- 
mate the cost of producing various designs 
with existing equipment, and to evaluate a 
broader range of potential designs. These 
changes have contributed to the growing 
popularity among U.S. firms of “design for 
manufacturability,” in which products are 
developed by teams composed of designers, 
process engineers, and manufacturing man- 
agers (Robert Hayes, Steven Wheelwright, 
and Kim Clark, 1988), and the correspond- 
ing practice among Japanese automakers of 
providing preliminary specifications to sup- 
pliers who comment on the proposed design 
and supply drawings of parts (Asanuma, 
1988b)— innovations in engineering organi- 
zation that contribute to a more efficient use 
_ of existing production equipment and manu- 
facturing know-how. Moreover, taking ac- 
count of the limits and capabilities of pro- 
duction equipment in the design phase makes 
it easier to ensure that quality standards can 
be met, and so is complementary to a mar- 
keting strategy based on high quality. 

The firm’s problem in deciding whether to 
adopt any or all of these changes is marked 
by important non-convexities. These are first 
of the familiar sorts associated with indivisi- 
bilities: product line sizes are naturally inte- 
ger-valued. A form of increasing returns also 
figures in the model, because the marginal 
impact of increasing the speed with which 
customers are served increases the service 
speed. Beyond these, however, the comple- 
mentarities noted above can be a further 
source of non-convexities that are associated 
with the need to coordinate choices among 
several decision variables. For example, pur- 
chase of CAD/CAM technology makes it 
less costly for a firm to increase its frequency 
of product improvements, and more fre- 


quent product introductions raise the return | 


to investments in CAD/CAM technology. 
Thus, it may be unprofitable for a firm to 
purchase a flexible CAD/CAM system with- 
out changing its marketing strategy, or to 
alter its marketing approach without adopt- 
ing a flexible manufacturing system, and yet 
it may be highly profitable to do both to- 
gether. (In contrast, if the value of a smooth 
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concave function at some point in the inte- 
rior of its domain cannot be increased by a 
small change in the value of any single vari- 
able, then the function achieves a global 
maximum at that point.) These non-convexi- 
ties then explain why the successful adoption 
of modern manufacturing methods may not 
be a marginal decision. 

It is natural to expect the characteristics 
of the modern manufacturing firm to be 
reflected in the way the firm is managed and 
the way it structures its relations with cus- 
tomers, employees, and suppliers. Exploiting 
such an extensive system of complementari- 
ties requires coordinated action between the 
traditionally separate functions of design, 
engineering, manufacturing, and marketing. 
Also, according to transaction costs theories, 
the increasing use of flexible, general pur- 
pose equipment in place of specialized, sin- 
gle purpose equipment ought to improve the 
investment incentives of independent suppli- 
ers (Oliver Williamson, 1986; Benjamin 
Klein, Robert Crawford, and Armen Alchian, 
1978; Jean Tirole, 1986) and to reduce cost 
of the negotiating short-term contracts 
(Milgrom and Roberts, 1987) and so to fa- 
vor short-term contracting with independent 
suppliers over alternatives like vertical .inte- 
gration or long-term contracting. The sup- 
plier relations that mark modern manufac- 
turing firms—involving close coordination 
between the firm and its independently 
owned contractors and suppliers— appear to 
be consistent with these theories, and incon- 
sistent with theories in which joint planning 
can only take place in integrated firms. 

In this essay, we develop a theoretical 
model of the firm that allows us to explore 
many of the complementarities in modern 
manufacturing firms. The non-convexities 
inherent in our problem makes it inappro- 
priate to use differential techniques to study 
the effects of changing parameters. Instead, 
we utilize purely algebraic (lattice-theoretic) 
methods first introduced by Donald M. Top- 
kis (1978), which provide an exact formaliza- 
tion of the idea of groups of complementary 
activities. In problems with complementari- 
ties among the choice variables this ap- 
proach easily handles both indivisibilities and 
non-concave maximands while allowing 
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sharp comparative statics results. In particu- 
lar, we give conditions under which the set 
of maximizers moves monotonically with 
changes in a (possibly multidimensional) pa- 
rameter. Because these methods are quite 
straightforward and would seem to be of 
broad applicability in economics, but are not 
well known among economists, we describe 
them in some detail in Section I. 

Our model and its basic analysis are pro- 
vided in Section IJ. The firm in our model 
choices its price; the length of the pro- 
duct life cycle or frequency of product im- 
provements (a surrogate for quality); its 
order-receipt, processing, and delivery tech- 
nologies; various characteristics of its manu- 
facturing and design technologies as re- 
flected in its marginal cost of production and 
its costs of setups and new product develop- 
ment; its manufacturing plan, including the 
length of the production cycle (and, implic- 
itly, its inventory and back-order levels); and 
aspects of its quality control policy, all with 
the aim of maximizing its expected profits. 
Using reasonable assumptions about the na- 
ture and equipment costs, we find that the 
complementarities in the system are perva- 
sive. We use the firm’s optimizing response 
to assumed trends in input prices (the falling 
costs of communication, computer-aided de- 
sign, and flexible manufacturing) in the pres- 
ence of these complementarities to explain 
both the clustering of characteristics and the 
trends in manufacturing. 

In Section III, we turn our attention to the 
organizational problems associated with the 
new technologies. We-summarize and review 
the predictions of the model in the conclud- 
ing Section IV. 


I. The Mathematics of Complementarities 


Here we review some basic definitions and 
results in the mathematics of complementar- 
ities. The results permit us to make definite 
statements about the nature of the optimal 
solution to the firm's problem and how it 
depends on various parameters, even though 
the domain of the objective function may be 
non-convex (for example, some variables may 
be integer-valued) and the objective function 
itself may be non-concave, non-differentia- 
ble, and even discontinuous at some points. 


JUNE 1990 


For additional developments and missing 
proofs, see Topkis. 

We first introduce our notation. Let x, x' 
€ R”. We say that x > x' if x, 2 xj for all i. 
Define max(x,x') to be the point in R” 
whose ith component is max(x, x/), and 
min(x, x^) to be the point whose ith compo- 
nent is min(x; xf). This notation is used 
below to define the two key notions of the 
theory. The first notion is that of a super- 
modular function, which is a function that 
exhibits complementarities among its argu- 
ments. The second is that of a sublattice of 
R", a subset of R" that is closed under the 
max and min operations and whose structure 
lets us characterize the set of optima of a 
supermodular function. 


Definition 1: A function f: R" — R is super- 
modular if for all x, x' € R”, 


Q) f(x) + f(x’) 
< f(min(x, x’))  f(max(x, x’)). 


The. function f is submodular is — f is su- 
permodular. 
Inequality (1) is clearly equivalent to 


[/(x) — f(min(x, x’))] 
+[ f(x’) 7 f(min(x, x?))] 
< f(max(x, x’) — f(min(x, x’)): 


the sum of the changes in the function when 
several arguments are increased separately is 
less than the change resulting from increas- 
ing all the arguments together. The inequal- 
ity is also equivalent to 


f(max( x, x^)) - f(x’) z f(x) 
— f (min( x, x’)): 


increasing one or more variables raises the 
return to increasing other variables. These 
reformulations of the defining inequality 
make clear the sense in which the supermod- 
ularity of a function corresponds to comple- 
mentarity among its arguments. 

Note that any function of a single variable 
is trivially supermodular. This observation 
serves to resolve various questions about 
possible relationships between supermodu- 
larity and other concepts. However, even in 
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a multidimensional context, supermodularity 
is distinct from, but related to, a number of 
more familiar notions. First, supermodular- 
ity has no necessary relation to the concavity 
or convexity of the function: consider 
f(x, x3) 7 x2 + x5, which is supermodular 
for all values of a and b but may be either 
concave or convex (or both or neither). Nor, 
in the context of production functions, does 
supermodularity carry implications for re- 
turns to scale. For example, the Cobb- 
Douglas functions f(x, x.) =x?x? may 
show increasing or decreasing returns to scale 
but are supermodular for all positive values 
of a and b. This is most easily checked using 
Theorem 2, below, which states that a smooth 
function f is supermodular if and only if 
0?f/0x,0x,z 0 for i+ j. Thus, if f is super- 
modular and smooth, then the smooth su- 
permodular function — f shows weak cost 
complementarities as defined by William 
Baumol, John Panzar, and Robert Willig 
(1982, pp. 74-75). Even without smoothness, 
it is easily shown that a submodular function 
that is zero at the origin shows economies of 
scope as defined by Baumol et al. More 
generally, submodularity is related to, but 
distinct from, the notion of subadditivity 
that figures centrally in the study of cost 
functions.) For example, any function of a 
single variable is submodular, but obviously 
not all such functions are subadditive. 
Meanwhile, the functions on [0,1] X[0,1] 
given by f(x), x5) =1+ x; +x, + exx, are 
submodular for e> 0, supermodular for £z 
0, and subadditive for all e sufficiently close 
to zero in absolute value. 

Six theorems about supermodular func- 
tions are provided here. The first four to- 
gether provide a relatively easy way to check 
whether a given function is supermodular. 
Theorem 5 indicates how, in a parameterized 
maximization problem, the maximizer 
changes with changing parameters, while 
"Theorem 6 characterizes the set of maximiz- 
ers of a supermodular function. It is Theo- 
rem 5 that makes our comparative statics 
exercises possible. 

Let x4, denote the vector x with the ith 
component removed and let x. ,, denote x 


7A function f is subadditive if fo» fi) z fix 
y) for all x and y. 
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with the ith and jth components removed. 
Let subscripts .on f denote partial deri- 
vatives, for example, f,;= 0f/dx; fi; = 
8?f/àüx, xj. 

THEOREM 1: Suppose f: R" >R. If for all 
i, j, and xj, f(x Xj, X ,;) is supermodu- 
lar when regarded as a function of the argu- 
ments (x; xj) only, then f is supermodular. 


THEOREM 2: Let I-[ay,b]X--- X 
[a,, b,] be an interval in R" with nonempty 
interior and suppose that f: I R is continu- 
ous and twice continuously differentiable on 
the interior of I. Then f is supermodular on I if 
and only if for all i * j, fij 20. 


Theorem 2 is stated above in the form given 
in Topkis. For our application, we will need 
a slightly stronger theorem in which the con- 
dition that f is twice continuously differen- 
tiable is weakened to the condition that it 
can be written as an indefinite double inte- 
gral with a nonnegative integrand. The pre- 
cise extension is stated and proved in the 
Appendix. 


THEOREM 3: Suppose that f, g: R" —^ R are 
supermodular functions. Then f + g is super- 
modular. If, in addition, f and g are nonneg- 
ative and nondecreasing, then fg is supermod- 
ular. 


THEOREM 4: Suppose that f:R'*" >R is 
supermodular and continuous in its first .ar- 
gument. Then for all a, bER, the func- 
tion g:R" > R defined by g{x)= 
màX, ei p f Cy, x) is supermodular. 


PROOF: 

Since f is continuous in its first argument, 
the function g is well defined. For all x and 
x’, there exist y and y' with g(x) = f(y, x) 
and g(x’) = f(y’, x’). Then, 


g(x) t g(x) 2 fCr x) e f(y, x?) 
< f (max( y, y’), max(x, x’) 
+ f(min( y, y’), min( x, x’)) 
< g(max(x, x’)) 


+g{min(x,x’)). - O 
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In what follows we will be particularly 
concerned with constrained optimization of 
supermodular functions, and our results will 
depend on the constraint set having the right 
structure or shape, namely, that of a sublat- 
tice of R". 


Definition 2: A set T is a sublattice of R" 
if for all x,x'eT, min(x,x) €T and 
max(x, x^) e T. 

In our application, the definition of a sub- 
lattice represents the idea that if it is possi- 
ble to engage in high (respectively, low) lev- 
els of each of several activities separately, 
then it is possible to engage in equally high 
(resp., low) levels of all of the activities si- 
multaneously. Thus, for example, if S,,..., 5, 
are arbitrary subsets of R, then S; X --- XS, 
is a sublattice of R". However, the product 
sets are not the only sublattices. The sub- 
lattice structure also permits the possibility 
that some activities can be engaged in at a 
high level only if the others are also carried 
out at a high level. For example, if x > x’, 
then { x, x’} is a sublattice. 


Definition 3: Given two sets S, S' C R", we 
say that S is higher than S’ and write S > S" 
if for all x €S and x'€ S", max(x, x") ES 
and min(x, x^) € S’. 


THEOREM 5: Suppose f: RP** —^ R is su- 
permodular and suppose T(y) and T(y’) 
are sublattices of R". Let S(y)= 
argmax( f(z, y)lz € T(y)), and define S^) 
analogously. Then y 2 y' and T(y) 2 T(y) 
imply that S( y) = S( y^. 


PROOF: 

Let x € S(y) and x'€S(y^ and y2 y' 
so that y = max( y, y^ and y'= min( y, y^). 
Since T(y) > T( y^, max(x, x^) € T(y) and 
min(x, x^ € T(y^. From the definitions, 
f(x, y) = füanax(x, x’), y) and f(x’, y) = 
f(min(x, x^), y^, but since f is supermodu- 
lar, f(x, y) f(x’, y") € f(max(x, x’), y)+ 
f(min(x, x^), y^) from which the conclusion 
is immediate. o 


THEOREM 6: Suppose f: R" 9 R is super- 
modular and suppose T is a sublattice of R". 
Then the set of maximizers of f over T is also 
a sublattice. 


JUNE 1990 


PROOF: 
Apply Theorem 5 with y = y’. o 


Theorem 5 is particularly important for 
our application. When its conclusion holds, 
we shall say that the set of optimizers “rises” 
as the parameter values increases. What jus- 
tifies this language? The theorem implies, for 
example, that if x*(y) and x*(y’) are the 
unique maximizers given their respective pa- 
rameter vectors y and y' and if y > y’, then 
x*( y) = x*( y^. (For uniqueness implies that 
x*(y) = max(x*(y), x*(y^), from which 
x*(y) = x*(y’) follows) Alternatively, 
suppose we assume that f is a continuous 
supermodular function that T is compact 
sublattice, so that the set of maximizers cor- 
responding to any parameter vector y is 
compact. Then, by Theorem 5, there are 
greatest and least elements x(y) and x(y) 
in the set of maximizers S. One can show 
that both x(y) and x(y) are nondecreasing : 
functions of y. (Using Theorem € and the 
definitions, X(y’) < max(X(y), xX(y)) x 
x(y) and similarly x( y) = min(x( y), x( y?) 
2 x(y)) 


II. Complementarities in Production 


We study a model of a multiproduct firm 
facing a downward sloping demand curve. 
The firm may be a monopoly or monopolis- 
tic competitor. Alternatively, our model may 
be viewed as a building block for a model of 
oligopolistic markets. 

In the formal model, the firm chooses the 
levels of the following decision variables: 





Variable Interpretation 

p Price of each product 

q (Expected) number of 
improvements per product 
per period 

a Order receipt and 
processing time 

b Delivery time 

c Direct marginal costs of 
production 

d Design cost per product 
improvement 

e Extra set-up costs on 


newly changed products 
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m ` Number of setups per period 
r Probability of a defective batch 
S Direct cost of a setup 
w Wastage costs per setup 


In addition, we denote the number of prod- 
ucts by n. i 

The functional relationships and parame- 
ters that complete the model include the 
demand specification, the specification of the 
capital costs of different levels of the techno- 
logical variables, the functional relation link- 
ing the average delay between receipt of an 
order and its being filled to the number of 
products and of setups, the marginal cost of 
production, the marginal cost of reworking 
defectives, the cost of holding inventories, 
and a time parameter that will proxy for the 
state of technology and demand. More 
specifically, we have 


Parameter Interpretation 

p Marginal cost of reworking 
a defective unit 

T Calendar time 

t Cost of holding inventory 
per unit 

K Capital costs 
(k=x(a, b,c, d,e,r, s,w,T)) 

E Base demand per product 
(H= Bp, 4, n,7)) 

ô Demand shrinkage with 
delay time (6=8(t,7)) 

W Expected wait for a processed 
order to be filled - 


(e = w(m, r, n)) 


The total expected wait for an order to be 
received, processed, filled, and shipped, 
which determines the value of the shrinkage 
factor (8) on demand, is t= a+w+b, and 
realized demand is then p( p,q, n, 7)6(a + o 
+b, 7). Thus, the firm's payoff function 

is 


II( p,q, m,a,b,c,d,e,r,s,w,v) 
—(p-c-rp-i/m)nu(p,q.n,7)- 
x8(at+w(m,r,n)+b,7) 
— m(s-- w)  ng(d * e) 
— k(a,b,c,d,e,r,s,w,r). 
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The total profit II is the operating profit 
minus the fixed costs associated with ma- 
chine setups, product redesign, and the pur- 
chase of capital equipment. 

The first term (p—c—rp—i/m)npà is 
the operating profit. For each unit sold, the 
firm receives the price p and pays direct 
production costs, expected rework costs, and 
inventory holding and handling costs. In line 
with the Economic Order Quantity models 
commonly used for inventory analysis, we 
treat the average levels of work-in-process 
and finished goods inventories as being di- 
rectly proportional to demand and inverse- 
ly proportional to the number of setups. 
Similarly, back orders are directly propor- 
tional to demand and decreasing in the num- 
ber of setups. We have used the function ô, 
which we take to be uniform across prod- 
ucts, to model the cost of back orders; this 
takes the form of lost demand when delivery 
is delayed. We have also modeled the firm as 
setting a uniform price ( p) across the prod- 
uct line, which is reasonable given the sym- 
metry of the products in the model. 

The second term m(s +w) is the cost of 
the setups, which, as suggested above, con- 
sists of the number of setups times the sum 
of the direct costs plus wastage per setup. 
The term nq(d +e) is the cost of redesign 
over the period, including the extra setup 
costs on newly altered products. In this term, 
nq is the total number of redesigns or im- 
provements. 

The last term is x, which is the capital cost 
of selecting the various technological vari- 
ables, a, b, c, d, e, r, s, and w, at any date r. 
Among these technology variables, the order 
receipt and processing time (a) is determined 
by the technology used for communicating 
orders (mail, express courier, FAX, elec- 
tronic data communications networks, etc.) 
and by the means used to handle orders once 
received (manual entry, computerized order 
entry systems). Whichever choices are made, 
there are capital costs involved in setting up 
the corresponding systems. Similarly, differ- 
ent options exist that determine the speed of 
delivery (b) from inventory, and these too 
have differing capital costs. 

Our model allows us to represent many 
aspects and tradeoffs in the firm's choice of 
manufacturing strategy. For example, the 
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flexibility of design technology is modeled 
by the variable d. The introduction of com- 
puter aided design (CAD) lowers these mar- 
ginal costs of redesigning and improving 
products, but it also involves significant cap- 
ital expenditures on training, hardware, and 
software. Both of these effects are captured 
in our profit function. 

Flexibility of manufacturing equipment 
has a number of aspects, several of which are 
represented in our model. First, flexibility is 
often associated with low costs of routinely 
changing over from producing one good to 
another. Here, this effect is represented first 
through the variable s: more flexible equip- 
ment means lower setup costs in terms of the 
downtime and direct labor costs involved in 
resetting the machines, switching dies, etc. 
Also, more flexible equipment might involve 
less wastage (lower w) per setup. This 
wastage might be in the form of extra in- 
spection, scrap, rework, and repair costs that 
are necessary when a changeover is made. 
The precision of computer aided equipment, 
“design for manufacturability" (facilitated 
by CAD-CAM), and similar investments 
lower these costs. Finally, flexibility might 
involve costs of changing machinery over to 
produce new or redesigned products (low 
values of e). 

The technological quality variable (r) cap- 
tures a somewhat different feature of mod- 
ern manufacturing methods. Improving qual- 
ity on this dimension may involve investing 
in more precisely controlled machinery which 
may even constantly monitor and adjust it- 
self. It may also improve more prosaic but 
possibly more significant efforts aimed at 
changing attitudes toward quality, such as 
giving workers the ability to stop the pro- 
duction line when a problem arises. 

Although we do not explicitly model labor 
force decisions here, an element of the flexi- 
bility of modern manufacturing is associated 
with broadly trained workers and with work 
rules that facilitate frequent changes in activ- 
ities. In this context we may interpret invest- 
ments in flexibility in terms of worker educa- 
ton and industrial relations efforts, as well 
as the purchase of physical capital. Cer- 
tainly, flexibility in the labor force and in the 
capital equipment are mutually complemen- 


tary. 


JUNE 1990 


Finally, the choice of c, the marginal costs 
of production, has capital cost implications, 
if only through investing in learning how to 
control costs. 

Even before we make any assumptions 
about the form of the unspecified functions 
K, 14, 9, and w, certain complementarities are 
evident in the model For example, with 
more frequent changeovers (higher 4), the 
returns to more efficient technologies for re- 
designing products and changing over equip- 
ment (higher values of —d and —e) will 
naturally rise and, conversely, more efficient 
changeover and redesign technologies raise 
the marginal returns to increasing q. Simi- 
larly, an increase in the number of setups per 
period (m) and the concomitant reduction in 
inventories and back orders is complemen- 
tary with a reduction in the components of 
set-up costs (increases in — s. and — w). 
Technologically,, one expects that reduced 
set-up and changeover costs are bundled to- 
gether in the new equipment. However, con- 
clusions such as that one require that we 
make an assumption about the properties of 
the unspecified function x. To make further 
statements, we need to make assumptions 
about properties of all the unspecified func- 
tions. 

Our assumption about the form of 8 is the 
following: 


ASSUMPTION AJ: ô is twice continuously 
differentiable, nonnegative, decreasing and 
convex in t, nondecreasing in + and submodu- 
lar in (t, 7). 


The assumption that ô is decreasing in ¢ 
simply means that increased delay reduces 
sales, while convexity says that the larger is 
the delay, the smaller is the marginal impact 
of additional delay. Inclusion of r allows for 
a time trend in demand through ô. This 
trend must be nonnegative for our results, 
but it could be trivial. The submodularity 
assumption means that, as time passes, 6, 
becomes weakly more negative or, equiva- 
lently, the returns to reducing waiting time, 
— t, increase weakly. This might come about 
because the adoption of more modern manu- 
facturing methods by the firm's customers 
raises the importance of speedy service to 
them. 
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An immediate implication of the convex- 
ity assumption in Assumption Al is that 
activities that reduce the several components 
of delay time (a, b, and w) are mutually 
complementary, as can be easily verified by 
checking that the corresponding mixed par- 
tial derivatives of the profit function are 
positive. These complementarities may seem 
surprising, since the three components of 
waiting time are perfect substitutes for one 
another in determining the total delay. How- 
ever, as our analysis shows, the possibility of 
substituting these elements to achieve a fixed 
time delay is irrelevant to their assessment as 
potential complements within the corporate 
strategy. 

To complete our evaluation of the comple- 
mentarities associated with speed, we must 
take an assumption about the w function. 
Generally, we would expect w to be increas- 
ing in the probability of a batch requiring 
reworking (r) and decreasing in the number 
of setups (m). The one nonobvious element 
in Assumption A2 below is that r and m are 
complements in determining w: an increase 
in the number of setups (or decrease in batch 
size) is assumed to raise the impact on delay 
time of an increase in the probability of a 
batch being defective. This complementarity 
may be caused by the more frequent 
changeovers in the rework facility being re- 
quired by more frequent changeovers in the 
main facility. Otherwise, it seems natural to 
expect the.effect to be zero, which is also 
consistent with our assumption. 


ASSUMPTION A2: w= e(m,r,n) is twice 
continuously. differentiable, decreasing in m, 
increasing in r and supermodular in m and 
r, given n. That is, &,, € 0, w,20, and 
Bmp Z Q- 

As a consequence of (A1) and (A2), — a, 
— b, m and —r are mutally complementary 
in increasing demand. 

To complete our analysis of the marketing 
aspects of strategy, the form of the demand 
function u must be restricted. We make two 
assumptions. The first is a standard one: 


ASSUMPTION A3: y is twice continuously 
differentiable, increasing in q, and decreasing 
in p, while operating profits, defined by ( p — 
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c— rp — w/m)nu(p, q, n, 7)8(a + w + b, 7), 
are a strictly quasi-concave function of p. 


The first part asserts innocuously that con- 
sumers prefer lower prices and higher qual- 
ity. Since we will hold n fixed, we need make 
no assumptions on its effect, although it 
would be natural to assume that np is in- 
creasing in n. The assumption that demand 
is quasi-concave in prices is standard. The 
nonstandard part of our assumption about 
demand is contained in Assumption A4: 


ASSUMPTION A4: u(p,q,n,T) is non- 
decreasing in + and supermodular when re- 
garded as a function of — p, q and v, for 
given n. 


A4 is a complicated assumption. It would be 
satisfied, for example, by a multiplicatively 
separable specification of demand, p= 
A( p) B(q)C(7), as well as by additively sep- 


` arable demand A( p)- B(q)- C(t), and A’ 


«0, B’>0, and C’>0 in both cases. It 
asserts that the quantity demanded becomes 
(weakly) more sensitive to price and quality 
with passing time, and that at higher quality 
levels the quantity demanded is more sensi- 
tive to price changes. Again, we emphasize 
that we allow demand to be independent of 
7, but if a dependence exists, it should not 
be such as to offset the supply-side effects of 
technological progress embodied in the effect 
of 7 in the x function. | 

Our final assumption is the following one, 
on the « function. It is key because it em- 
bodies the presumed technological changes 
in the capital goods industries supplying the 
firm that are the basis for our arguments. 


ASSUMPTION AS: k«(-a,—b,—c,— d, 
—e,—r,—5, —w,T) is submodular. 


This assumption is stated in terms of the 
negatives of the natural decision variables 
because a and b decrease with improv- 
ed communication systems, better data 
transmission, entry, storage, manipulation, 
and retrieval systems, and speedier delivery 
methods, and the other choice variables de- 
crease with improved design, manufacturing, 
quality, and cost-control technologies, that 
is, with increases in 7. 
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Conceptually, A5 has two parts. The first 
is Our assumption about the time path of 
exogenous technological change: the incre- 
mental capital costs of modern techriologies 
for communication, delivery, design, and 
flexible production are falling over time. As- 
suming differentiability of x, so that Theo- 
rem 2 applies, these trends are captured in 
the inequalities 0% /dxd7 <0, x= —a, 
—b,—c,—d,—e,—r,—s,—w;: technologi- 
cal change among capital equipment suppli- 
ers lowers the costs over time of the firm’s 
increasing delivery speeds, using more flexi- 
ble manufacturing methods, reducing the 
probability of defects, reducing costs ‘of re- 
design and controlling production costs. No- 
tice that we require no assumptions about 
the relative rates at which these prices are 
falling, because all these price changes will 
turn out to have mutually reinforcing effects. 

The second part of A5 concerns the inter- 
relationships among investments in the new 
technologies. For example, assuming that the 
mixed partial derivative of « with respect to 
— d and — e is positive means that the level 
of investment in flexible equipment neces- 
sary to reduce extra set-up costs by a given 
amount is reduced by investments in flexible 
design equipment. Of course, if the technolo- 
gies were completely separable, the condi- 
tion would be met but, as we argued in the 
introduction, separability is not a realistic 
assumption. The cost of instituting both 
computer-aided design and flexible machin- 
ing systems (FMS) to achieve given levels of 
design and set-up costs is generally less than 
the sum of the costs of instituting the two 
separately because the CAD equipment may 
provide set-up instructions readable by the 
FMS machinery, eliminating the costly step 
of encoding (possibly with error) the design 
instructions into a form readable by the 
flexible machine. Other complementarities in 
the physical equipment are similarly repre- 
sented in x. Thus, CAD makes it less costly 
to reduce defects by making it much cheaper 
to design products that are easily manufac- 
tured, while a computerized order-entry sys- 
tem can eliminate the need to transcribe 
order information into a form readable by 
the manufacturing computers, saving costs 
and reducing errors. 
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Other interactions, based on substitute 
uses of resources, could work against our 
complementary assumptions. For example, if 
the firm faces a fixed capital budget or a 
rising cost of capital with increased levels of 
investment, or if there are constraints on 
space or personnel and computer based sys- 
tems for communication and design compete 
for these resources, then an investment in 
lowering a would raise the cost of invest- 
ments to lower d. The second part of A5 is 
the hypothesis that the technological com- 
plementarities we have identified are larger 
than the effects of any constraints on the 
resources that the systems must share. 

Note that throughout the analysis, we are 
holding the rework cost parameter p and the 
inventory holding cost parameter + fixed. 

The problem of maximizing II is not 
amenable to standard, calculus-based tech- 
niques. First, although demand is assumed 
to be a quasi-concave in price, we have made 
no other concavity assumptions. Indeed, the 
assumed convexity of à means that profit, 
exclusive of capital costs, is actually convex 
in the total delay between placing an order 
and receiving shipment, and since A5 places 
no restrictions on the concavity or convexity 
of x, II may well be convex in — a, — b, m 
and — r over some ranges. In this case, satis- 
faction of a first-order condition identifies a 
(local) minimum with respect to the variable 
in question. Moreover, it is natural to take m 
to be integer-valued of the form, nk, where 
k is the number of production cycles per 
period and n is the number of products. 
However, the methods developed in the pre- 
vious section are applicable here, once we 
have the necessary complementarities. In 
this, A5 plays a major role. 

We are now ready to state and prove our 
main results. The idea is to use Theorem 2 to 
show that the firm's objective function is 
supermodular in the firm's (sign-adjusted) 
decision variables and that consequently, by 
Theorems 5 and 6, the set of optimizers 
forms a sublattice that moves up over time. 
However, it is not true that II is a super- 
modular function of all its arguments, be- 
cause the mixed partial derivatives of that 
function in price and the determinants of 
waiting time have the wrong sign when the 
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FIGURE 1. OBTAINING 7 FROM II 


NOTE THAT : ; 
max, > p II = H(p,...,7) =7(— p,..-)7) 
WHILE max, » p,lI(p,...,7) = II(P5,..., 7) 
=a(—P,,...,7) 


price is set too low. To get around that 
difficulty, we consider the optimized value of 
profit with respect to price where the price is 
restricted by a lower bound P. (See Figure 
1.) Letting this bound replace the price as 
the choice variable in our problem, it is 
apparent that this change of variables leaves 
the optimal values of the non-price variables 
unchanged. By A3, for any fixed values of 
the other decision variables and parameters, 
the corresponding optimal value of p is 
unique, and the highest optimal value of P 
equals the optimal price p.* Moreover, as we 
now show, the new function is super-mod- 
ular in the sign-adjusted decision variables. 


THEOREM 7: Assume Al through A5. Then 
the function 


HAP ah i deed 
—5,—W,T) 
— max, pll( p, q,m, a, b, 
€, d,e,r,s,w,r) 
is supermodular on the sublattice of R". de- 


fined by the restrictions that all the decision 
variables be nonnegative. 


“This construct also ensures that the price at which 
demand is calculated will always weakly exceed marginal 
cost, CF rp t i/ m. 
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PROOF: 

By Theorem 3 and AS, it is enough that 
the function 7 + « be supermodular. If m + «x 
were twice continuously differentiable every- 
where, then by Theorem 2 it would be suf- 
ficient to check that all the cross-partials of 
a+ are nonnegative. The verification is 
routine, except that m +x may have no sec- 
ond derivatives on the set of points in the 
domain where 0(II+«)/dp =0. However, 
it is straightforward to verify that the slightly 
weaker assumptions of Theorem 2* in the 
Appendix are satisfied, so m +x is super- 
modular, as we had required. n 


THEOREM 8: Assume Al through A5 and 
that x is continuous. Let the individual deci- 
sion variables each be constrained to lie in a 
compact set consistent with the nonnegativity 
requirement, so that together they lie in a 
compact set that is a sublattice. Then the set of 
maximizers of x is a compact sublattice which 
rises with 7. 


PROOF: 

Apply Theorems 6 and 7. Note that this 
result allows us to restrict the decision vari- 
ables to. be integer-valued and to limit the 
number of available technologies to some 
finite set. The supermodularity of the func- 
tional form « + « is verified by investigating 
its derivatives on continuous intervals, and 
the restriction to a compact sublattice is 
imposed later in a way that permits a restric- 
tion to discrete choices. 

The key conclusion is that the sign- 
adjusted decision variables all rise over time. 
Thus, as time passes, one expects to see a 
pattern of the following sort linking changes 
in a wide range of variables: 


* Lower Prices, 

e Lower Marginal Costs, l 

e More Frequent Product Redesigns and 
Improvements, 

e Higher Quality in Production, Marked by 
Fewer Defects, 

* Speedier Communication with Customers 
and Processing of Orders, 

* More Frequent Setups and Smaller Batch 
Sizes, with Correspondingly Lower Levels 
of Finished-Goods and Work-In-Process 
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Inventories and of Back Orders per Unit 
Demand, 

* Speedier Delivery from Inventory, - 

e Lower Setup, Wastage, and Changeover 
Costs, 

e Lower Marginal Costs of Product Re- 
design. 


The conclusion in Theorem 6 that the set 
of optimizers forms a sublattice implies that 
if at any time there are multiple solutions to 
the optimization problem, then there is a 
highest and a lowest optimal solution in the 
vector inequality sense. Further, comparing 
any two firm's choices, if these differ, then 
the choice of selecting the higher, “more 
modern" level for each decision variable is 
also optimal, as is the vector made up of the 
term-by-term minimal values of the two 
firm's choices. More typically, however, we 
might expect a unique solution. 

In any case, the chosen levels move up 
together over time in response to the falling 
costs of faster communications, more flexible 
production, and more frequent redesign. 

The model we have presented is a static 
one, but it is nevertheless suggestive about 
the nature of the path to the modern manu- 
facturing strategy. Specifically, it suggests 
that even if the changes that take. place in 
the environment— especially the falling cost 
of the equipment used under the modern 
manufacturing strategy—happen gradually, 
the adoption process may be much more 
erratic, for two reasons. First, there are non- 
convexities, which mean that the optimum 
may shift discontinuously, with the profit- 
maximizing levels of the whole complex of 
variables moving sharply upward. This makes 
it relatively unprofitable to be stuck with a 
mixture of highly flexible and highly special- 
ized production equipment. One does not 
necessarily expect to find that the adoption 
of the new equipment is sudden; it may still 
be desirable to iron out the wrinkles in the 
new technology with an initial small scale 
adoption. What the theory suggests we 
should not see is an extended period of time 
during which there are substantial volumes 
of both highly flexible and highly specialized 
equipment being used side-by-side. Then, 
once the adoption is well underway, it should 
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proceed rapidly, with increasing momentum. 
Second, there are the complementarities, 
which make it relatively unprofitable to adopt 
only one part of the modern manufacturing 
strategy. The theory suggests that we should 
not see an extended period of time during 
which one component of the strategy is in 
place and the other components have barely 
begun to be put into place. For example, we 
should not see flexible equipment used for a 
long period with unchanging product lines. 
The conclusion of Theorem 8 that firms 
will increase quality in the sense of reducing 
the probability r of a defective batch is 
worth further comment. Many observers 
have noted a focus on increased quality of 
output among modern manufacturing firms. 
One would expect that design for manufac- 
turability would result directly in lower de- 
fect rates. However, the complementarities 
displayed in the model provide a second, less 
obvious incentive for increased quality. De- 
creases in the probability of defects are 
strictly complementary with increases in m 
through the effect on operating profit: de- 
mand grows with increases in m, and this 
increases the return to lowering costs by 
reducing the probability of reworking.? 
Recall that we have held n, the number of 
products, fixed throughout this analysis. In- 
spection of the profit function in light of the 
arguments in Theorem 7 should make the 
necessity of doing this clear: neither n nor 
its negative are naturaly complementary 
with the other decision variables. This shows 
up most clearly in the cost of redesign term, 
— nq(d + e), where increases in n make de- 
creases in d and e more attractive but in- 
creases in q less attractive. There are further 
potential complications through the demand 
term, and so without very special assump- 
tions we cannot include n in the cluster of 
complements. 
That we cannot include the number of 
products is somewhat surprising: surely 
broader: product lines would seem to be 


5Note too that decreases in.r are also strictly com- 
plementary with increases in the other quality variable, 
q, as well as with decreases in the delay in communicat- 
ing with customers and processing their orders (a) and 
in the time to deliver the inventory (b). - 
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complementary with reduced set-up costs, 
and this intuition has in fact been verified in 
simpler models (see Xavier de Groote, 1988). 
However, the ambiguity surrounding n in 
richer models appear to reflect something 
real. On the one side, there. are numerous 
examples of firms massively broadening their 
product lines with the adoption of modern 
manufacturing methods, and some of these 
were cited above. On the other, anecdotal 
evidence (for example, James. B. Treece, 
1989) as well as both the discussions of the 
* focused factory" found in the literature on 
manufacturing strategy (for example, David 
A. Garvin, 1988, especially Ch. 8) and some 
formal statistical analysis (Mikhel Tombak 
and Arnoud De Meyer, 1988) point to firms 
having reason to narrow their product lines 
when shifting to more modern manufactur- 
ing patterns and of their acting to do so. 


II. Manufacturing and Organization 


How is a manufacturing firm most effi- 
ciently organized and managed? Several of 
the trends analyzed in Section II have a 
direct bearing on this question. First, con- 
sider the complementarities that exist 
between the various functions in the firm: 
marketing, order-processing, shipping, engi- 
neering, and manufacturing. If the firm's 
problem were smooth and concave (despite 
the complementarities) and its environment 
were stationary and if the optimum is not on 
the boundary of the feasible set, the comple- 
mentarities would not pose a serious organi- 
zational problem: if none of the managers 
controlling the individual functions can find 
a small change that raises the firm's expected 
profits, then there is no coordinated change 
—]arge or small—that can raise profits. 
However, in our non-concave problem, it is 
possible that only coordinated ‘changes 
among all the variables will allow the firm to 
achieve its optimum. Non-convexities and 
significant complementarities provide a rea- 
son for explicit coordination between func- 
tions such as marketing and production. 


*A similar point is made by de Groote, who investi- 
gates a different model of complementarities between 
marketing and manufacturing. 
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(Extension of the methods in this paper to a 
game-theoretic context can be used to model 
this coordination problem and the role of 
the central coordinator: see Milgrom and 
Roberts, 1989.) 

Even without non-convexities, significant 
complementarities-in a rapidly changing en- 
vironment provide another reason for close 
coordination between functions. Think of the 
managerial planning process as an algorithm 
to seek the maximum of the profit function. 
Successful performance in the face of rap- 
id environmental change requires the use of 
fast algorithms (for example, Newton’s 
method), and these require a coordinated: 
choice of the decision variables that recog- 
nizes the interactions among these variables 
in the profit function. 

Second, suppose that the organization be- 
ing modeled is one where sales are made 
through several different stores. If the opti- 
mal speed of order-processing (a) jumps 
down, it may be desirable that all the stores 
install computerized systems linked to the 
manufacturing facility to track orders and 
sales. If there are fixed costs or other 
economies of scale in the computer system, 
then it is important that all, or nearly all, of 
the stores participate. However, unless all 
the costs and benefits of the change accrue 
to one agent, there arises a standard public 
goods, free-rider problem. Eliciting efficient 
cooperation from the store owners could be 
expensive and may provide a reason for ver- 
tical ownership of the distribution channel, 

Third, Oliver Williamson and Klein, 
Crawford, and Alchian (1978) have argued 
that the advantages of increased vertical gov- 
ernance grow as assets become increasingly 
specialized. This occurs, it is argued, because 
the returns from specialized investments are 
vulnerable to appropriation. Then, as 
Williamson and Jean Tirole (1986) have ar- 
gued, fear of appropriation causes insuffi- 
cient investment to be made or, as we have 
argued (Milgrom and Roberts, 1987), it en- 
courages the parties to waste resources by 
investing in bargaining position. Following 
this line of argument, let us equate “special- 
ization” of assets with inflexibility of retool- 
ing to produce different products, so that it 
may be measured by e. The net costs of 
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governance, bargaining, and deterred- or dis- 
torted incentives are y(— v, — e), where v is 
a vector measure of the extent or complexity 
of vertical governance. We formalize a ver- 
sion of the hypothesis that increased flexi- 
bility of assets reduces the marginal value of 
governance activities with: 


ASSUMPTION A6: The function w v, 
— €) is sub-modular. 


THEOREM 9: Assume that 41— 46 hold and 
consider the profit function: 


v(—p,m,q,-a,—b,—c,-d,—e,-r, 
—5,—w,T)—y(—v,— e). 
Let each decision variable be constrained as in 


Theorem 8. Then the set of optimizers of 
v — y is a sublattice and rises with v. 


PROOF: 
A direct consequence of Theorems 3, 5, 6, 
and 7, and A6. o 


Thus, given Assumptions A1—A6 another 
predicted attribute in the characteristic clus- 
ter for flexible manufacturing companies is 
low vertical governance, for example, the 
extensive use of independently owned sup- 
pliers and subcontractors. This characteristic 
is an especially interesting one, given the 
usual conception of the difference between 
internal and market organization. Although 
uncertainty is not formally part of our 
model,’ running this sort of “tight,” low 
inventory operation with frequent redesign- 
ing of products in a world of uncertainties 
would surely require close coordination and 
communications with suppliers. Yet accord- 
ing to our theory, the modern firm— despite 
its close relationships with suppliers and cus- 


"However, introducing uncertainty would cause no 
difficulties because the expectation of a supermodular 
function is supermodular. See Milgrom and Roberts, 
1989. 

*For a model of some aspects of this issue, see 
Milgrom and Roberts, 1988. In that model, inventories 
play a buffering role whose importance is reduced when 
communication is increased. 
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tomers— will have little formal vertical gov- 
ernance. . 

Economists sometimes emphasize the need 
for close communication in the presence of 
supply or demand uncertainty as a reason 
for vertical integration (for example, Ken- 
neth Arrow, 1975). If we were to formulate 
this alternative hypothesis using a submodu- 
lar governance cost function A(m,v), we 
would arrive at the conclusion that v in- 
creases over time and that more extensive 
vertical governance is part of the cluster of 
characteristics of a modern manufacturing 
firm. The anecdotal evidence contained in 
press reports suggests to us that this conclu- 
sion is wrong, and that the former hypothe- 
sis A6 is the better one. 


IV. Conclusion 


The cluster of characteristics that are of- 
ten found in manufacturing firms that are 
technologically advanced encompasses mar- 
keting, production, engineering, and organi- 
zation variables. On the marketing side, these 
firms hold down prices while emphasizing 
high quality supported by frequent product 
improvements. Customers orders are filled 
increasingly quickly, with back-order levels 
being systematically reduced. In terms of | 
technology, modern manufacturing firms ex- 
ploit rapid mass data communications, pro- 
duction equipment with low setup, wastage, 
and retooling costs, flexible design technolo- 
gies, product designs that use common in- 
puts, very low levels of inventories (of both 


. Work in process and finished goods), and 


short production cycle times. They also seem 
to push differentially to increase manufactur- 
ing quality and. simultaneously, to control 
variable production costs. At the engineering 
and organizational levels, there is an integra- 
tion of the product and process engineering 
functions and an extensive use of indepen- 
dently owned suppliers linked with the buy- 
ing firm by close communications and joint 
planning. 

We have argued in this paper that this 
clustering is no accident. Rather, it is a result 
of the adoption by profit-maximizing firms 
of a coherent business strategy that exploits 
complementarities, and the trend to adopt 
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this strategy is the result of identifiable 
changes in technology and demand. Our for- 
mal model includes eleven decision variables 
from the claimed cluster of complements 
plus a parameter to account for the passage 
of time. There are thus 66 potential cross 
effects among the twelve variables, and all of 
these are nonnegative: there are extensive 
complementarities in marketing, manufac- 
turing, engineering, design, and organization 
that make it profitable for a firm that adopts 
some of these characteristics to adopt more. 
We have also argued that the non-convexi- 
ties in the problem mitigate against any 
smooth distribution. of these characteristics 
among firms. For this reason, we are hopeful 
that empirical work will provide evidence of 
distinctly separated clusters of firm charac- 
teristics as support for our theory. Given our 
assumptions about time trends in prices, we 
also expect to find an increasing proportion 
of manufacturing firms adopting the modern 
manufacturing strategic cluster that we have 
described. 


APPENDIX 


THEOREM 2*: Let I=[a,,5,)X <+- X[a,, bp] be an 
interval in R” with nonempty interior and let f: 1 — R. 
Suppose that for every pair of arguments ij, there exists a 
function fij: I  R such that f is the indefinite integral of 
EH That is, for fixed xj, and for xj > x, and xj > xj, 


fti xi XN) + FOG Hy X01) 


at A CETTE ANTS hat A CTE SEE LOTD) 
xp exi : 
=f f CEN, dsdt 
xp tx, 


If each f,, is nonnegative, then f is supermodular on I. 


Remark 1: In our application, f is continuous on I and 
twice continuously differentiable on a set S with 
0?f/0x, 8x, 20 on S. Moreover, for all X. ; the set 
(I= S) (xix =X j} is a curve. So, taking fi; = 
8? f/üx, 8x, where defined and f,, — 0 elsewhere, Theo- 
rem 2* implies that f is supermodular. 


PROOF: 

In view of Theorem 1, it suffices to establish the 
conclusion for the case n = 2. Given any two unordered 
points x and x’ with, say, x, > xj and x$ > x;, 


f (max(x, x^)) + f (min(x, x) -f(x)- f(x) 


= f” (5,0) ddt > 0, 
xi Xz 
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from which it follows that f(x)+ f(x’) < f (max(x, x’)) 
+ f(min(x, x’)). a 
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Malthusian Selection of Preferences 


By INGEMAR HANSSON AND CHARLES STUART* 


We study natural selection of preferences using a golden-age model with endoge- 
nous population. In equilibrium, all agents have preferences with maximum 
biological fitness given resource constraints and total population is the maximum 
the environment can sustain. Naturally selected agents follow the golden rule, 
acting as if they maximize the undiscounted sum of per-capita felicities of current 
and future generations. Selected preferences and hence work, saving, consump- 
tion, and population density vary predictably with environmental differences. 


The preferences observed in living popula- 
tions may be partly the result of biological 
selection that took place during pre- 
industrial times. Consider, for. instance, life 
in a Malthusian environment, by which we 
mean an environment in which per-capita 
consumption drives population growth. Sup- 
pose the environment contains two families 
or “clans,” all members of the first prone by 
heritage to indolence and immediate con- 
sumption and all members of the second 
with stronger, heritable tendencies to work 
and save. With greater labor and capital, the 
second clan may have greater sustainable 
per-capita consumption and hence greater 
population growth. As time passes, this clan 
would make up an ever-increasing part of 
the total population. Put differently, the 
population would asymptotically be charac- 
terized by the strong preference to work and 
save. 

. We expand upon this idea in the current 
paper, first developing a model of equi- 
librium demographics in a pre-industrial 
Malthusian environment, then characterizing 
naturally selected preferences as the prefer- 
ences that maximize the sustainable number 
of offspring, finally illustrating how the anal- 
ysis can be used to study aspects of eco- 


` *University of Lund and University of California at 
Santa Barbara, respectively. Thanks are due to a referee 
and to George Borjas, Ulla Larsen, Góran Skogh, Larry 
Summers, Lars-Gunnar Svensson, and Jack Votey. Re- 
search support from the Bank of Sweden Tercentenary 
Foundation is gratefully acknowledged. 
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nomic development.! The specific focus is on 
selection of preferences for saving and labor 
supply. We stress that saving here is re- 
stricted to intergenerational saving, meaning 
sacrifice of consumption by one generation 
that benefits later (younger) generations. The 
most important forms of intergenerational 
saving are doubtless in-kind transfers of hu- 
man capital via bearing, support, upbring- 
ing, and education of children. Other forms 
include bequests and cash transfers from 
older individuals and government to younger 
individuals.? i 


Equilibrium models of population have been used 
by economic demographers—see, for example, Ronald 
Lee (1987). In broad terms, the evolution of behavior 
and of preferences is the subject of modern evolutionary 
biology—see Edward O. Wilson (1980) for a survey. 
Formal analysis of genetic and cultural mechanisms for 
the evolution of what economists might call preferences 
has been conducted by Charles Lumsden and Wilson 
(1981). Robert Boyd and Peter Richerson (1985) focus 
on cultural evolution of preferences. Studies of eco-. 
nomic behavior in a biological perspective include 
W. D. Hamilton (1964), John Maynard Smith (1982), 
Armen Alchian (1950), Gary Becker (1976), Jack Hirsh- 
leifer (1977), Paul Rubin and Chris Paul (1979), Howard 
Margolis (1982), and Robert Frank (1987). 

Intergenerational saving is fundamentally different 
from intragenerational saving (Franco Modigliani and 
Richard Brumberg, 1954), which includes pension annu- 
ities and other wealth accumulated early in an indi- 
vidual's life cycle and decumulated later in the same 
individual's life cycle. Empirical evidence suggests that 
intergenerational saving makes up a substantial portion 
of total saving. For instance, Laurence Kotlikoff and 
Lawrence Summers (1981) estimate that intergenera- 
tional saving may be as high as 80 percent of total 
saving. Modigliani (1986), on the other hand, places the 
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I. Concern for Own and Future 
Generations! Consumption 


A. Very-Long-Run Equilibrium 


We use a paradigm of non-intermarrying 
clans that each consists of a sequence of 
overlapping generations. A generation is eco- 
nomically active for only one time period 
and only one generation is active in any 
period.) Each generation in a given clan has 
the same behavioral tendencies, or in eco- 
nomic terms the same preferences, but dif- 
ferent clans may have different behavioral 
tendencies. This effectively treats preferences 
as stable in that they are passed down with- 
out mutation from generation to generation 
within a clan.* 

' We assume that the total population of all 
clans is bounded by what biologists call the 
carrying capacity of the environment. Carry- 
ing capacity captures the idea that there is a 
problem of the commons. To model the 
problem, we abstract from migration and 
treat the natural environment as a crowded 
public good that enters as an input in each 
clan's production function. We begin by 
characterizing the equilibrium population 
when carrying capacity is constant and then 
generalize to allow carrying capacity to 
change at an exogenously given rate of tech- 
nological growth, g. Denote the total popu- 
lation of all clans in period t by N,. Then by 
choice of units, the level of the crowded 
public good in each clan's production func- 
tion (the "effective" level of the crowded 
factor) is 1/N,. We assume that per-capita 
output net of depreciation in a specific.clan 
is a strictly concave, twice continuously dif- 
ferentiable, increasing function of the levels 
of the public good and of per-capita capital 











share at as low as 20 percent. Neither of these studies 
fully includes intergenerational saving in human capital, 
which may be substantial (Jacob Mincer, 1974; Arleen 
Leibowitz, 1974). 

^We model each generation as a representative agent, 
abstracting from free riding within a generation. 

^Passage of preferences by Mendelian genetics or by 
learning from parents are both compatible with the 
treatment here. 
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in the clan, k,. This output is q(1/ N,, k,), 
where q(-) is assumed stationary and invari- 
ant across clans. Because we wish to focus 
on the size of total population and not on 
the effective amount of the crowded fac- 
tor, we invert 1/ N, and work below with the 
function f(-) defined by f(N,, k) = 
q(1/ NN, k,). An increase in the total popula- 
tion of all clans increases crowding of the 
natural environment, which reduces per- 


` capita output, so fy «0. We assume that 


output equals zero if either input equals zero. 
The marginal product of capital, f,, is as- 
sumed to satisfy the Inada conditions f, — oo 
as k > 0 and f, — 0 as k >k « oo. We also 
impose “survivability restrictions" to ensure 
that total population in the model does not 
vanish. Specifically, we assume that for any 
positive given value of per-capita capital, 
per-capita output becomes infinite as total 
population goes to zero; formally, for k > 0, 
f(N, k) ^ oo as N — 0. (This restriction is 
stronger than needed and is weakened 
below.) 

© Because consumption is a biological ne- 
cessity, we assume.that greater per-capita 
consumption in a clan, c,, leads to greater 
fitness. We represent the necessity formally 
by writing the rate of population growth 
from period ¢ to t+1 in a clan as a twice 
continuously differentiable, stationary func- 
tion p(c), with —1«xp(0) «0, p(c,)>0 
for some c, < oo, and partial derivatives p, > 
0 and p,, x 0. An interpretation in the spirit 
of Malthus is that greater average consump- 
tion in a clan results. in better nutrition 
and protection from environmental hazards, 
which raises the average number of children 
that survive to bear offspring. 


5The fitness of a trait may be defined as the relative 
rate of population growth of bearers given the selective 
pressures of the environment. 

If preferences are stable across generations, then 
naturally selected preferences must reflect the effects of 
environmental pressures over generations so selection 
prior to the demographic transitions associated with 
industrialization would partly explain preferences to- 
day. The Malthusian assumption that fitness rises in 
per-capita consumption is intended to reflect conditions 
during much of this long stretch of economic history. 
(To the extent that greater per-capita consumption con- 
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In any period, the active generation in a 
given clan splits per-capita output between 
consumption and investment. The split satis- 
fies 


(1) f(N, k;) 
Ed [ku (19 p(e))- k,| $ 


With the model specified, we characterize 
naturally selected behavior. We do so by 
studying a very-long-run equilibrium in 
which there is no tendency for the mix of 
behaviors in the population to change as a 
result of selection.” We treat this equilibrium 
as a steady state in which per-capita capital 
and consumption are identical across clans 
and constant and the total population of all 
clans is constant? To understand how our 
assumptions about crowding of the natural 
environment can lead to constant total popu- 
lation, note that with sufficiently great total 
population, the level of effective natural en- 


tributes to greater fitness in developed economies by 
improving nutrition and protection from the environ- 
ment, the current analysis also characterizes a force of 
selection in developed economies.) Note that except 
during the past 100-150 years in developed countries, 
wealthier individuals, who are individuals with greater 
total consumption, have indeed tended to have more 
children (Becker, 1981; George Boyer, 1989; also Lee, 
1987). More recently in developed countries, one some- 
times observes negative correlations between wealth and 
fertility, although the overall tendency may not be 
strong. To the extent that population growth has been 
relatively flat in per-capita consumption in post- 
industrial times, little selection of the sort modeled here 


may have taken place since industrialization, which- 


would mean that the (stable component of the) prefer- 
ences observed in today’s populations would largely be 
explained by pre-industrial selection. In any case, it is 
irrelevant given our desire to characterize the prefer- 
ences of currently living agents that household fertility 
and income may be inversely related today in developed 
countries: if selection is slow, selection today pre- 
dictably affects preferences only in the distant future. 

"Traditionally, long-run refers to allocations that re- 
flect full adjustment of factors and outputs to given 
tastes and technology. We use very-long-run to reflect 
full selection of preferences in addition to adjustment of 
factors and outputs. 

®The total population of all clans might be written as 
the product of the number of clans and the average size 
of a clan. Although this product is uniquely determined 
in equilibrium, the number or average size of clans are 
not. 
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vironment in eàch clan's production function 
(1/N,) would be low enough to imply levels 
of output and hence consumption that would 
make population growth negative. Similarly, 
sufficiently low total population would by 
the survivability restriction lead to great 
enough per-capita output and hence con- 
sumption, for any given, positive level of 
per-capita capital, to imply positive growth 
of total population. 

Three conditions determine the very-long- 
run equilibrium values of per-capita capital, 
per-capita consumption, and total popula- 
tion in the commons. First, because these 
three variables are constant in steady state, 
the constraint (1) becomes 


(2) f(N, k) 2 e kp(c), 


where un-subscripted variables denote 
steady-state values. Equation (2) is the stan- 
dard steady-state condition that constant 
per-capita capital requires. investment of 
kp(c) in order to compensate for population 
growth. ; 

Second, no clan can have p(c)>0 in 
steady state and all clans that do not vanish 
asymptotically must have precisely zero 
growth, so the equilibrium per-capita con- 
sumption of any clan is uniquely determined 
by 


(3) p(c) =0. 


We refer to (3) as the equilibrium condition 
for a Malthusian population in an environ- 
ment with constant carrying capacity: it re- 
quires per-capita consumption at the level 
that just keeps total population constant. 

. Third, because very-long-run equilibrium 
entails full selection of behavior, we require 
that selected behavior have maximum fit- 
ness. This requirement ensures that equilib- 
rium is locally stable in that the equilibrium 
population cannot be invaded successfully 
by a small clan with other feasible behavior 
that would induce the invader to grow faster 
than the clans in the equilibrium.’ The fit- 


?Thus equilibrium behavior is an evolutionarily stable 
strategy in the sense of Maynard Smith (1982). 
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ness of a trait or the relative rate of popula- 
tion growth of bearers of the trait due to the 
selective pressures of the environment is fully 
indexed by the absolute rate of population 
growth of bearers, or, via p(-), by the per- 
capita consumption of bearers. The condi- 
tion for maximum fitness is thus the con- 
dition for maximum induced per-capita 
consumption. To derive the condition, note 
that for a given size of total population, (2) 
determines a clan’s per-capita consumption 
as a differentiable, concave function of its 
level of per-capita capital. Thus treat c as a 
function of k, differentiate (2) with respect 
to k, and set dc/dk = 0 to obtain 


(4) f. (NS k) = p(c), 


which is the golden rule. The simple but 
important conclusion: in an environment in 
which per-capita consumption drives popu- 
lation growth, selection implies a tendency 
toward golden-rule saving. 
Conditions (2)-(4) determine unique very- 
long-run equilibrium values of per-capita 
capital, per-capita consumption, and total 
population in the commons. In equilibrium, 
all agents behave identically and total popu- 
lation is the maximum the environment can 
sustain in steady state (carrying capacity). 
To see that total population equals carrying 
capacity, note first that (3) yields the 
steady-state level of per-capita consumption 
consistent with zero population growth. Sub- 
stitute this zero-growth consumption level 
into the steady-state condition (2), obtaining 
an equilibrium condition in N and k. By 
taking k as given and solving the condition 
for N, one can view the condition as deter- 
mining the total population supported by 
the environment in steady state as a function 
of per-capita capital. Differentiate the ex- 
pression with respect to k and set dN/dk — 0 
to characterize the behavior that maximizes 
total sustainable population. The result is 


(4). Thus maximization of fitness implies: 


maximization of total sustainable population 
here.!? i 


This means that equilibrium in our-model is one of 
pure “K-selection,” defined as selection that favors 
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We denote equilibrium values with stars 
and illustrate the relationships between k*, 
c*, and N* in Figure 1. The figure corre- 
sponds to a numerical example in which 
p(c) 7 c5 1 and f(N,k)-— (k/N)95 — 
0.5k +0.5. The solid peaked curve indicates 
the relationship between per-capita capital 
and the population growth rate (fitness) in a 
clan when the total population of all clans 
equals one. The equation for this curve is 
derived generally, which is to say for an 
arbitrary level of total population, by using 
the steady-state condition (2) and the popu- 
lation growth function to find the level of 
steady-state consumption and hence the rate 
of population growth consistent with a given 
level of per-capita capital. The broken curve 
is derived from the same equation except 
that total population is lower (N = 0.96) and 
hence steady-state per-capita consumption 
and population growth are greater for each 
value of steady-state capital. At a sufficiently 
low population level, the survivability re- 
quirement ensures that f( N, k) > 0. This oc- 
curs on the broken curve at points above 
zero. Such a low level of total population 
cannot be part of a very-long-run equilib- 
rium, however, because clans that choose k 
at or close to the golden rule given by (4) 
would have p(c)>0 and hence would al- 
ways grow at a positive rate even though the 
capacity of the environment to support pop- 
ulation is limited and constant. Thus if N — 





efficient use of resources at population levels at or near 
the maximum sustainable level (Wilson, 1980, pp. 
39—49). Note that “K-selection” does not necessarily 
favor large numbers of offspring per individual. Selec- 
tion that favors large numbers of offspring per individ- 
ual is referred to as “r-selection,” which typically de- 
Scribes evolutionary pressures in shifting habitats in 
which only a small share of total offspring find re- 
Sources needed to survive and reproduce. In man, r- 
selection might characterize some aspects of a “frontier 
environment" in which total population is far below the 
maximum sustainzble level. The equilibrium in our 
model could be converted to one of (deterministic) 
r-selection by remcving feedbacks whereby increases in 
population are self-limiting; formally, this could be 
done by deleting total population from the model and 
by deleting the Malthusian condition (3). Equilibrium 
would then be characterized by two equations, (2) and 
(4), in two unknowns, k and c. 
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Population Growth 
` inaClan, pc) 
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Per-Capita Capital 
in a Clan, k 


Total Population 
of All Clans (N) = .96 


Tota! Population 
of All Clans (N) = 1.0 


FIGURE 1 


0.96 in the example, total population would 
tend to rise. The unique very-long-run equi- 
librium occurs in the example at N*—1 
(solid curve) and k*=1. That is, because 
p(c*)=0 in equilibrium, we have c*-—1 
in this case. The steady-state condition is 
then f —c* or (k*/N*)95 —0.5k* -- 0.5 =1, 
and the golden rule is f, = p(c*) or 
Q.S(k*N*) 95 —0,5 — 0, which together im- 
ply N* —1 and k* —1. Because no greater 
value of N is possible in steady state, total 
population equals the carrying capacity of 
the environment. 

Very-long-run equilibrium may be com- 
pared to the traditional notion of long-run 
competitive equilibrium. Traditional long- 
run equilibrium is characterized by profit 
maximization and adjustment of the popula- 
tion of firms so that equilibrium profits equal 
zero, Here, very-long-run equilibrium in an 
environment without growth is characterized 
by population-growth (fitness) maximization 
and adjustment of total population so that 
equilibrium population growth equals zero. 
As traditional long-run equilibrium suggests 
the direction of market adjustment at any 
point of time, one may interpret very-long- 


- run equilibrium as suggesting the direction 


of selection of behavior at a point of time. 
It is straightforward to generalize the anal- 
ysis to situations in which the carrying ca- 
pacity of the environment grows at rates 
other than zero. Specifically, we denote 
growth-deflated population, N,/(1+ g)‘, by 
M, and we take per-capita output in a given 
clan to be f( M,, k,). Writing production as 
a function of growth-deflated population in 
this way captures the idea that population is 
limited by carrying capacity in that, in steady 
state, per-capita output and capital are con- 
stant so growth-deflated population, M,= 
N,/(1-- g)', must be constant, which is to 
say that total population must grow at the 
rate of increase in the carrying capacity of 
the environment. We continue to assume 
that crowding reduces per-capita output, or 
fy < 9, that output equals zero if either input 
equals zero, and that f(-) satisfies the Inada 
conditions f, oo as k —0 and f, -0 as 
k — k € co. To ensure that total population 


That is, we assume that growth is purely “ natural- 
environment-augmenting.” 
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Population Growth 
in a Clan, p(c) 
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Per-Capita Capital 
in a Clan, k 


Growth-Deflated 
Total Population 
of All Clans = M* 





FiGURE 2 


not vanish asymptotically, we assume that- 


the level of consumption defined by p(c*) — 
g is finite and technologically feasible in that 
there are values M » 0 and k » 0 such that 
f(M, k)- c* — kg — 0. 

To characterize the behavior selected in 
the very-long-run in nature, the Seay ste 
resource constraint is now 


(5) f(M, k) =c+kp(6). 


Further, because the carrying cabadis of the 
environment now grows at fate g, all clans 
that do not vanish asymptotically inust grow 
at rate g. Thus per-capita steady-state con- 
sumption is uniquely determined by 


(0: ple) = 8, 


instead of by the earlier condition that p(c) 
= Q0. An implication of (6) is that per-capita 
consumption in steady state rises with the 
rate of technological. growth. Thus if one 
defines subsistence consumption as the level 
of per-capita consumption that would leave 
the population of a clan unchanged, which is 
the root of p(c)- 0, then positive techno- 
logical growth would cause equilibrium con- 


sumption to exceed subsistence consump- 


tion. 

Equation (4), which specifies that maxi- 
mum fitness is achieved under the golden 
rule, continues to hold when g is not con- 
strained to equal zero. Thus conditions 


(4)-(6) now determine unique very-long-run 
equilibrium values of capital, consumption, 
and growth-deflated total population. The 
equilibrium is illustrated in Figure 2, which 
is similar to Figure 1 except that the peak of 
the steady-state relationship between capital 
and the implied level of population growth 
has height g. Here, very-long-run equilib- 
rium is characterized by population-growth 
maximization and adjustment of total popu- 
lation so that total population equals the 
carrying capacity of the environment at each 
point of time. (Thus the rate of population 
growth equals the rate of increase in carrying 
capacity. ) 


B. Representing Selected Behavior 
with Preferences 


The very-long-run equilibrium  charac- 
terized above describes naturally selected 
behavior. Economists generally represent be- 
havior as the outcome of constrained opti- 
mization. Accordingly, we now represent 
naturally selected behavior as the outcome 
of constrained optimization and we charac- 
terize the preferences that are consistent with 
very-long-run equilibrium. There are many 
possible representations or "specifications." 
Although behavior in very-long-run equilib- 


-rium can be captured with simpler specifica- 


tions, it is nonetheless useful to adhere to a 
traditional formulation in which generation 
z selects a consumption sequence that maxi- 
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mizes the discounted sum of the clan’s cur- 
rent and future felicities to an infinite time 
horizon. One may think of this specification 
as a purely intergenerational rendering of 
Robert Barro's (1974) dynasty model? The 
time horizon is assumed infinite because 
very-long-run equilibrium is a steady state. 
It turns out that the specific selection pro- 
cess posited here imposes restrictions on the 
felicity discount rate but does not uniquely 
determine the shape of the felicity function. 
In searching for the population-growth-max- 
imizing discount rate, we therefore do not 
wish to restrict the discount rate to a range 
for which the discounted sum of an infinite 
sequence of felicities is finite. We thus as- 
sume a constant planning horizon of T peri- 
ods, which is the same for. all generations, 
and we interpret behavior over an infinite 
horizon as behavior when T goes to infinity. 

In more detail, the contribution to genera- 
tion z’s utility from consumption in period 
t >z is the discounted felicity of per-capita 
consumption in ¢ times the ratio of the 
population in ¢ to that in z, 


(1) «C [i 2e] ple] 


ep G-Q/a) 7 


where felicity is increasing, concave, and 
twice continuously differentiable with u,>0 
and u, x 0, and where ô is the rate at which 
a generation discounts the felicity of con- 
sumption of the clan's future generations. A 
higher discount rate 6 means that the cur- 
rent generation is more selfish or, pro- 
saically, that parents show less love for 
progeny. It should be stressed that the analy- 
sis here is intergenerational and not intra- 
generational, so à is the geometric rate of 
generation preference and not the rate at 
which an individual might discount own fu- 
ture consumption. 

To see the restrictions selection imposes 
on utility, we first characterize a clans be- 


Andrew Abel and Kotlikoff (1988) provide empiri- 
cal support for the Barro model. 
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havior given a utility function with arbitrary 
felicity and an arbitrary discount rate. For a 
given finite time-horizon, the first-best. allo- 
cation for generation z maximizes ` 


(8 U,=u(e,) + ules) 


(2), 


1-8 


140 


T 2; u(c,) II 


z+T nu 

tz v=z : 
subject to budget constraints (1) and an ini- 
tial value of k,. The first-order conditions 
that characterize the clan’s behavior are 


ye Qu ap ztT ' pol 
9p oe areal s utc, 
e ez E we) T, 


IM jo» 2 | 


[122 (14 p(c,) 
I8] 
ap 
| -afit kage] 0 
zxtzzT, 


(0) AAA +1)- Aa + p(c,.1)) -0, 
ztlztzxz-T, 


where the A, are Lagrange multipliers. 

To represent selected behavior in the 
very-long-run, we find the preference param- 
eter 6 that solves (9) and (10) in very-long- 
run equilibrium when T — oo. Because each 
generation would want the same conditions 
to be satisfied in steady state, the equilib- 
rium is time-consistent when the generations 


D We define IIZ1(-) =1. 
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in the clan act sequentially. To proceed, take 
(9) for two adjacent periods, apply (10), and 
evaluate the resulting expression at k, — k*, 


c = c* and M, = M* for all t: 
g ĝu ap z+T 
eee epleue * 
a ðc ðc [Eso 
MM 
(= 113 "Mass 
ap 





14 f, 
an De 96 


PO ea) 





vot+2- 


| u(c*) 


It is easy to show that the discount rate that . 


characterizes equilibrium behavior cannot be 
less than p(c*). To see this, suppose to the 
contrary that ô< p(c*). Then as T — oo, 
(11) can only hold if f, — oo, which requires 
that Kk — 0. Because zero capital does not 
characterize fitness-maximizing behavior in 
very-long-run equilibrium, it must be that 
ô z p(c*) In this case when 7'— oo, (11) 
implies 


(12) 8 — f. ( M*, k*). 


` Equation (12) is based solely on the as- 
- sumption that one can represent the behav- 
ior of a clan with positive steady-state capi- 
tal as the outcome of maximizing (8). Under 
this representation, (12) says that one views 
the clan as accumulating capital to the point 
where capital's marginal product equals the 
clan's intergenerational discount rate. Con- 
sequently, if one observes a clan in a steady 
state with k = k* > 0 and M = M*, then the 
implied (or “indirectly observed") value of 
the discount rate. must be precisely the 
marginal product of capital evaluated at the 
equilibrium values of total population and 
per-capita capital. Because a very-long-run 
equilibrium is a steady state, (12) also char- 
acterizes the naturally selected discount rate 
in very-long-run equilibrium when clans are 
seen as maximizing (8). Combining (12) with 
the golden rule (4), which is the condition 
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for maximum fitness, we have the result that 
the naturally selected value of the discount 
rate equals the rate of population growth in 
vecy-lonesrun equilibrium: 


(13) = p(c*). 


That is, discounting by 6* just balances the 
increasing weights that population growth 
places on the felicities of successive future 
per-capita consumption levels, so U, is the 
simple sum u(c,)+--- t u(c,, p) to a dis- 
tant horizon 7. The intuition is that maxi- 
mum fitness requires maximum steady-state 
per-capita consumption, which in turn re- 
quires that no generation be weighted less 
than any other generation in the utility of 
the economically active generation.!^ Clearly, 
this reflects substantial concern about the ` 
consumption of future generations. 


C. Implications and Discussion 


First and foremost, the analysis predicts 
that agents sacrifice substantial own con- 
sumption to provide for consumption of 
progeny. The prediction is consistent with 
observed sacrifices in the form of bearing, 
support, upbringing, and education of chil- 
dren, other transfers from older to younger 
individuals, and bequests. Such agreement 
between prediction and observation suggests 
that the analysis captures some of.the forces 
that have shaped naturally selected prefer- 
ences. . 

More precisely and more usefully, the 
analysis predicts that the value of the pure 
rate of generation preference (5) can be as- 
certained directly as the rate of population 
growth over a long past. To illustrate, con- 
sider the European countries, which have 
relatively better historical demographic data 
than do the LDCs and which have had rela- 
tively less migration than developed former 
European colonies such as the United States. 
(The analysis above abstracts from migra- 


14With ô> p(c*), a suboptimality would arise as 
agents would attempt to shift resources to themselves 
from future generations by saving too little or by insti- 
tuting pay-as-you-go Social ERU, (Hansson and Stu- 
art, 1989). 
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tion, but one might as an approximation 
extrapolate values of ô from European coun- 
tries to the United States because many U.S. 
immigrants came from Europe.) Good de- 
mographic-data before about 1500 are scant 
for any country; a baseline might be Ansley 
Coale’s (1974) estimate that the world’s 
population grew at an estimated annual rate 
of 0.056 percent from the time of Christ to 
1750. Population growth rates have been 
greater, however, in Europe in recent cen- 
turies. For a sampling, the yearly growth 
rate in Germany from 1816 to 1910 based on 
census data was 1.02 percent (John Knodel, 
1974). German population growth was re- 
strained by net emigration, but during 
1871-1910, a period of relatively great emi- 
gration, "external migration reduced the 
population growth that would have accrued 
from natural increase alone by only 10 per- 
cent" (Knodel, p. 192). In England, yearly 
population growth occurred at a 0.35 percent 
rate between 1541 and 1751 and at a 1.11 
percent rate between 1751 and 1871; net 
emigration rates over the period were gener- 
ally.less than 0.2 percent/year (E. A. Wrigley 
and R. S. Schofield, 1981). Population growth 
rates in France and Sweden from the mid- 
1700s to 1871 were typically somewhat lower 
than in England (Wrigley and Schofield, pp. 
213-14). Thus the predicted value of ô in 
living Western populations might lie be- 
tween perhaps zero and a few percent per 
year, or between perhaps zero and 1.0 per 
generation. ; 

To derive further implications, one may 
think of the marginal product of capital in 
the clan's production function ( f.) as its real 
interest rate. Thus because (4) characterizes 
maximum fitness in any steady state given 
only that fitness increases in per-capita con- 
sumption and because (12) holds in any 
steady state in which behavior is represented 
as the outcome of maximizing (8), the analy- 
sis suggests that selection may act generally 
to drive real interest rates to equality with 
the rate of population growth.? Thus in 


'SThat is, equality of interest and population growth 
rates does not require the Malthusian condition (6) that 
population growth equal the growth in carrying capac- 
ity. 
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steady state, the analysis views the real inter- 
est rate as based on productivity and not on 
time preference, because selection causes 
time preference to conform to productivity. 
Still more fundamentally, real interest rates 
today may have been determined by the 
growth of carrying capacity in pre-industrial 
times, because population growth conforms 
to the growth in carrying capacity in very- 
long-run Malthusian equilibrium and be- 
cause this equilibrium is intended to charac- 
terize current preferences as the preferences 
selected over a relatively long past that in- 
cludes pre-industrial times. 

The analysis thus permits.a simple predic- 
tion about real interest rates in modern times. 
From (12), the real interest rate equals the 
discount rate 6 in equilibrium. We argued 
above that ô may lie between zero and a few 
percent per year. Today's real interest rate 
should therefore also lie between zero and a 
few percent. Modern data on real interest 
rates do not reject this prediction (see, for 
example, John Huizinga and Frederic 
Mishkin, 1983), but we emphasize that the 
current analysis neglects a large number of 
factors that could potentially influence the 
prediction. For instance, the analysis here 
abstracts from the effects of government pol- 
icy and international trade. 

The essence of the story of selection in 
this section is the hypothesis that among 
different saving behaviors (preferences), fit- 
ness is greatest for the saving behavior that 
leads to the greatest sustained level of per- 
capita consumption. That the predictions of 
the analysis conform roughly to observed 
data suggests that there is predictive value to 
the hypothesis. As a first analysis, however, 
our formalization abstracts from several 
forces of natural selection that may have 
influenced the preferences of currently living 
agents. One important concern is the puzzle 
that, contrary to Malthus's famous predic- 
tion, rates of population growth have persis- 
tently been lower than rates of growth in 
carrying capacity in industrialized econo- 
mies. Apparently, demographic transitions 
in Western countries have involved “ volun- 
tary” reductions in the number of children 
per household. It is interesting to speculate 
on how natural selection could lead to vol- 
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untary reductions in fertility at all, of course, 
because maximum relative population 
growth of a trait (fitness) would seem to 
require that carriers of the trait reproduce to 
the maximum sustainable extent allowed by 
available resources, S. Here i is a good purae for 
future research.!é 

A second set of concerns is that some 


might argue that humans do not exhibit care. 


for progeny to the extent that behavior. is 
best described as maximization of the undis- 
counted sum of per-capita felicities to an 
unbounded horizon; that is, the current 
analysis may predict too little effective dis- 
counting of future generations. Note, how- 
ever, that the current treatment abstracts 
from several real-world forces that might 
raise effective discounting of future genera- 
tions, such as free riding within clans, ran- 
dom hazards in the environment, and expen- 
diture of resources on war and defense. (In 
our analysis, interactions among clans are 
limited to negative “crowding” externalities 
in production.) 

A third concern is that we do not under- 
take-rigorous, global stability analysis. To do 
so carefully would involve the complication 
of specifying dynamics out of equilibrium 
when the population consists of coexisting 
. clans that grow at different rates. It is easy 
to show, however, that if all clans were to set 
the same level of capital, total population 
would eventually converge to some level, 
and if- all clans set k = k*, growth-deflated 
total population would converge to M*. Sim- 
ilarly, if M = M*, clans that maintain con- 
stant per-capita capital of k* would grow 
faster than clans that maintain any other 
levels of per-capita capital, so non-k* clans 
would be outcompeted asymptotically ‘by 
clans that set k = k*. 

A fourth concern is that there are many 
alternatives to (8), which specifies prefer- 


analysts who take preferences as given have ex- 
plained “voluntary” reductions in fertility as a response 
to greater costs of having children (Mincer, 1963; Robert 
Willis, 1973) or as related to increases in child quality 
(Becker and H. Gregg Lewis, 1973). There is doubtless 
truth in both explanations, but neither deals with the 
deeper issue of how voluntary reductions could be con- 
sistent with natural selection of preferences. 
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ences as separable and Barro-dynastic. Noth- 
ing assures that the specification (8) of the 
form of concern for future consumption is 
correct. Alternatively, we could have speci- 
fied generation z's optimization as one of 
maximizing U(c,,k,) with U = —oo if k,* a 
and U > — oo if k, =a. Malthusian selection 
of the parameter a would lead to a= k* in 
very-long-run equilibrium so generation z 
would always leave a bequest of k*. This 
alternative specification would predict the 
same tendency to save but would admit less 
flexibility of behavior than (8).!? 

A fifth coricern is that the. current analy- 
sis, like much of the work in evolutionary 
game theory, is “ecological.” In particular, 
we assume that there is no mutation, no 
intermarriage, that no biological constraints 
restrict a clan's discount rate, and that, be- 
fore very-long-run adjustments take place, 
there. are many clans and a nondegenerate . 
distribution of discount rates that includes 
6*. Perhaps the most important issue with 
the ecological approach here involves the 
assumptions that preferences are stable 
across generations and free of mutation. 
These assumptions may not describe selec- 
tion: of behavior by “cultural” processes in- 
volving innovation and imitation of other 
clans, which may be.important in industrial- 
ized economies.!? ; 


Thus the current analysis does not predict the 
flexibility or sensitivity of saving behavior to changes in 
economic conditions. To predict this sensitivity, it would 
be necessary to cheracterize naturally selected behavior 
in an environment in which economic conditions change. 
A related point is that even if population growth is 
assumed strictly concave in per-capita consumption, the 
selection process here does not restrict felicity to be 
strictly concave in c or “risk averse.” This is because we 
abstract from randcmness in the environment. With 
such randomness, selection may determine the curva- 
ture of felicity. 

18Such cultural selection may operate on time-frames 
of less than a single generation. Because we wish to 
study first the case. of stable preferences, which is the 
traditional assumption of choice among economists, 
and because elements of preferences selected by rapid 
cultural processes may be relatively less stable biologi- 
cally, we have abstracted from rapid cultural selection 
here. On the other hand, if agents imitate others who 
have greater per-capita consumption, then the results of 
the.current analysis could be strengthened in that con- 
vergence to very-long-run equilibrium could be faster. 
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Before extending and applying the analy- 
sis in the next section, we note that the 
results to this point illustrate a fundamental 
property of naturally selected preferences: in 
equilibrium, marginal rates of substitution 
between variables that affect fitness equal 
marginal rates of substitution between vari- 
ables in utility. A general demonstration of 
this property is in Section III. In the case of 
intergenerational saving, impacts on a clan’s 
fitness may be taken as impacts on the clan’s 
population in a future period. The popula- 
tion in future period x is 


x-1 
TT [1+ p(c,)] times the population 
U-—z 


in period z. 


For x » t 1, the ratio of the derivatives of 
population with respect to c, and c,,,, which 
is the marginal rate of substitution between 
c, and c,,, as a determinant of population 
in period x, equals one in equilibrium be- 
cause c, = c,,,. From (13), the marginal rate 
of substitution between c, and c,,, in the 
behavioral utility function, (89U,/0c,)/ 
(dU, /8c,, 4), also equals one in equilibrium. 


II. Preferences to Work and Save, and an 
Application to Development Economics 


A. Overview 


We now extend the selection model to 
include a consumption-leisure tradeoff and 
we illustrate the usefulness of the approach 
by applying it to study the effects of the 
environment on very-long-run equilibrium. 
The analysis bears on Toynbee's (1956) 
"challenge and response" hypothesis, which 
explains economic development as a re- 
sponse to environmental challenge. Toynbee 
lists a number of historical examples of eco- 
nomic development in response to challenges 
of “hard countries," “new ground,” “blows,” 
(external) “pressures,” and “penalizations,” 
which include phenomena such as discrimi- 
nation. The generality of Toynbee’s hypothe- 
sis is controversial but can be studied using 
tools developed here. We consider a specific 
case, namely, that measured (material) eco- 
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nomic development tends to rise and popu- 
lation density tends to fall with distance 
from the equator. Quite simply, the “chal- 
lenge" in this case may be that surviving 
harsh winters requires shelter, clothing, se- 
cure storage facilities, and other material 
commodities not needed in the tropics. We 
capture this type of environmental challenge 
or “harshness” as a shift parameter in the 
fitness function p(-) and show that in- 
creases in it may lead to lower population 
density and greater per-capita labor, capital, 
and consumption in very-long-run equilib- 
rium. Thus aspects of economic development 
máy be explained by biological selection of 
economic behavior. 


B. Extending the Analysis 


Nature is probably too economical to give 
us a strong taste for something that does not 
raise fitness. In thinking about man's taste 
for leisure, for instance, one might argue that 
leisure contributes to fitness by providing 
valuable regeneration of the body, defense, 
exercise, education, social coordination, and 
conservation of calories. We summarize these. 
benefits of leisure by including per-capita 
leisure (7) as an argument in the clan's popu- 
lation growth function with p,» 0. We also 
include an environmental shift parameter, f 
(for bad), such that in a neighborhood of an 
initial very-long-run equilibrium, increased 
B corresponds to a harsher environment and 
means that population growth is lower, pg « 

0. Because output net of capital formation 
can be “consumed” as shelter, clothing, and 
storage, we assume that greater 8 increases 
the marginal fitness of consumption relative 
to that of leisure. To capture this simply, we 
assume that p,, 7 0 and pig 0. We include 
per-capita labor, h,21-— l, as an additional 
argument in production and abstract from 
influences of the environment on production 
so per-capita output in a clan is f( M, kp h,). 
We maintain our earlier assumptions about 
f(-) and assume that an increase in any 
input, including the effective among of natu- 
ral environment, raises the marginal prod- 
ucts of the other two inputs. Because a rise 
in population increases crowding of the nat- 
ural environment, which reduces f, and f,, 
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we have fyg <0 and fu, « 0. To allow be- 
havior to adjust via selection to environmen- 
tal conditions, we include /, and f as argu- 
ments in u(-). Bs 

Once again we wish to characterize the 
behavior that maximizes fitness in steady 
state and hence is selected in nature in the 
very long run. To derive the conditions for 
maximum steady-state population growth, 
form the Lagrangian for optimization of 
P(e, 4, B) Sabe to the ia oiae condi- 
tion - 


(14) c= f(M,k, h) - kp(c,1,B), 


with total population and hence M taken as 
given. The first-order conditions for fitness- 
maximizing behavior simplify to 





(15) f= P(c,1,B), 

1 MF P 1 
16 = eee 
2o fat kpi dur. 


plus (14). The golden rule 2 (15) 3 is unchanged 
from the previous section. Condition (16) 
characterizes the fitness-maximizing mix of 
` consumption and leisure. In addition to 
(14)-(16), the growth rate of total popula- 
tion equals the growth rate of the carrying 
capacity of the environment in very-long-run 
equilibrium: 


07) p (o, B) =g. 


For given £, the four conditions (14)-(17) 
determine the very-long-run equilibrium val- 
ues k*, [*, c*, and M*. 

Generation z is now represented as maxi- 
mizing 


ZEF 


(18) U- 2 (ess B) 


c 1+ plenta BI 
«T [ERA 


paz 


As before, the equilibrium solution is consis- 
tent with maximization of (18) if and only if 


(19) Geass 
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Further, straightforward differentiation of 
(18) reveals that at c* and /*: : 





TE 9U/8c, - u+ pÉL/ Q8) 
9U/0l u+ pl, /(1+8)’ 
where | 
2 l HET 
Qa ues y 
væt+1.- 
GF 


In the limit when T — oo, we have Ü — 00, 
so the marginal rate of substitution between 
consumption and leisure in naturally se- 
lected preferences equals the marginal rate . 
of substitution in fitness,. p./p,. As before, 
the marginal rate of substitution between c, 
and c,,, in selected preferences equals the 
marginal rate of substitution between c, and 
C,41 8$ a determinant of population. 


C. Application:to Development Economics. 


To see how changes in the environment 
influence very-long-run behavior, we per- 
form comparative statics on the equilibrium 
conditions (14)-(17). For brevity, we report - 
the results given that p(-) is linear in c and 
l, or specifically that p(-) is of the form 

















e(B)c + e(B) + p(B): 
w ue 
+| Sunt ealz 


oo de. dM dh : 
(24) ap fag Pag? 
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where D= —[fufyy (fis) feg > 0. be- 
cause f(-) is concave. Equation (21) shows 
that an increase in environmental harshness 
as it is defined here reduces equilibrium pop- 
ulation density. From equation (22), labor 
rises with environmental harshness because 
the relative value of consumption rises ( p,s 
> 0) and because the reduction in popula- 
tion density raises the marginal product of 
labor both directly (second term) and indi- 
rectly via increased. capital (third term). 
Equation (23) verifies that capital also rises 
as a result of greater marginal productivity. 
Finally, equation (24) shows that per-capita 
consumption increases because population 
declines and per-capita labor rises. (The in- 
crease in per-capita capital has no impact on 
per-capita consumption at golden-rule sav- 


ing.) 


Remark 1: In broad terms, the method here 
allows classification of how the properties of 
the environment affect naturally selected 
preferences and thereby influence directly 
measurable magnitudes such as population 
density, capital, leisure, and material con- 
sumption. The calculations we report are 
an abbreviated illustration of the general 
method, however, as we impose specific re- 
strictions on p(*) and f(-). That is, we 
assume that p(-) is linear in c and /, that 
per-capita output, f(-), does not depend di- 
rectly on the environment (8), and that sec- 
ond-order cross derivatives among factors, 
including the effective amount of natural 
environment, are positive. Relaxing these as- 
sumptions would expand the number of ef- 
fects encompassed by the analysis and make 
the signs of (21)—-(24) ambiguous. If a harsher 
environment were to decrease output for 
given- inputs, for instance, the tendencies in 
equations (21)-(24) might be strengthened. 
On the other hand, one can readily verify 
that if a harsher environment reduces the 
marginal product of capital greatly but 
changes the average product only slightly, 
capital may decline with environmental 
harshness. Thus generally, the validity of 
Toynbee's challenge and response hypothesis 
depends on how an increase in environmen- 
tal challenge is assumed to influence fitness 
and production. 


HANSSON AND STUART: SELECTION OF PREFERENCES 541 


Remark 2: An interpretation of the analysis 
in this section is that selection in an agricul- 
tural environment with harsh winters may 
lead to selection of preferences that sustain 
an equilibrium with low population: density 
and high per-capita capital, labor, and con- 
sumption. As suggestive examples, selected 
behavior (preferences) in such an environ- 
ment might reflect strong ethics to work, to 
accumulate equipment and structures, to 
avoid depleting land, and to devote re- 
sources to upbringing and education of chil- 
dren.??* [f preferences that support high 
labor and capital are otherwise difficult to 
generate and if such preferences aid industri- 
alization, then the analysis helps explain why 
industrialized countries are overrepresented 
in regions with harsh winters.) An addi- 


19Cross-section evidence on OECD economies 
(Hansson and Stuart, 1988) suggests that the taste for 
labor is indeed greater in Northern Europe than in 
Southern Europe. . 

In this spirit, one might expect greater educational 
expenditures in more northerly (harsher) areas. Rough 
data in UNESCO (1986, Table 4.1) indicate generally 
greater public expenditures on education as a percent- 
age of GNP in Northern than Southern Europe. In 
1975, for instance, these percentages were 7.8 in Den- 
mark, 6.5 in Finland, 7.1 in Norway, and 7.1 in Sweden, 
but only 2.0 in Greece, 4.5 in Italy, 4.0 in Portugal, and 
2.1 (in 1976) in Spain. (Because GNP per-capita is 
generally greater in Northern than in Southern Europe, 
the differences in educational expenditures as a share of 
GNP reported here understate differences in per-capita 
educational expenditures. Data on direct general ex- 
penditures per capita by state and local governments in 
the United States for 1983-1984 (U.S. Department of 
Education, 1987, Table 25) provide a similar but per- 
haps weaker pattern. Counting as northern states those 
states that bound Canada or the Great Lakes (except 
for Indiana and Illinois, which bound southern Lake 
Michigan) and counting as southern states those states 
that bound Mexico or the Caribbean plus Hawaii, one 
calculates average per-capita educational expenditures 
across northern states of $901 and average expenditures 
across southern states of $728. One should probably 
interpret the latter numbers with caution, however, in 
part because it is unclear how long selection has been 
operating across the U.S. states. 

? Two thousand years ago, the Mediterranean coun- 
tries were the most developed in the Western hemi- 
sphere and Northern Europe was undeveloped. To make 
this consistent with the theory here, one would allow 
regional differences in production. The current treat- 
ment abstracts from such differences in that we assume 
that 8 does not directly influence per-capita output. 
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tional part of the picture, which is not incor- 
porated into our model, is that greater per- 
capita capital (including human capital) may 
feed back positively on the rate of technolog- 
ical growth (Robert Lucas, 1985; Paul 
Romer, 1986). With such a feedback, envi- 
ronmental differences that lead to differ- 
“ences in tastes for work and saving might 
ultimately explain persistent pena differ- 
ences in growth rates. 


Remark 3: George Stigler and Becker (1977) 
argue that it is methodologically productive 
to view tastes as not differing “importantly 
between people" (p. 76). In our analysis, 
preferences selected in a given environment 
are homogeneous, so our very-long-run equi- 
librium has the property desired by Stigler 
and Becker. Preferences in small countries 
with little recent immigration may thus be 
quite homogeneous. Preferences in “ melting- 
pot" countries such as the United States may 
be less homogeneous. 


Remark 4: In this application, preferences 
for sacrificing own consumption to benefit 
offspring and preferences for the mix of own 
consumption and leisure are determined by 
natural selection. However, the idea that 
natural selection determines behavior. repre- 
sented by economists as the outcome of con- 
strained optimization presumably has more 
general applicability. For instance, nothing 
restricts the theory here of sacrifice for 
progeny and work effort to Homo sapiens. 
To the extent that other species have shorter 
lives and ages of maturity, of course, selec- 
tion of their economic behavior may be more 
rapid. , 


III. A General Statement 


“A general statement of the principle un- 


derlying this paper is that in very-long-run . 


equilibrium, utility mirrors fitness in the 
sense that marginal rates of substitution in 
utility equal analogously defined marginal 
rates of substitution in fitness. To show this 
compactly, consider traits or preferences that 
determine the value of a vector x of behav- 
ioral variables that in turn determine fitness, 
p(x). An element x, of the vector can be 


4 
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interpreted as a commodity indexed by time 
and state of the world. (Note that x is of 
finite dimension here.) The individual or in- 
dividual clan faces an individual resource 
constraint of the form G(x, M) — Q, where 
M is total population deflated by the rate of 
growth of carrying capacity. An aggregate 
resource constraint H(x, M) —-0 captures 
limits to the carrying capacity of the envi- 
ronment. In the absence of biological con- 
straints on feasible traits, behavior in very- 
long-run equilibrium is characterized by 
maximization of p(x) subject to G(x, M) = 
0 when M is àeld constant at a value that 
satisfies H(x, M) = 0. Economists then rep- 
resent behavior as the choice of x that maxi- 
mizes U(x) subject to.G(x, M) = 0. Assum- 
ing differentiability and comparing the two 
m: of first-order conditions, it must be that 
U, /U, = p, / Py,» OT that in very-long-run 
equilibrium, ‘marginal rates of substitution in 
the naturally selected behavioral utility func- 
tion equal marginal rates of substitution in 
fitness written as a function of behavioral 
variables. Note that this result does not rely 
on an assumption that-agents actually maxi- 
mize utility. One may conveniently describe 
naturally selected behavior in very-long-run 
equilibrium as the outcome of utility maxi- 
mization even if individuals do not actually 
maximize utility. 


IY. Conclusion 


We use natural selection to explain behav- 
ior, or equivalently, preferences. We focus 
specifically on irtergenerational saving and 
on consumption-ieisure choice, using a neo- 
classical golden-age model in which popula- 
tion is endogenized through classical 
Malthusian assumptions about the economic 
determinants of population size. The model 
depicts a world in which different "clans" 
have different preferences and hence differ- 
ent relative rates of population growth or 
fitness. In the very-long-run, the economy 
rests in an equilibrium in which (1) 100 per- 
cent of the population have preferences with 
maximum fitness ziven resource constraints 
and (ii) total population equals the carrying 
capacity of the environment. A generic char- 
acteristic of equilibrium is that marginal rates 


VOL. 80 NO. 3 


of substitution in utility equal marginal rates 


of substitution in fitness written as a func- - 


tion of behavioral variables. We show that 
selection implies a tendency toward golden- 
rule saving, which means that agents act as if 
they maximize the undiscounted sum of 
per-capita felicities of current and future 
generations. Selection also predicts that pop- 
ulation density, work, saving, and consump- 
tion should vary across regions in response 
to regional differences in the natural envi- 
ronment. 
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Fertility and the Personal Exemption: Implicit 
Pronatalist Policy in the United States 


By LESLIE A. WHITTINGTON, JAMES ALM, AND H. ELIZABETH PETERS* 


The effect of the income tax system has 
been largely ignored in existing empirical 
work on fertility.! This neglect is surprising. 
It has often been demonstrated that tax im- 
pacts on behavior can be quite substantial, 
even in areas in which economic costs and 
benefits are not commonly thought to play a 
dominant role.? Moreover, the governments 
of many countries act as if they believe that 
they can affect the birthrate through tax 
incentives. A concern about population de- 
cline has induced some governments to adopt 
explicitly pronatalist policies (Michael S. 
Teitelbaum and Jay M. Winter, 1985). France 
and West Germany have extensive systems 
of family allowances. East Germany and 
Hungary have implemented policies that in- 
clude one-time birth payments and paid ma- 
ternity leave (Henk J. Heeren, 1982, Jerome 
S. Legge, Jr., and John R. Alford, 1986). 
Canada and Singapore also attempt to in- 
fluence fertility rates through tax policies 
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lege, and the University of Michigan-Dearborn, and 
three referees for helpful comments. 

A few economists have examined this issue. Peter H. 
Lindert (1978) estimates child costs over time and in- 
cludes the tax benefits as an input to those costs. T. 
Paul Schultz (1981) speculates that the tax system might 
affect the demand for children. More recently Thomas 
Espenshade and Joseph J. Minarik (1987) estimate the 
impact on fertility of the 1986 reform act. In their 
analysis, however, they ignore the potential effect of a 
change in the cost of a child and focus, instead, on 
income effects. | 

This literature is enormous. A useful starting point 
is Henry J. Aaron and Joseph Pechman (1981). 


(Malcolm Gillis, Dwight H. Perkins, Michael 
Roemer, and Donald R. Snodgrass, 1983). 
At the other extreme, China has received 
international attention for the extreme anti- 
natalist economic incentives that it has insti- 
tuted. A few studies report that these poli- 
cies have some impact on fertility rates, but 
the magnitude of that impact is still an issue 
(Kingley Davis, Mikhail S. Bernstam, and 
Rita Ricardo-Campbell, 1986). 

The United States has not implemented 
such explicit policies as family allowances or 


paid maternity leaves. However, the federal 


income tax has a feature that may implicitly 
affect the decision to have a child: the per- 
sonal exemption for dependents. As noted 
by Joseph Pechman (1983), the personal ex- 
emption is justified not as a policy for influ- 
encing the fertility decision, but as relief for 
low-income hoüseholds and families of the 
burden of taxation; its amount is roughly 
based on. the income needed to maintain an 
adequate diet. Nevertheless, the personal ex- 
emption also represents a clear subsidy for 
each child, a subsidy whose value depends 
upon the marginal tax rate of the family. 
The effect of this tax feature on the aggre- 
gate fertility rate in the United States has 
not been previously examined. 

In this paper, we estimate an aggregate 
fertility equation for the United States from 
1913 to 1984. Fertility is modeled as a func- 
tion of various economic and demographic 
factors, including the tax value of the per- 
sonal exemption. The. primary result is that 
the personal exemption has a positive and 
significant effect on the national birthrate, 
and this result is robust to a variety of 
specifications. Although the elasticity of the 
birthrate with respect to the exemption is 
not large, it appears that the United States 
— and perhaps other countries as well —can 
influence to a degree the fertility decisions of 


` its citizens through deliberate changes in tax 
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policies. 
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TABLE 1 — CHILD COSTS AND THE PERSONAL EXEMPTION FOR DEPENDENTS 


r 


Annual Child Costs Tax Value of 
Medium Income : 


Study Group? 
Turchi (1983) 

(1981 Dollars). 

Avg Male Child . . $3470.76 
Espenshade (1984) _ 

(1981 Dollars) ' i 

One Child - 5900.04 

2nd Child Addition 3255.48 
: 3rd Child Addition 2311.20 

. Olson (1983) 

(1982 Dollars) 

One Male Child 7293.12 

2nd Child Addition ` 5015.16 - 
BLS (1982) 

(1981 Dollars) 

One Child _ 3405.00 
_ 2nd Child Addition . : 2938.44 

3rd Child Addition 2958.84 
USDA (1982) 

(1981 Dollars) 

Avg Urban Child 4466.64 


Personal 
Exemption 
As a Percent of 
Personal | Annual Expense 
Exemptior? in Percent 
$325.C0 9.36 © 
325.00 $51 
32500 —. . 9.98 
325.00 : 14.06 
295.00 4.04 
295.00 5.88 
325.00 . 9.54 
325.00 11.06 
325.00 10.98 
l 325.00 -7.28 . 


,Child costs derived from Williams (1987), 11-145. (Monthly ex»enses X12 months) 
>In 1981 the value of the personal exemption is 1,000 dollars per dependent and the 
average marginal tax rate is 32.5 percent; in 1982 the personal exemption is 1,000 


dollars per dependent and the average marginal tax rate is 29.5 percent. 


Section I examines the impact of the per- 
sonal exemption on the costs of a child. 
Section II presents the empirical model of 
fertility, and. discusses the data and variables 
that are used. Section III presents the empir- 
ical results. Conclusions and policy implica- 
tions are summarized in Section IV. + 


I. Child Costs and the Personal Exemption 


The standard economic model of fertility 
posits that children provide their parents 
with positive utility in either consumption or 
production. The cost of a child depends üpon 
the cost of time inputs and the cost of the 
goods and service inputs used in child rear- 
ing? The cost of a child also depends upon 


. See Gary Becker (1960) and Schultz (1973) for a 
theoretical discussion. For examples of empirical studies 


linking fertility and economic variables, see William P. . 


Butz and Michael P. Ward (1979), Marc Nerlove and T. 
Paul Schultz (1970), Michael Wachter (1975). `- 


the annual tax savings generated by that 
child, equal to the value of the personal 
exemption multiplied by the marginal tax 
rate of the parent claiming the child as a 
dependent. This annual subsidy rises with 
the marginal tax rate and with the size of the 
personal exemption. . 

The value of the personal exemption is not 
inconsequential relative to the annual cost of 
raising a child. Robert G. Williams (1987) 
has compiled data on child costs from a 
number of economic studies. As shown in 
Table 1, these estimates vary considerably. 
However, it is obvious that in all cases the 
tax subsidy represents a large portion of the 
annual monetary expenses of child rearing. 
Using a medium socioeconomic status as a 
measure, the tax value of the personal ex- 
emption represents from 4 to 9 percent of 
the annual estimated monetary cost of rais- 
ing one child. For subsequent children, the 
tax value of the personal exemption in- 
creases as a portion of the annual child 
costs, ranging from almost 6 to 14 percent. 
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It is also important to recognize that the 
personal exemption is an ongoing support 
item. The parents of a child receive this tax 
subsidy for every year that the child is filed 
as a dependent. In most cases this will be a 
minimum of eighteen years. Thus the per- 
sonal exemption is a stream of subsidies to 
birth, not a one-shot payment. At a discount 
rate of 10 percent, the present value of an 
annual stream: of payments of $325—the 
average tax value of the personal exemption 
in 1981—is nearly $3000; at a discount rate 


of 5 percent the present value rises to almost - 


$4000. 
The personal exemption assumes a much 


different role than other economic subsidies . 


to children that have been examined. For 
example, the relationship between birthrates 
and various income maintenance programs 
like Aid to Families with Dependent Chil- 
dren (AFDC) has been explored, and the 
results generally fail to prove that higher 
welfare payments lead to higher fertility.* 
AFDC, however, is a system of payments 
extended only to those with economic need, 
usually only to single parent families, and is 
not necessarily received over the entire de- 
pendency period of the child. 

It is clear that the personal exemption 
encourages fertility by decreasing the rela- 
tive cost of children. Of course, whether and 
how much individuals in fact respond to this 
incentive is an empirical issue. 


ax The Empirical Framework 


The previous section argued that the cost 
of children affects the fertility decision, and 
that these costs depend in part upon the tax 
value of the.personal exemption. However, 
there are clearly other factors that influence 
the birth decision. This section discusses 
those other factors and presents the basic 
model of fertility. 

We estimate the following reduced form 
equation for the period 1913, the year in 
which the modern federal income tax came 


4See Glen G. Cain (1977) and David T. Ellwood and 
Mary Jo Bane (1985). 
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into existence, to 1984: 


(1) General Fertility Rate 
= fl, + B; Personal Exemption 


- + Income 
+ B,Unemployment 
+ B,Infant Mortality 
+ B,Immigration + B, Female Wage 
+ B,Birth Control + BjWorld War II 
+ B,Time Trend. 


Table 2 gives the variables, definitions, and ` 
means. 

The dependent variable, the general fertil- 
ity rate, is the birthrate per thousand women 
between the ages of 15 and 44. This is the 
group commonly considered to be at risk of 
pregnancy. The general fertility rate is less 
sensitive to changes in the age and sex struc- 
ture of the population than the crude 
birthrate. The fertility rate series is reported 
in Appendix 1. 

Due to biological constraints, the birth of 
a child will lag the decision to have a child. 
For this reason we estimate the fertility 
equation in lagged form. Several different lag 
structures are used to test the sensitivity of 
our results to assumptions about the timing 
of the fertility decision process. The correct 
lag structure is difficult to identify. One ap- 
pealing structure is an inverted V pattern 
with weights initially increasing and then 
decreasing. The rationale for this form comes 
from the biological average of 24 to 31 
months required to produce a birth (T. Paul 
Schultz, 1981). Under this structure, the 
variable is lagged four periods from the cur- 
rent period, and the peak comes in the £ —2 
period. The constructed variable W, be- 
comes: 


(2) W,- Wi X, w,X,_1 + WO X, 


TW X, 4 WX, 4, 
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TABLE 2— VARIABLE DEFINITIONS AND MEANS 


Variable Name 
General Fertility Rate 
Personal Exemption 


Male & Asset Income 


‘Unemployment 
Infant Mortality 
Immigration 


Female Wage 


Pill 
WWII . 
Time Trend 


Female Education 


Definition 


Births per 1000 women 
aged 15 to 44 

Real tax value of ` 
personal exemption 

Real after-tax personal 
income per family 
net of female earnings 

Unemployment rate of 
civilian labor force. 

Infant mortality per 
1000 live births 

Immigration of at-risk 
group as a percent 
of resident at-risk group 

Average after-tax 
female wage 

Dummy variable equal 

_to one in years 

1963 to 1984 " 

Dummy variable equal 
to one in years U.S. 


- was in World War II 


Time trend equal to one 
in 1913 and increasing 
by one each year 

Female high school 
graduates each year 
as a percent of female . 
population 


Mean 


95.50 


100.40 


1466.37 


0071 


43.02 


2.009 


Standard 
Deviation 


19.64 
65.88 


2982.78 
0.053 


26.84 


0.0035 


0.532 


0.464 
0.256 


20.92 


0.004 
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where w,<w,<w, and w> w> ws. The 
lag structure thus declines around the mean 
lag of two years, which is the average time 
required to produce a birth. A two-year, 
rectangular lag structure is also examined: 


(3) W,—wX,tw,X, 4t w4X, 5, 


where w, = w, = w,. Several other.lag struc- 
tures have been estimated with no significant 
impact on the results. 

The independent variable of primary in- 
terest is the real tax value of the personal 
exemption, equal to the personal exemption 
multiplied by the average marginal tax rate. 
The federal income tax began in 1913, and 
the personal exemption became a feature of 
the federal tax system in 1917.. We deflate 
values of the personal exemption using the 
Consumer Price Index. We use average 
marginal tax rates estimated by Robert. J. 


Barro and Chaipat Sahasakul (1983, 1986).° 
Although Congress has changed the person- 
al exemption only nine times between 1913 
and 1984, the real tax value of the exemp- 
tion exhibits substantial fluctuation due to 
changes in the average marginal tax rate and 
in the price level. See Figure 1 and Appendix 
T for this series. 

Additional independent abe include 
both those that affect birthrates by changing 


* *Barro and Sahasakul (1983, 1986) calculate the av- 
erage marginal tax rate from 1916 to 1983. Their 
methodology is employed to calculate the rate for 1984. 
Though the empirical work reported here looks at val- 
ues 1913 to 1984, there was no personal exemption prior 
to 1917. Thus no values of PE are missing. Barro and 
Sahasakul report average marginal tax rate series 


weighted by adjusted gross income and by number of 


returns filed; both approaches are calculated arithmeti- 
cally and geometrically. All four series were tried in 
estimating the model with no substantial difference in 
the results. 
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Exemption 


Personal 


1910 1920 1930 1940 
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1950 1960 1970 1980 1990 


Year 


FIGURE 1. REAL TAX VALUE OF PERSONAL EXEMPTION 


the demand for children and biological fac- 
tors that influence the supply of births. In- 
come is measured as the sum of male earn- 
ings and non-wage family property income.$ 
Because women in general still commit more 
time to the care of children than do men 
(Arleen Leibowitz, 1975), women's wages are 
a reasonable proxy for the time cost of chil- 
dren. Consequently, if mother's earnings are 
included in family income, the estimated co- 
efficient on that variable confounds the esti- 
mates of the income effect with those of a 
price effect. To estimate a pure income ef- 
fect, we therefore consider family income net 
of mother's earnings. If children are a nor- 
mal good, we expect the sign on income to 
be positive. However, Gary Becker and H. 
Gregg Lewis (1973) argue that high income 
families may invest more in the quality of 
each child, so that income may have a nega- 
tive effect on quantity. 
` Measuring the cost of time inputs presents 
major difficulties. In fact, the lack of an 
adequate measure of the value of women's 
time may be partially responsible for the 
relatively small number of time-series studies 
on fertility. Women's wages were not regu- 


Male earnings are from the Statistical Abstract, 
various years, and Current Population Reports, Series 
P-60. Average property income is derived from the 
national income accounts. 


larly reported in the first half of the century, 
so there is no continuous wage series avail- 
able for statistical work. In addition, ás much 
of the cross-section work on fertility during 
the last decade has noted, the observed mar- 
ket wage is a biased measure of the value of 
time for women who do not work in the 
market." This paper constructs a wage series 
from various sources. June O'Neill (1985) 
reports the ratio of female to male wages 
from 1955 to 1982. Using these ratios and 
data on production wages, we construct a 
female wage series. We then multiply wages 
by the average marginal tax rate to obtain 
the after-tax female wage. See Appendix 2 
for details about the construction of this 
series. Though the series represents only one 
sector, wage trends across sectors c over time 
are highly correlated. 

The infant mortality rate is included to 
capture two possible effects. The death of a 
child could cause an increase in the birthrate 
if families are concerned about completed 
family size; this is commonly called the re- 
placement effect: However, infant mortality 
increases the cost of producing a surviving 
child. If the cost effect dominates, then an 
increasing infant mortality rate could lower 
the birthrate. 


"See, for example, J ames Heckman (1974). 
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Unemployment may also have an ambigu- 
ous effect on fertility rates. Unemployment 
results in lower transitory income, so that 
high rates of unemployed workers would 
lower birthrates if children are a normal 
good. However, unemployment is also likely 
to lower the opportunity cost of spending 
time in the production of children. The time 
effect would result in a higher birthrate. Be- 
cause of its transitory nature, we predict that 
unemployment will primarily affect the tim- 
ing of births rather than the number of 
ever-born children. 

Birthrates also differ across cultures. 
Changing immigration rates in the United 
States may therefore account for a portion 
of the change in U.S. fertility over time. 

A dummy variable equal to one during 
World War II (1941-45) accounts for the 
absence of young men during the war years, 
and a dummy variable equal to one for the 
years that the birth control pill has been 
‘widely available (1963-1984) is also in- 
cluded. Finally, we use several specifications 
of a time trend to capture any unobserved 
socioeconomic factors that might affect fer- 
tility and that have changed over time. 


Ii. Estimation Results 


Table 3 reports estimation results. Gener- 
alized least squares estimation is performed 
with a Yule-Walker first-order autocorrela- 
tion correction scheme. Six models are pre- 
sented, including those with differing lag 
structures. Figure 2 compares the actual fer- 
tility rate with the rate predicted from model 
4. In general, the fit is good. However, actual 

, rates are slightly higher than predicted dur- 
ing the peak baby boom years and somewhat 
lower than predicted during the years of the 
Great Depression. 

Of primary concern here is the impact of 
the real tax value of the personal exemption. 
The personal exemption has a positive and 
significant impact on birthrates in all speci- 
fications in Table 3. The results are robust to 
the specifications of different lag structures, 
models 1—4, to the exclusion of a time trend, 
model 5, and to the substitution of female 
education for female wages, model 6. The 


coefficients range in value from 0.121 to - 
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0.236, and the estimated elasticities of fertil- 
ity with respect to the personal exemption 
range from 0.127 to 0.248. These estimated 
elasticities are consistent with the wage elas- 
ticities of fertility estimated in our study and 
smaller than those reported in other time- 
series studies.) The largest values occur with 
a five-period inverted V lag structure. These 
results suggest that an increase in the tax 
value of the personal exemption of 50 dol- 
lars will increase the general fertility rate by 
6 to 12 births per 1000 women at risk. 

Perhaps surprisingly, the coefficient on real 
personal income per household is often neg- 
ative, although not significantly different 
from zero. The negative sign on income is a 
fairly common result in cross-sectional tests.? 
With a few exceptions such as Marc Nerlove 
and T. Paul Schultz (1970), time-series stud- 
ies have generally found a positive impact.!? 

Other variables generally have predicted 
signs. The coefficients on the infant mortal- 
ity rate are positive, indicating that in the 
United States the replacement effect domi- 
nates the cost effect, but the coefficient is 
rarely statistically significant. The positive 
coefficient is consistent with the estimates by 
Schultz (1974) in Taiwan, Nerlove and 
Schultz (1970) in Puerto Rico, and Michael 
P. Shields and Ronald L. Tracy (1986) in the 
United States, Unemployment has a nega- 
tive effect on childbirth, which suggests that 
the temporary income effect dominates the 
value of time effect of unemployment. 

The performance of the measure of female 


wages is generally disappointing; the sign is 


usually negative, but the coefficient is often 
insignificantly different from zero. It might 
be argued that female wages and fertility are 
simultaneously determined, so that inclusion 
of a female wage creates endogeneity prob- 
lems. We tested for endogeneity using the 
Jerry A. Hausman (1978) specification test. 
This test consisted of regressing the sus- 


Sour wage elasticities range from 0.03 to 0.18. Butz 
and Ward (1979) report a wage elasticity of 0.751 -to 
1.846. 

See also Julian Simon, 1969. 

See, for example, Wachter (1975), Butz and Ward 
(1979), Shields and Tracy (1986). 
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TABLE 3—IMPACT OF THE PERSONAL EXEMPTION ON FERTILITY IN THE UNITED STATES, 1913-1984 


Independent Variable 1 2 
Personal Exemption 0.121** 0.191** 
(2.714) (4.040) 
Male & Asset Income — 0.0004 0.0004 
(0.147) (0.141) ` 
Unemployment —73.430** —36.862 
(2.147) (1.108) 
Infant Mortality 0.083 0.303 
(0.325) (1.263) 
Immigration T1A24** 1529.20** 
(2.487) . (3.183) 
Female Wage 5.647 —2.157 
(0.360) (0.153) 
Pill —10.856* — 8.958 
(1.772) (1.622) 
WW II —17223**  —10.49** 
(3.452) (2.596) 
Time Trend —0.539 — 0.389 
(1.002) (0.785) 
Female Education - - 
Intercept 102.979** 79.961** 
(4.175) (3.373) 
R? 0.916 0.931 
Elasticity of 
Fertility with 
Respect to Exemption 0.127 0.201 


Absolute value of t-statistic in parentheses. 
Model 1: No lags on independent variables. 
Model 2: 
Model 3: 
‘Model 4: . 
Model 5: 
trend. 
Model 6: 


Model 
3 4 5 6 
0.230** 0.157** 0.236** 0211** 
(4.805) (3.959) (5.018) (4.56) 
— 0.0004 —0.0019 — 0.0005 0.001 
(0.174) . (0.760) (0.258) (0.644) 
—31.455 — 30.328 —34.639 . —16.828 
(0.998) (1.000) (1.112) (0.526) 
0.310 0.296 0.439** 0.180 
(1.284) (1.300) (2.470) (0.750) 
1372.87** 319.55 1433.43** 1181.48** 
(3.119) (1.244) (3.324) (2.723) 
— 8.712 — 11.261 — 13.804 - 
(0.661) (0.867) (1.200) 
— 6.922 ` — 5.561 — 7.854 — 5.952 
(1.364) (1.067) (1.599) (1.197) 
— 5.353 0.016 — 4,997 — 4.171 
(1.356) (0.004) (1.279) (1.240) 
— 0.377 0.025 - —0.471 
(0.803) (0.051) (1171) 
- - c - — 2196.46* 
(1.781) 
81.628** 93.978** + 14.488** 93.270** 
(3.531) (4.114) (3.519) (3.975) . 
0.941 0.943 0.940 0.944 
0.242 0.165 0.248 0.221 


Three-year rectangular lag on personal exemption; no lags on other independent variables. 
Five-year inverted V on personal exemption; no lags on other independent variables. 
Two-year lag on all indeperdent variables except pill, World War II and time trend. 
Five-year inverted V on personal exemption; no lags on other independent variables; no time 


Five-year inverted V on personal exemption; no lags on other independent variables, female 


education used as a proxy for female wage. 


**: Significant at the 5 percent level. 
*: Significant at the 10 percent level. 


pected endogenous variable-female wages-on 
all exogenous variables in the fertility regres- 
sion plus the variables education, race, and 
urban population. The actual value of the 
wage and the residual from this regression 
were then included in a fertility regression. 
Endogeneity is not a significant problem if 
the /-statistic on the residual is insignificant. 
Identification was achieved by including 
variables—education, race, and urban popu- 
lation—in the wage equation that are ex- 
cluded from the fertility equation. The t-sta- 
tistic on the wage residual was 0.555, which 
indicates that endogeneity is not a severe 


problem. As an additional test of the sensi- 
tivity of our results to the possibility of 
endogenous wages, Table 3 reports a re- 
duced form fertility regression that uses fe- 
male education as a proxy for female wages 
(model 6). 

We examined a variety of other specifica- 
tions, and the results are generally consistent 
with those presented in Table 3.1! The re- 
sults are robust to the inclusion of additional 
variables such as the racial distribution of 


"These results are available upon request. 
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FIGURE 2. GENERAL FERTILITY RATE— ACTUAL AND PREDICTED 


the population and the percent of urban 
population. These variables, however, did not 
contribute significantly to the explanatory 
power of the equations, and they were 
dropped from the analysis. The results are 
also robust to various lag structures for the 
independent variables and to the specifica- 
tion of various time trends, (for example, no 
time trend, a linear time trend, and a 
quadratic time trend). We also tested for 
structural differences in the estimates across 
different subperiods. Regressions done sepa- 
rately for periods 1913-48 and 1949-84 both 
yielded positive and significant coefficients 
on the tax value of the personal exemption. 
A. Chow test on this split sample indicated 
that there was no structural difference in the 
full regression and the sub-regressions. 12 Be- 
cause the largest variations in the tax value 
of the personal exemption occurred during.a 
short period of time, 1940—49, the subperiod 
results are somewhat sensitive to the particu- 
lar years included in each subperiod. On the 
whole, however, these results support the 
hypothesis that an increase in the tax value 
of the personal exemption leads to an in- 
crease in the demand for children. 


12The Chow test yielded on F-statistic of 0.420 with 
8 and 56 degrees of freedom, which is less that the 
critical value of 1.69. 


IV. Implications for Public Policy ` 


Our empirical results indicate that tax pol- 
icy, at least in the form of the: personal 
exemption, has an impact on aggregate fam- 
ily birth decisions. The policy ramifications 
for this particular tax feature are potentially 
important. Consider, for example, what our 
estimates imply about the impact of the Tax 
Reform Act (TRA) of 1986 on the birthrate. 
Thé TRA changes the tax value of the per- 
sonal exemption in several. ways. The statu- 
tory value of the personal exemption in- 


` creases from $1,080 in 1986 to` $2,000 in 


1989 when fully phased in; thus the nominal 
value of the personal exemption virtually 
doubles. However, this increase is dampened 
because marginal tax rates are reduced for 
many taxpayers. The overall impact will 
depend on whether the increasing nominal 
value of the personal exemption or.the falling 
marginal. tax rate dominates. Further, the 
personal exemption is fully applicable only 
to taxpayers with taxable incomes of 
amounts less than $149,250; the birth sub- 
sidy is not extended to the highest income 
households. Finally, a large number of 


i ferry. A. Hausman and James Poterba (1987) calcu- 
late that 47.7 percent of taxpayers will have a marginal 
tax rate decrease of 0 to 10 percent, and 11.3 percent 
will have a decrease greater than 10 percent.’ 
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households will no longer be within the tax 
system under the new tax law. The zero 
bracket amount is increased substantially; 
thus the lowest income groups will be out- 
side of the tax system. 

Consider a married couple filing jointly 
with income of $16,000. Prior to TRA, this 
couple received a personal exemption per 
dependent of $1,080 and faced a marginal 
tax rate of 16 percent; the tax value of the 
personal exemption in 1986 dollars was 
therefore $173. In 1989, earning the same 
income, this couple now would receive a 
personal exemption per dependent of $2,000 
and face a marginal tax rate of 15 percent. 
Assuming a 4 percent rate of inflation, the 
tax value of their personal exemption in 
1986 dollars would now be $267, an increase 
of almost 55 percent. A family who faced a 
marginal tax rate of 49 percent on taxable 
income of $109,400 prior to TRA would 
experience a much smaller change in the tax 
value of the personal exemption. Formerly, 
this subsidy had a value to them of $529. 
Under TRA this subsidy is worth $533, an 
increase of less than 1 percent. The average 
increase in the tax value of the personal 
exemption is fifteen percent. 

Overall, the estimates in this paper indi- 
cate that the TRA changes in the personal 
exemption may generate a significant in- 
crease in the birthrate. Using the most con- 
servative coefficient estimate of the impact 
of the personal exemption on birthrates 
(0.121), the increased personal exemption 
when fully phased in will cause an increase 
of 7.53 births per thousand women at risk. 
This represents an 11 percent rise in the 
current birthrate. Middle income families will 
receive the largest birth incentive, while low 
and high income groups will experience a 
disincentive to births. Although the elasticity 
of births with respect to the personal exemp- 
tion is small, statutory changes in the value 
of this subsidy have usually been relatively 
large. In particular, the changes due to TRA 
are longer than the changes during all but a 
few other years in our time-series. Thus our 
estimated fertility response is substantial. 

Of course, there are other aspects of TRA 
that might mitigate the fertility effect of an 
increase in the personal exemption. For ex- 
ample, the lower marginal tax.rates insti- 
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tuted by the law will increase the opportu- 
nity cost of spending time with children and 
may lower the demand for children. In addi- 
tion, if TRA leads to an increase in after-tax 
income, and if the income elasticity for child 
quality is large, the demand for numbers of 
children may decline. In this paper we do 
not address these other factors and thus the 
total impact of TRA on fertility is likely to 
be smaller than the estimates that we present 
here. Because of the potentially large impact 
of this tax policy tool on fertility rates, em- 
pirical work using micro structures data or 
data from countries with different tax struc- 
tures would be useful to provide additional 
evidence about the size of this impact. 


APPENDIX 1 


General Fertility Rate and the Personal Exemption 
for Dependents, 1913-1984 








Real Tax 
Value of 

Year Birthrate PE 
1913 . 124.7 0 
1914 126.6 0 
1915 125.0 0 
1916 123.4 0 
1917 121.0 19,27 
1918 119.8 23.94 
1919 111.2 20.07 
1920. 117.9 15.33 
1921 119.8 34.32 
1922 111.2 36.65 
1923 110.5 25.83 
1924 110.9 27.34 
1925 106.6 22.85 
1926 102.6 21.13 
1927 99.8 24.61 
1928 93.8 31.96 
1929 89.2 27.29 
1930 89.2 18.40 
1931 84.6 14.91 
1932 81.7 28.36 
1933 76.3 31.95 
1934 78.5 33.91 
1935 71.2 36.98 
1936 75.8 50.12 
1937 711 42.79 
1938 79.1 32.22 
1939 77.6 36.53 
1940 79.9 53.33 
1941 83.4 102.49 
1942 91.5 137.70 
1943 94.3 141.20 
1944 88.4 243.83 
1945 85.9 238.40 
1946 101.9 193.16 
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Year - 


1947 
1948 
1949 
1950 
1951 
1952 
1953 
1954 
1955 
1956 
1957 
1958 
1959 
1960 
1961 
1962 
1963 
1964 
1965 
1966 
1967 
1968 
1969 
1970 
1971 
1972 
1973 
1974 
1975 
1976 
1977 
1978 
1979 
1980 
1981 
1982 
1983. 


1984 - 


Year 
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Birthrate 


113.3 
1073 
107.1 


106.2 


1115 
113.9 
1152 
1181 
118.5 
1212 
122.9 
1202 
118.8 
1180 
1172 
1122 
108.5 - 
105.0 
96.6 
91.3 
87.6 
85.7 
86.5 
87.9 
81.8 
734 
69.2 
68.4 
66.0 
65.8 
66.8 
65.5 
612 
68.4 
674 
673 
65.8 
65.4 
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Real Tax 
Value of 
PE 


168.90 
149.79 
.147.05 
163.10 
178.14 
- 18943 
186.51 
165.46 
170.57 
171.00 
165.12 
158.66 
162.19 
158.28 
160.71 
161.58 
161.61 
142.73 
134.60 
133.94 
133.80 
145.10 
142.62 
130.58 
132.99 
144.85 
140.87 
130.49 
122.36 
120.08 
116.11 
118.98 
132.93 
123.17 
119.31 
102.04 
. 92.49 
83.90 


Average Female Real Wage, 1913-1984 
(1967 = 1.00) 


Wage 


Year 


Wage 
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Year l Wage Year. - Wage 





Iz Eg n EE 
1935 ` 0.959 1960 1.776 
1936 0.928 1961 1.739 
1937 ,0.981^ 1962 1.777 
1938 0.988 1963- 1.812 
1939 1.000 1964 1.855 
1940 1.043 1965  . 1.903 
1941 : 1.084 1966 : 1.859 
1942 . l 1.147 1967 . 1.918 
1943 , 1.278 1968 1.979 
1944 1.351 1969 2.063 
1945 1.358 1970 2.064 
1946 1.359 1971 2.057 
1947 1.368 1972 1.094 
1948 : 1.405 1973 2.061 
1949 1.323 1974 2.034 
1950 . 1239 1975 ` . ` 2403 
1951 | 1235 1976 2.170 
1952 . 1.287 1977 2.187 
1953. 1.423 1978 : 2.277 
1954 . 1.404 1979 2.206 
1955 1.661 1980 2.136 
1956 1.669 1981 2.106 
1957 1.729 1982 2.173 
1958 - 1.746 1983 2216 
1959 1.765 1984 : 2.240 


Construction of Wage Series Years. 1914, 1920-1948: 
From Historical Statistics, Series D830-844, Average 
Hourly Earnings of Production Workers by Sex. 

Years 1913, 1915—1919: Estimated based on average 
wage growth in production, from Series 802-810 and 
Series 765-778. 

Years 1955-1982: O'Neill (1985) calculates a fe- 
male/male wage ratio based on annual wage earnings 
and average hours worked by both sexes. This ratio is 
transformed into a female/average wage ratio, so that 
information on average production wages can be used 
to determine female production wages. Using the rela- 
tionship: 


l Female/Male = (Average/Male) 
X (Female/Average) : 


the female/average ratio can be calculated from 
O’Neill’s (1985) existing series on female/male wages 
and data on male wages and average wages found in 
Statistical Abstract, various years. This ratio is adjusted 
upward by 0.027, to account for the difference between 
O’Neill’s ratio and the observed female/male wage ra- 
tio in production in 1939, a year in which both are 
available. Finally, the adjusted female/average ratio is 
multiplied by the average wage in manufacturing to 
arrive at a female wage rate. 

Years 1949-1954, 1983-1984: Using median wage 
earnings, and adjusting for an approximate 9 percent 
hour differential between men and women, the ratio of 
female to male wages is calculated. This is transformed 
into a female/average ratio using the techniques 
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described above. This ratio is multiplied-by the average 
manufacturing wage. 

We compared this series to a decennial s series pro- 
vided by Claudia Goldin. This series trends in the same 
way as Goldin’s series. 
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The Relationship Between the Marginal Cost of Public 
Funds and Marginal Excess Burden 


By ROBERT K. TRIEST* 


A large literature has recently developed 
which attempts to measure the marginal cost 
(shadow price) of public funds.! The motiva- 
tion for much of this literature is the rec- 
ognition that many of the tax instruments 
used by governments generate substantial 
amounts of excess burden (deadweight loss). 
Jerry A. Hausman (1981), for example, esti- 
mates that for an “average” prime-aged 
American male with a wage of ten dollars in 
1975 the excess burden generated by the 
federal income tax was over fifty percent of 
the revenue raised from taxing that individ- 
ual. One might suspect that such large excess 
burden estimates would require a change in 
the way government project costs are calcu- 
lated in benefit-cost analysis. If the revenue 
required to finance a project is raised through 
distortionary taxation, then intuition would 
suggest that the excess burden generated by 
the increase in tax rates needed to finance 
the project should be incorporated into the 
cost of the project. i 

However, this analysis is not entirely con- 
sistent with the theoretical literature on the 
optimal provision of public goods which are 
financed by distortionary taxation. When 
distortionary taxes are used to finance public 
goods, and the level of public good provision 
does not affect. consumption of the taxed 
goods, the standard Samuelsonian rule which 
equates the sum (over consumers) of the 


*Department of Economics, University of California, 
Davis, CA 95616. This paper is based on chapter two of 


my dissertation at the University of Wisconsin-Madi- 


son. I wish to thank the members of my dissertation 
committee, Martin David, Robert Haveman (chair), and 
Arthur Goldberger, for very helpful advice and com- 
ments. I have also benefited from comments by Don 
Fullerton and Bruce Hamilton. An anonymous referee 
provided unusually helpful comments and corrected an 
error in the figure. 

*Don Fullerton (1989) provides a recent survey and 
analysis of this literature. 
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marginal rates of substitution between a pri- 
vate good and a public good with the 
marginal rate of transformation between the 
two goods must be modified to account for 
the marginal effect of increases in tax rates 
on the revenue raised from the exist- 
ing taxes.? The intuition behind this result is 
that we need to correct. for indirect 
“leakages” from existing government rev- 
enue sources when considering an incremen- 
tal expenditure. If the increase in tax rates 
needed to finance the project causes a reduc- 
tion in the revenue raised by the current tax 
structure, then we need to account for this 
loss in calculating the cost of the project. 

It is important to note that this result 
concerns the values of uncompensated de- 
mand elasticities. An example first provided 
by Atkinson and Stern (1974) illustrates this 
point. Consider a world where the only tax is 
on labor earnings, and incremental expendi- 
ture on the public good does not affect the 
demand for leisure. In this case, the marginal 
rate of transformation will overstate the cost 
of the public good as long as the uncompen- 
sated wage elasticity of labor supply is nega- 
tive. This is because an increase in the tax 
rate causes an increase in labor supply, lead- 
ing to an indirect increase in government 
revenue. 

From the excess burden perspective de- 
scribed earlier, this example is very puzzling. 
It is well known that even if the uncompen- 
sated own price elasticity of a good is zero, a 
tax on that good can still result in a large 
deadweight loss. We instead need the com- 
pensated elasticity to be zero to ensure that 
there will be no excess burden. However, the 


?This is shown by Anthony Atkinson and Nicholas 
H. Stern (1974) and David E. Wildasin (1979). The 
effect of provision of the public good on consumption 
of the taxed goods must also generally be taken into 
account. 
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labor supply example seems to indicate that . 


the cost of an incremental project might 
need to be adjusted downward to account 
for the effects of distortionary taxation even 
when the marginal excess burden generated 
by the taxes is positive. 

The purpose of this paper is to provide an 
explanation of this puzzle. The basic argu- 
ment is that the most appropriate measure of 
excess burden is based on a price vector 
which differs from the one which would be 
most commonly used to measure benefits in 
benefit-cost analysis. Since excess burden and 
benefit-cost analysis are based on different 
price vectors, an index number correction 
must be made in order to relate the two 
concepts. Due to this, the marginal cost of 
public funds may be less than one even 
ünder a tàx system with large associated 


values of both total and marginal excess 


burden. 

Section I of the paper briefly describes a 
measure of excess burden and evaluates how 
it relates to benefit-cost analysis. In Section 
II, the relationship between marginal excess 


burden and the marginal cost of public funds: 


is derived. Section HI provides a numerical 
example which indicates that marginal ex- 
cess burden may bė a very misleading indi- 
cator of the marginal cost of public funds. 
Section IV concludes the paper with a dis- 
cussion of the results. 


_ Y. Excess Burden and Benefit-Cost Analysis 


Although the basic idea behind excess 
burden is very simple, there is a large litera- 
_ ture on the appropriate way to measure it. A 
recent review of this literature is provided by 
Alan J. Auerbach (1985). In order to sim- 
plify the analysis, we will consider only the 
case of a single consumer economy with 
fixed producer prices. The fixed producer 
prices assumption is equivalent to assuming 
that the production possibilities frontier is 
linear. These assumptions allow us to ignore 
problems of income distribution and pro- 
ducer price changes caused by government 
policy. 
Measures of excess burden attempt to 
. measure, in monetary units, the decrease in 
the consumer’s utility due to imposition of a 
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tax system minus the value of the real re- 


sources which are captured by the govern- 
ment through the tax system. In other words, 
we want to measure to what degree the loss 
of the consumer's welfare due to the tax 
system exceeds the loss which would have 
resulted from the use of a non-distortionary 
(lump-sum) tax. In the case of a lump-sum 
tax, the utility loss and the value of the 
resources captured by the government are 
equal and offsetting. However, when distor- 
tionary taxes are used, the decrease in con- 
sumer welfare exceeds the revenue raised by 
the tax system. 

The various measures of excess burden 
which have been proposed differ in (a) how 
the change in the consumer's utility is mea- 
sured and (b) how the magnitude of the tax 
revenue is measured. A concensus appears to 
have formed that the most acceptable way of 
measuring excess burden is to subtract the 
actual revenue raised by the tax system from 
the equivalent variation associated with the 
imposition of the tax system. This measure, 
which was proposed independently by John 
A. Kay (1980) and Elisha A. Pazner and 
Efraim Sadka (1980), may be written alge- 
braically as 


() e(gu) -e(p u) - tx»), 


where p is the pretax (producer) price vector, 
tis the vector of commodity taxes, q is the 
post-tax price vector (which is equal to p+ t), 
e(p, u) is the expenditure function evaluated 
àt price vector p and utility level u, and 
x(p, y) is the vector of Marshallian demands 
evaluated at prices p and income level y. 
The utility level at which the expenditure 
function is evaluated, 1, is the level of well- 
being corresponding to the post-tax world; it 
may be written in terms of the indirect util- 
ity function as u, = v(q, y). Verbally, equa- 
tion (1) may be interpreted as the difference 
between the maximum amount the consumer 
would have been willing to pay to avoid the 
imposition of the tax system and the amount 
of revenue raised. This measure has the de- 
sirable properties of decreasing due to any 
tax reform which either (i) holds tax revenue 
constant, but results in a higher level of 
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post-tax utility, or (ii) keeps post-tax utility 
constant, but results in an increase in tax 
revenue. In addition, it will be zero if only 
lump-sum taxes are imposed and will be 
positive for any tax system which is less 
efficient than lump-sum taxation. 

One reason why the Kay-Pazner-Sadka 
excess burden measure has such desirable 
properties is that both the welfare loss and 
tax revenue are measured in terms of pro- 
ducer prices. The equivalent variation (EV) 
can be rewritten as the difference between 
the pretax and post-tax utility levels mea- 
sured in terms of producer prices: 


(2) EV = e(q, u) — e (p, u) 


Ey e(p, u) 
= e (p, uo) — e (p, m4). 


Similarly, tax revenue is equal to the value, 
in terms of producers prices, of the differ- 
ence between the bundle of goods consumed 
without taxes and the bundle of goods con- 
sumed after the tax system is imposed: 


(3) t-x(q, y) = (a7p):x(q, y) 
=q: x(q, y) —p-x(q, y) 
c 


(where the last step follows from p-x(p, y) = 
q:x(q, y) = y). It would be inappropriate to 
substitute compensating variation (CV) for 
equivalent variation in the excess burden 
formula, since compensating variation is à 
measure of the value of the change i in utility 
in terms of the final post-tax price vector: 


(à) — CV=e(4, uo) = e(p uo) 
= (a 14) — e(q, a). 


As pointed out by Kay (1980), Pazner and 
Sadka (1980), and Wilfried Pauwels (1986), 
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difficulties arise when excess burden is de- 
fined using the compensating variation to 
measure the decrease in welfare due to taxa- 
tion. The reason for this is that while com- 
pensating variation is based on the post-tax 


| price vector, tax revenue is a measure of the 
‘value of resources based on the pretax price 


vector. Because. of this, combining the two 
into a sensible measure of excess burden is 
very difficult. 

Recently, several authors have seéralized 
the Hicksian consumer's surplus measures to 
the case where a monetary measure of the 
distance between ug and u is obtained in | 
terms of any fixed price vector: 


(5) e(r,uo)— e(r a) | 
= e(r,0(p, y) - e(r.v(q. »)). 


where r is any constant reference price vec- 
tor.? Picking r equal to p (the original price 
vector) yields the equivalent variation. Set- 
ting r to q (the final consumer price vector) 
yields the compensating variation. As long 
as r is held constant over all alternative final 
price vectors being compared, this expres- 
sion is a consistent ordinal indicator of the 
utility associated with. these alternatives (this 
follows from e(r, ug) being constant over all 
alternatives and e(r,u,) being increasing in 
uy). 

In performing a benefit-cost analysis of a 
proposed project, an explicit choice must be 
made for the reference price vector. In actu- 
al benefit-cost studies, net benefits are 
normally based on the current distorted con- 
sumer price vector rather than on the pro- 
ducer price vector. For example, an individ- 
ual’s answer to a question asking how much 
she would be willing to pay to have a partic-. 
ular project built is given in terms of the 
prices that the consumer currently faces 
rather than the prices she would face if all 
distortions were removed. While any price 


"Angus Deaton (1980) and Hal R. Varian (1984) 
present this measure in their discussion of consumer's 
surplus and show that the compensating and equivalent 
variations are special cases of it. Mervyn A. King (1983) 
calls this measure the “welfare gain.” 
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vector can serve as the reference vector, mea- 
sures based on current consumer prices have 
the important advantage of being easily in- 
terpretable by both the policy analysts and 
those affected by the project. If net benefits 
are measured in terms of current consumer 
prices, then in order to be consistent the 
measure of the project’s cost should also be 
in terms of these prices. 

Suppose that the current tax vector is t,, 
the consumer price vector is q, (qi = p-- tj), 
and a change in the tax vector from t, to 
t, is introduced in order to finance a new 
project (q, = p+t,). A appropriate measure 
of the cost of the project in terms of current 
prices is 


(6) e(q;, v(q;, »))- e (q; v(q;, »)). 


This is the equivalent variation (starting from 
price vector qı) associated with the imposi- 
tion of the additional taxes needed to pay 
for the project.* The cost measure (equation 
(6)) is closely related to the first two terms of 
the Kay-Pazner-Sadka measure of excess 
burden (equation (1)). If we change the ref- 
erence price vector of the cost measure from 
qı to p, then it becomes identical to the first 
two terms of the excess burden measure. The 
key difference is that excess burden is based 
on producer prices, while the cost measure 
for use in benefit-cost analysis is based on 
current consumer prices. 

Figure 1 illustrates the importance « of the 
choice of the reference price vector.) Sup- 
pose a consumer allocates his income be- 
tween two goods, x, and x,. The pretax 
budget constraint is AE and the consumer is 
initially at point A’. After a tax is imposed 
on x,, the budget constraint becomes AF 
and the consumer moves to point B’. The 
revenue raised by this tax (letting x, serve as 


numeraire) is AB. The equivalent variation. 


“See Triest (1987b) for analogous benefit and cost 
measures which explicitly model the introduction of the 
new public good. 

5This figure is similar to Figure 1 in Pazner and 
Sadka (1980), where they demonstrate that basing ex- 
cess burden. on compensating variation may lead to 
incorrect welfare rankings. 
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associated with the imposition of the tax is 
AC. Therefore, the deadweight loss of the 
tax (using the Kay-Pazner-Sadka measure) i is 
BC. Notice that even though consumption of 


‘the taxed good does not change due to the 
imposition of the tax, a positive quantity of 


deadweight loss is generated. 

The loss in consumer welfare due to the 
imposition of tne tax in terms of post-tax 
prices is DB'. This measure of welfare loss is 
less than the amount of revenue raised 
(A’B’). However, as noted earlier, this does 
not imply that there is no deadweight loss 
since the welfare loss and tax revenue are 
now based on different sets of reference 
prices. 

In calculating the marginal cost of public 
funds to use in benefit-cost analysis where 
benefits are calculated in terms of current 
consumer prices, we need to find how much 
the welfare cost measure based on the dis- 
torted price vecto- ( DB") increases when tax 
revenue is increased by one dollar. In the 
next section, it is demonstrated that in this 
example the marginal cost of public funds is 


equal to one. When the taxed good has an 


uncompensated own price elasticity of zero, 
the marginal increase in the welfare cost 
measure is exactly equal to the extra revenue 
raised. 


VOL. 80 NO. 3 


Ii. Marginal Excess Burden and the 
Marginal Cost of Public Funds 


The marginal excess burden associated 
with a particular tax is usually defined as the 
increase in excess burden which results from 
a one-dollar increase in tax revenue gener- 
ated through use of that tax. Suppose the 
current tax vector is t, (with consumer price 
vector q5). The marginal excess burden asso- 
ciated with increasing the tax on good i is 
then the partial derivative of the excess bur- 
den measure with respect to the tax on good 
i divided by the partial derivative of tax 
revenue with respect to the tax on good i: 


8 [e(p,v(p. »)) — e(p.v(a,. »)) 





—t,x(q;,y)] 
94; 
8|t;x(a;, »)] 
| 013 
| = 8e(p,v(q;, y)) | 
MM NM P 
— [[t;x(a5, »)] l 
ĝt 


This expression is closely related to the 
marginal cost of public funds. The marginal 
cost of public funds may be defined as the 
increase in the cost measure wħich results 
when an additional dollar of tax revenue is 
raised. If the benefit-cost analyst treats re- 
quired tax revenue as the cost of the project, 
then the marginal cost of public funds is the 
minimum required benefit-cost ratio for in- 
cremental expenditures. As we will see be- 
low, the marginal cost of public funds de- 
pends critically on the price vector which 
benefits are measured in terms of. 

When benefits are measured in terms of 
some reference price vector r and revenue is 
raised through an increase in the tax on 
good i, the appropriate measure of the 
marginal cost of public funds is the partial 
derivative of equation (6) with respect to £; 
divided by the partial derivative of tax rev- 
enue with respect to ¢; (this equation is 
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derived in the Appendix): 


delr, vlaz »)) 

i 0q5 
(9 -Taxan nl] - 
t5; 


[ES 
E gv(r, y - EV) 
| dy. | 


x, (a5; y) ji 


0i 
x(q, y) 


‘i 3.5; 
Jj 


1+ 


where EV is defined implicitly by v(r, y — 
EV) = v(q;, y). Equation (8) is an appropri- 
ate expression for the marginal cost of pub- 
lic funds for a project whose benefits will be 
measured in terms of price vector r. When 
producer prices (p) are used as the reference 
prices in the project evaluation, then the 
marginal cost of public funds is equal to one 
plus marginal excess burden. 

However, as argued in the preceding sec- 
tion, benefit-cost analysis is rarely done in 
terms of undistorted prices. When benefits 
and costs are instead measured in terms of 
the final price vector, q3, the first term in the 
above expression becomes one. In this situa- 
tion, the marginal cost of public funds de- 
pends only on the ratio of the indirect loss of 
tax revenue due to the marginal increase in 


` the price of good i, 


àx;(q5, y) 
042; 


Zinn) 


> 


- to the direct increase in tax revenue, 


x,(q4, y). This measure appears throughout 
the literature, although its relationship to 
marginal excess burden and the choice of q, 
as the reference price vector has not been 


i 
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noted. When benefits are measured in terms 
of prices which differ from q,, the first term 
acts as an index number adjustment for the 
differences in the marginal utility of income 
under the two price vectors. 

In Section I, it was argued that the prices 
facing consumers at the time the project 
evaluation is performed, q}, is normally used 
as the reference price vector in benefit-cost 
analysis. For a small project, the pre-project 
and post-project consumer price vectors. will 
be approximately the same (q; = q2). In this 
case, the marginal cost of public funds will 
be greater than one only to the extent that 


"the increase in the tax rates used to generate 


revenue results in a decrease in the weighted 
(by tax rates) sum of the uncompensated 
demands for the taxed goods. 

When marginal excess burden is calcu- 
lated on the basis of the Kay-Pazner-Sadka 
measure, p is used as the reference price 
vector rather than q,. In this case, the 
marginal cost of public funds corresponding 
to the q, reference price vector must be 
multiplied by the ratio of 


dv(q; y) i ‘dv(p, y — EV) 4 
ay ay 


The inverse of the marginal utility of income 
serves the purpose of quantifying the level of 
utility associated with a given price vector. 
The second term in the marginal cost of 
public funds expression is a measure of the 
value, in terms of current consumer prices 
(a2), of thé indirect loss in welfare due to 
raising . an, additional dollar of revenue 


6This is the same as the expression derived by Atkin- 


son and Stern (1974) and Wildasin (1979) when the 
increase in the level of public good provision accompa- 
nying the tax increase is ignored. Neville Topham (1984, 
1985) attempts to link the marginal cost of public funds 
to marginal excess burden, but unfortunately bases his 
analysis on excess burden measures based on either the 
compensating variation or Marshallian consumer’s sur- 
plus. Similarly, Edgar K. Browning (1987) bases his 
measure of the marginal cost of public funds on the 
compensating variation. 

"Where EV is defined implicitly by o(q2, y) =v, 
y — EV). 
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through increasing the tax on good i. In 
order to convert this to a measure of welfare 
loss in terms of price vector p, we must 
multiply by the ratio of the actuai marginal 
utility of income over what the marginal 
utility of income would be if the consumer 
faced. price vector p. The equivalent varia- 
tion is subtracted from the income term in 
v( p, y — EV) so that the marginal utility of 
income is evaluated at the actual current 
level of utility in both the numerator and 
denominator of the ratio. 

The marginal cost of public funds in terms 
of reference price vector p will exceed one if 


8v(q;, y) 
dy 


is greater than 


dv(p, y -EV) 
dy 


even if the increase in the tax in question 
causes no reduction in the demand for any 
of the taxed goods. An intuitive explanation 
for this phenomenon is that the dollar value 
of a given reduction in welfare will be greater 
in terms of p than in terms of q, whenever 
each dollar of income is worth more to the 
consumer under prices q, than under prices 
p. Therefore, if the marginal utility of in- 
come is greater in terms of price vector q3 
than it is in terms of price vector p, then we 
must adjust upward a measure of marginal 
welfare change measured in terms of prices 
q2, in order to convert it to a measure of 
value in terms of price vector p. 


IIl. A Numerical Example | 


A numerical example suggests that chang- 
ing the reference price vector can make a 
great deal of difference. Suppose there are 
two goods, consumption. (the. numeraire) and 
leisure (with relative price w). The con- 


*^ similar point is made by Pazner and Sadka (1980). 
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TABLE 1— MARGINAL Cost OF PUBLIC FUNDS SIMULATIONS 
PARAMETER SETTINGS? 





Wage Elasticities? 


a-113 a-113 
B-0113 80.565 B=0.113 B — 0.565 
y=2663 y=3115 y=2527 y=3318 


a=56.5 a=56.5 





Uncompensated 0.01 0.01 0.07 —0.07 

Compensated 0.35 171 ` 0.40 1.63 
Marginal Cost of Public Funds: ; : 
Reference Wage — $5 (Gross Pretax Wage) 1.26 3.12 1.31 2.97 y 
Reference Wage = $4 1.13 1.77 1.17 1.69 
Reference Wage = $3 (Net Post-tax Wage) 1.01 110 1.05 0.96 
Excess Burden (Kay-Pazner-Sadka Measure): i 

665.0 4450.0 753.0 4245.0 


*The parameters were chosen so that in all simulations $5167.80 revenue is raised. 


See the text for further details. 


ball elasticities were evaluated at w = $3.00 and Y = $1000. 


sumer’s preferences are represented by the 
indirect utility function: 


S - 
(9) v(w,Y) =e” +v- 5+3 


where Y is unearned income and a, fj, and y 
are parameters. The implied expenditure and 
labor supply functions are 


a a Y 
(10) e(w,u) =e 9"u- -w+ —;— 


B BP B 
and 
(11) h(w,Y) =y+ aw + BY. 


This is the specification used by Hausman 
(1981). For an “average” prime-aged mar- 
ried male, Hausman estimated y to be ap- 
proximately 2663, a to be 11.3, and f to be 
—0.113 (where h is annual hours of work, 
and w and Y are both measured in dollars)? 
All calculations using this model were per- 


?The y estimate is conditional on certain demo- 
graphic characteristics; see Gary Burtless' comment on 
Hausman (1981, p. 76) for details. The B estimate is 
actually the median of an estimated distribution for f. 


formed assuming a gross wage of $5 and 
gross unearned income of $1000. 

Table 1 shows the marginal cost of public 
funds for a slight increase in the income tax 
starting from a tax rate of forty percent (net 
wage of $3). The calculations were done for 
four different combinations of parameters in 
order to see how the results vary with the 
labor supply elasticities. The parameters were 
picked such that the same amount of rev- 
enue is raised in each simulation. Three dif- 
ferent reference prices (wages) were used: the 
gross wage, the final net wage, and an inter- 
mediate value. Since the numerator of the 
marginal cost of public funds expression 
(equation (8)) is equal to one when the net 
wage ($3) is used as reference wage, the 
numerator for any other reference wage can 
be easily calculated by taking the ratio of the 
marginal cost of public funds with that refer- 
ence wage to the marginal cost of public 
funds with the reference wage set at $3. 

As one would expect, the excess burden of 
the tax is positive for all four sets of parame- 
ter values. Excess burden is largest in the 
simulations with high compensated wage 
elasticities. Marginal excess burden may be 
quickly calculated from this table by sub- 
tracting one from the marginal cost of public 
funds figures which use the pretax wage as 
the reference wage. Again as expected, the 
two simulations with relatively large com- 
pensated wage elasticities also have the. 
largest values of marginal excess burden. 


D 
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The marginal cost of public funds calcu- 
lated using the net post-tax wage as the 
reference wage follows a pattern quite. dif- 
ferent from marginal excess burden. For the 
parameter values in the first column, which 
are based on Hausman’s (1981) labor supply 
estimates, the marginal cost of public funds 
is 1.26 when it is based on marginal excess 
burden (using the pretax wage as the refer- 
ence wage), but only 1.01 when using the net 
post-tax wage. 

The most striking example in this table is 
the results of the simulation reported in col- 
umn four. In this case, the compensated 
wage elasticity is fairly large, while the un- 
compensated elasticity is negative. When the 
undistorted wage is used as the reference 
price, the marginal cost of public funds is 
approximately three. However, when the net 


wage is used as the reference price, the, 


marginal cost drops below one. Thus; there 
is the potential for large errors in using a 
marginal cost of public funds calculated in 
terms of prices which differ from those which 
are used to measure benefits. Even if there is 
a large marginal excess burden (in terms of 
the Kay-Pazner-Sadka excess burden mea- 
sure) associated with increasing the tax on a 
good, the marginal cost of public funds 


- (when benefits are measured in terms of 


post-tax prices) associated with raising rev- 
enue from that tax may be less than one. 
As the numerical example shows, the 
marginal cost of public funds may be very 
sensitive to the choice of the reference price 
vector. In doing applied welfare analysis, it 


is very important to use a single set of refer- - 


ence prices for all quantities being com- 
pared. Using a marginal excess burden mea- 
sure based on one reference price vector to 
calculate the marginal cost of public funds 
when benefits are calculated using a different 


reference price vector may produce very mis- 


leading results. 
Iv. Concluding Remarks 


This paper has demonstrated the impor- 
tance of carefully linking the marginal cost 
of public funds to the benefits measure actu- 
ally used in project evaluation. A given esti- 
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mate of the marginal cost of public funds is 


of no use unless it is shown how it relates to 


a particular framework for project evalua- 
tion. Estimates of the marginal cost of public 
funds based on unusual welfare measures 
must be treated cautiously since it is hard to 
relate them to standard benefit-cost analysis. 

When benefits are measured in terms of — 
current consumer prices, then the marginal 
cost of public funds depends only on un- 
compensated price elasticities. This result is 
surprising, since excess burden depends on 
compensated demand curves. This paper 
shows that the reason for this paradoxical 
result is that excess burden is normally de- 
fined in terms of undistorted prices. An in- 
dex number problem then arises when trying. 
to use marginal excess burden to calculate 
the marginal cost of public funds, which 
must be based on current consumer prices. 

As noted by previous authors, even if the 
only taxed good has a negative uncompen- 
sated own-price elasticity, the marginal cost 
of public expenditure may exceed one if the 
additional expenditure results in a decrease 
in the demand for the taxed good. Hcwever, 
this is quite different from the usual excess 
burden argument. When public expenditures 
increase consumption of taxed goods at the 
margin, then the public expenditure effect 
will work toward driving the marginal cost 
of public expenditure below one. 


APPENDIX 
Derivation of Equation (8): 


In order to derive equation (8), we must differentiate 
an expenditure function with respect to one of the price 
arguments of tbe indirect utility function it is defined 
over. Triest (1987a) shows that the following relation- 
ship holds: 


Jelp, o(a, y) 
oq; 


je | 

ay 

7G Tay EV)T' 
| dy | 


where EV is defined implicitly by v(q, y) = v(p, y — EV). 
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A proof of this relationship is included here for the sake 
of completeness: 

The first step is to show that de(p, v(q, y))/dv= 
1/[8v(p, y — EV)/8y]: 


e(p,v(q. »)) ^ e(p.v(p, y —EV)) 


=y-EV 
de(p,v(p,y—EV)) ` 
ay 
_ 9e(p,y(p,y—EV)) 8v(p, y -EV) _ 
n av EE ae 
de(p,v(q,y)) 9e(p.v(p,y —EV)) 
dv Hu dv 
de(p,v(a,y)) — 1 
dv . ðv(p, y- EV) 
° 


We can now use this result to derive de(p, v(q, y)) 
3a: 


de (p o(a, y)) _ de(p,v(a,y)) 3v (a, y) 
8qi dv oq; 


| v(a, y) | 
dl în l 
~ [ dv(p, y — EV) 
| ay | 
multiplying and dividing by [dv(q, y)/8y]/[8v(p, y — 
EV)/8 y] yields (by Roy's identity): 


8v(q, y) 
dy 
=— 414”) T35@, y-EV)]: 
| ay | 


We can now use this identity to derive equation (8): 


jue LIC 241 


9492; 
t5 
3o(22.) 
ay 
Xin) T35@, y EV) ] 
[<A] 


[ye B 2 + x; (q2; »| 


J 
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(where EV is implicitly defined by v(q5, y) = v(r, y + 
EV) . 
9v(a5. y) Ye 3x (a27) |. 
<a 2J 3 
y j qzi 
~ | dv(r, y EV) E 
dy 


x (45, ») 


REFERENCES 


Atkinson, Anthony and Stern, Nicholas H., 
“Pigou, Taxation and Public Goods," Re- 
view of Economic Studies, January 1974, 
41, 119-28. 

Auerbach, Alan J. “The Theory of Excess 
Burden and Optimal Taxation," in Alan J. 
Auerbach and Martin Feldstein, eds., The 
Handbook of Public Economics, Vol. 1, 
New York: North-Holland, 1985. 

Ballard, Charles L., Shoven, John, B. and Whal- 
ley, John, “General Equilibrium Computa- 
tions of the Marginal Welfare Costs of 
Taxes in the United States," American 
Economic Review, March 1985, 75, 128—38. 

Browning, Edgar K., “On the Marginal Wel- 
fare Cost of Taxation," American Eco- 
nomic Review, March 1987, 77, 11-23. 

Burtless, Gary, “Comments” on paper by Jerry 
A. Hausman, in Henry J. Aaron and 
Joseph A. Pechman, eds., How Taxes Af- 
fect Economic Behavior, Washington: The 
Brookings Institution, 1981. 

Deaton, Angus, “The Measurement of Wel- 
fare: Theory and Practical Guidelines,” 
LSMS Working Paper No. 7, The World 
Bank, 1980. 

Fullerton, Don, “If Labor Is Inelastic, Are 
Taxes Still Distorting?” National Bureau 
of Economic Research Working Paper No. 
2810, January 1989. 

Hausman, Jerry A., “Labor Supply,” in Henry 
J. Aaron and Joseph A. Pechman, eds., 
How Taxes Affect Economic Behavior, 
Washington: The Brookings Institution, 
1981. 

Kay, John A., “The Deadweight Loss from a 
Tax System,” Journal of Public Economics, 
February 1980, 13, 111-19. 

King, Mervyn A., “Welfare Analysis of Tax 
Reforms Using Household Data," Journal 


566 THE AMERICAN ECONOMIC REVIEW 


of Public Economics, July 1983, 21, 183- 
214. 

Pauwels, Wilfried, “Correct and Incorrect 
Measures of the Deadweight Loss of Tax- 
ation,” Public Finance / Finance Publics, 
1986, 41, 267-75. 

Pazner, Elisha A. and Sadka, Efraim, 2. A 
Burden and Economic Surplus as Consis- 
tent Welfare Indicators," Public Finance/ 
Finance Publiques, 1980, 35, 439—49. 

Stuart, Charles, " Welfare Costs per Dollar of 
Additional Tax Revenue in the United 

-- States," American Economic Review; June 
1984, 74, 352-62. 

Topham, Neville, *À Reappraisal and Recal- 
culation of the Marginal Cost of Public 
Funds," Public Finance ee Publi- 
ques, 1984, 39, 394—405. 


JUNE 1990 


, “Excess Burden and the Marginal 
Cost of Public Spending,” Economic Let- 
ters, 1985, 17, 145-48. 

Triest, Robert K., (1987a) “The Effect of In- 
come Taxation on Labor Supply,” unpub- 
lished doctoral dissertation, University of 
Wisconsin-Madison, 1987. 

, (1987b) “The Relationship Between 
the Marginal Cost of Public Funds and 
-Marginal Excess Burden,” Working Pa- 
pers in Economics No. 194, Johns Hop- 
kins University, 1987. 

Varian, Hal R, Microeconomic Analysis, 2nd 
ed., New York: Norton, 1984. 

Wildasin, David E., “Public Good Provision 
with Optimal and Non-Optimal Commod- 
ity Taxation: The Single-Consumer Case,” 
Economic Letters, 1979, 4, 59—64. 


Effects of Spatial Price Discrimination 
on Output, Welfare, and Location 


By HONG HWANG AND CHAO-CHENG MAI* 


It is well recognized that the problem of 
price discrimination must address itself to 
the question of whether total output is greater 
or less under discriminatory monopoly than 
under simple monopoly. If discrimination 
does not increase output, such pricing is 
undesirable from the standpoint of social 


welfare even though it proves to be more : 


profitable for the monopolist. To answer this 
question, nearly 50 years ago, Joan Robin- 
son (1933) advanced the criterion of “ad- 
justed concavity” to decide whether price 


discrimination between two submarkets - 


would increase or decrease total output if a 
monopolist were seeking maximum profits. 
Since then, a number of writers have ex- 
tended and further revised her analysis, 
including Edgar Edwards (1950), Eugene 
Silberberg (1970), Melvin Greenhut 
and Hiroshi Ohta (1976), James Smith and 
John Formby (1981), Richard Schmalensee 
(1981), John Formby et al. (1983), Hal Var- 
ian (1985), Jun-ji Shih, Chao-cheng Mai, and 
Jung-chao Liu (1988), and Hiroshi Ohta 
(1988). As has been well known since 1933, 
when the two demand curves are linear, dis- 
crimination does not change output. But, as 
demonstrated by Schmalensee in a linear 
case and generalized by Varian, welfare is 
reduced by allowing discrimination. 

The above analyses are examined in the 
context of traditional non-spatial economy 


*Department of Economics, National Taiwan Uni- 
versity, Taipei, Taiwan 10020, R.O.C. and Sun Yat-Sen 
Institute for Social Sciences and Philosophy, Academia 
Sinica, Nankang, Taipei, Taiwan 11529, R.O.C. We 
would like to thank two anonymous referees, Richard 
Arnott and H. Ohta for their very useful suggestions, 
which led to substantial improvement in the paper. Part 
of the work was done while the first author was a 
visiting, associate professor at the economics department 
of Clemson University. He would like to thank the 
department for hospitality. 
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in which distance costs are insignificant and 
negligible. In a spatial world, Greenhut and 
Ohta (1972) considered a case where con- 
sumers' :demand curves are linear and the 
radius of a monopolist’s market area is a 
variable and then demonstrated that a spa- 


‘tial monopolist who adopts a spatially dis- 


criminating price would always produce a, 


‘larger output than under an f.o.b. mill price 


policy. William Holahan (1975) further veri- 
fied that spatial price discrimination results 
in greater welfare than a mill price policy. In 
addition, Martin Beckmann (1976) analyzed 
the same problem by assuming that the ra- 
dius of a monopolist's market area is fixed 
exogenously and showed that the total out- 
put under discriminatory pricing is the same 
as under f.o.b. mill pricing. Beckmann's re- 
sult is the same as Robinson's. Ohta (1988) 
extended these analyses by showing, on a 
more disaggregate level, who gains from dis- 
crimination under various alternative de- 
mand and cost conditions. Common to these 
writers. is their basic assumption that the 
firm's location is predetermined. This as- 
sumption must be relaxed, however, insofar 
as location is an important variable and firms 
choose different locations under different 
pricing systems. 

. In this paper, we will treat the location as 
an endogenous variable and study the effects 
of spatial price discrimination on output, 
welfare and location of a monopolist in the 
context of spatial economy. More specifi- 
cally, we will investigate the following issues: 
(i) Will the Robinson output theorem and 
the Schmalensee welfare theorem be un- 
scathed in a spatial economy, when the ra- 
dius of the monopolist’s market area is fixed 
while its location is a decision variable? (ii) 
What is the optimum location under discrim- 
inatory pricing compared with the one under 
f.o.b. mill pricing? 
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FIGURE 1. A LOCATION LINE -- 


.I. The Spatial Model with an 
Endogenous Location 


Consider a simple setting: A spatial 
monopolist will locate its plant on a line in 
between two different markets, each of which 
is located at a point. It serves both markets 
by charging an f.o.b. mill price or discrimi- 
natory prices without fear of competition or 
resale: between markets. The markets are s 
miles apart and are connected by a highway 
as shown in Figure 1. Let x and (s — x) be 
the distances of its plant from markets 1 and 
2, respectively. i 

‘For the purpose. of this paper, we ‘shall 
assume linear demand curves at each mar- 
ket: 


(1) a= 


(D ama ip mac b[m, ers -3)]. 


a — Bp, = a— B(m,* rx) 


where p,, m; and q; are the delivered price, 
mill price, and sales in market i (i=1.2), 
respectively; r is the constant transport rate; 
and a, B, a, and b are all positive constants. 
The slopes of the demand curves are mea- 
sured by 


1 1 
Ses anid eec 


B b 


Hence the greater the values of £ and b, the 


flatter are the corresponding demand curves. 

: It warrants mention that in a spatial econ- 
omy, the total price paid by consumers (i.e., 
delivered price) differs from the net market 
price (net of transportation costs) received 
by the producer (i.c., mill price). 
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. The demand elasticity of market i as per- 
ceived by the producer is defined as follows: 


m, 


. with 


(4) &95-—-— 


with | 


where a JB and a/b are the price intercept 
values of demands. 

Two related notés concerning equations 
(3) and (4) are warranted. First, the greater 
the price intercept value of demand, the less 
is the demand elasticity evaluated at the 
same price. Second, a rise in x (i.e., moving 
away from market 1) increases the demand 
elasticity in market 1 but decreases that in 
market 2. 

In what follows, we shall use the spatial 
framework to derive the simple monopoly 
and the’ discrimination solutions, respec- 
tively. i 


: A. Simple Spatial Monopoly 


Under f.0.b. mill pricing, a spatial mono- 
polist will quote a single price for its prod- 


VOL. 80 NO. 3 


uct. The profit function is given by 
(5) v*(m,x) 
7 [e- B( m rx)]m- c) 
+{a—b[m+r(s—x)]}(m-c), 


where c denotes the constant marginal cost. 

The monopolist operates on two variables 
to maximize its profit: the f.o.b. price, m, 
and the location, x. This can be accom- 
plished by breaking the profit maximization 
problem down and solving it in two stages: 
select the optimum location, and then deter- 
mine the optimal price. We proceed by max- 
imizing profit with respect to m at any given 
location to obtain the comparative static ef- 
fect of x on m. The first-order condition for 
maximization is as follows: 


(6) —— (a a)- Brx — br(s — x) 

4 (B b)c-2( B b)m 
- 0, 

with the second-order condition: 


8? ns 


(7) y B5) <0. 


Solving (6) yields 


(8) m* 
(a a) - Brx —br(s- x) (B b)c 
i 2(B +5) , 
where ms is the equilibrium price under 


simple monopoly. 
From (8), we can easily derive the effect of 
x on m: 


dm*$  —r(B-b 
(9) cA dL ES 


An intuitive interpretation of equation (9) 
follows. First, the aggregate demand curve 
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faced by the producer is derivable as Q — 
q,- d= [(a+a—brs) — r8 —0)x] - (B + bm. 
If the demand in market 1 is steeper (flatter) 
than that in market 2, that is, B < b(B > b), 
then we see that moving the location away 
from market 1 increases (decreases) the 
quantity intercept and hence shifts out (in) 
the aggregate demand curve so that the opti- 
mal mill price increases (decreases). 

Moreover, substituting (8) into (1) and (2) 
yields 


(108) af = sa a Be—261x) 
x(B+b)-B(ata) 
+ B?rx  Bbr(s — x)],. 
(10b) -T 


x { [2a — be -2br(s — x)] 
x(B--b)-b(a- a) 
+ Bbrx  b^r(s - x)]). 
We now turn to the problem of choosing 
the optimal location. Via the envelope theo- 


rem of comparative statics and noting that 
(mS — c) > 0, we can obtain! 





(11) E 
dx dx 
--r(m*-c)(B-b)z0 
as BSb 
and 
(12) oe oo 


Ht is easy to demonstrate that the solutions for this 
problem are unchanged if we solve for a simultaneous 
optimization over prices and location. 
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Equation (12) shows that the optimized 
profit function is strictly convex with respect 
to x. It implies that any intermediate loca- 
tions between markets 1 and 2 are strictly 
excluded in our simple location problem. 
Thus, combining (11) with (12), we can con- 
clude that 


(13a) x5=0if B» b, 


(13b) xS-sif «b. 

Equation (13) is a very sensible outcome. 
As mentioned before, when f « b, increas- 
ing x shifts. out aggregate demand (i.e., total 
sales) and hence increases the monopolist's 
profits. As such, the monopolist will select 
its optimum location at market 2 with a 
flatter demand. The converse is true if B > b. 
Thus, the slope effect of demand alone mat- 
ters in location decisions under mill pricing. 

From equations (10a,b) and (13a,b), we 
can evaluate total output under simple 
monopoly as follows: 


(14a) @Q5(x=0) 
= di lo + G\x=0 
= 5 [(a+a)— brs 
—(B+b)c] if B» b, 
(14b) Q3(x-2s) 
- cst eos 
1 
7 3l(a*a)- Brs 
~(B+b)c] it 8 « b. 


B. Spatial Price Discrimination 


‘We shall now consider the case where the 


monopolist charges different mill prices at 
each market. The profit function is specified 
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as follows: 
(5) ?( mj, m, x) 
= [«- B(m * rx)] (m, — c) 
t(a-b[mi*r(s—x)]) 
x(m;-c). 
The padua will nain its profit at | 


any location by choosing; m, at each market 
to yield the first-order conditions: 





On? 
(16a) Jm, 7 a -2Bm, - Brx + Bc — 0, 
1 
ðn? 
(16b) a — a —2bm;- br(s— x) 
2 l 


+ bc=0. 


By solving (16a,b), we obtain the maximiz- 
ing value of m, and m, for a given value - 


of x: 
(17a) nb esr. 
(17b) n?-2[2-76-2-4. 


The effects of a change in the producer's 
location on the equilibrium mill prices can 
be evaluated bv 








is dm? r 
(28a) dx r 
isE dm? r 

(18b). dx 2^. 


It follows from (18) that under spatially 
discriminatory pricing, the producer will ab- 
sorb one-half of the transport cost which is 
otherwise charged to the consumers? This 


?See, for example, the óriginal proof in H. W. Singler 
(1937). 
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result comes from the linearity of the indi- 
vidual demand functions. 

By substituting (17a,b) into (1) and. (2), 
we get 


aQa— Drx — pc 
on pe 
ED 
f :a—br(s—x)- bc 
(19b) gui HR 


2 


Note that m? , m?, and the corresponding 
v? are functions of x. Again by the envelope 
theorem of comparative statics, we have 


(20) SENE 
-r[b(m,—c)-B(m,— 9] 
(21) PPO Pid 


Equation (21) indicates that the profit 
function is strictly convex with respect to x. 
Thus, there is no possibility of an intermedi- 
ate location. Furthermore, the choice be- 
tween markets 1 and 2 as the optimum loca- 
tion depends upon the relative magnitudes 
of @?(x=0) and vP(x-s) whose values 
are derived as follows: 


(22) 1?(x=0) = zs (a-e)? 
: brs — be)? 
tape rs— bc), 
(3) e" (xs) e qs (a rs - Bo? 
) w(x=s =p a 
1. 2 
tap be)’. 


From equations (22) and (23), we can 
calculate the difference of the two profit 
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levels: 


(24) A=7?(x=0)-7?(x=5) 


=F |a- a)t[e+ 2 2) 2! 


The sign of À is in general ambiguous as it 
depends on the two effects: (i) the quantity 
intercept effect, (a—a), and (ii) the slope 
effect, (b — B). To signify the effect of an 
endogenous location on the: output levels 
under the two pricing schemes, we shall as- 
sume in what follows that a and a are equal 
(i.e., a— a). The case in which a# a is ex- 
plored in the Appendix. Under such a cir- 
cumstance, equation (24) reduces to 


(25) a-Z(e« 2) 0-52 
| BSD. 
It immediately follows from (25) that 
(26a) x =0if B <b; | 
(26b) SE if B» b. 
By substituting (17a,b) into (1) and iQ 


and noting (26a,b), we get total output un- 
der price discrimination: 


(27a) Q?(x=0) 


= di lxo + 47 lx=0 

= (ta) brs e( 8+5)], 
(27b) Q?(x=s) 

= Pleat d2 less 

- ls o) -Brs - (B b)]. 


II. Comparisons 


We are now in a position to make compar- 
isons. of the optimum location, output and 
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welfare under the two pricing policies. First 
of all, let us consider the optimum location. 
From equations (13a,b) and (26a, b), we can 
conclude that 


S=0 and 


(28a) 
xP=s if B» b, 
(28b) xS=s and 
=0if B«b.. 


This leads to the following proposition: 


PROPOSITION 1: The optimum locations 
under both pricing policies are corner solu- 
tions, Assuming that the two linear demand 
curves have an equal quantity intercept, the 
monopolist locates at the market with a flatter 
demand under f.0.b. pricing while at the mar- 
ket with a steeper demand under discrimina- 


tory pricing. 


This is a very striking result. The location 
pulls under the two pricing systems are so 
diverse that the patterns of the optimum 
locations are completely different. 

Next, . the comparison of output effects 
under f.o.b. pricing and discriminatory pric- 
ing will be made by observing the following 
two cases. Consider first the case of b> f, 
where x5=s and x?=0. Using equations 
(14a, b) and (27a,b), we can evaluate 


(29a) Q?(x=0)-Q*(x=s) 


clos -9)] <0. 


D g the other case of b « B, in which 
— 0 and x? =s, yields 


Q9b) Q^(x-s)-Q*(x-0) 
1 
-3ls(5- 8)] « 0. 


From (29a,b), we establish the following 
proposition: 


PROPOSITION 2:. When the monopolist’s 
location is treated as an endogenous variable 
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and the two linear demand curves have the 
same quantity intercept, total output under 
discriminatory pricing is definitely. less than 
that under f.o.b. pricing. 


This outcome is sharply different from 
that obtained in the existing literature. In 
Spaceless economics, Robinson (1933) and 
Silberberg (1970) demonstrate that total out- 
put is unchanged by discrimination when 
demand curves are linear. In tbe spatial 
economy, with predetermined location and 
fixed market radius, Beckmann (1976) ob- 
tains the same result.? On the other hand, 
Greenhut and Ohta (1972) assume that mar- 
ket radius is variable and the firm’s location 
is given exogenously to show that total out- 
put is greater under spatial discrimination 
than under simple spatial monopoly, which 
is contrary to ours. 

Finally, let us turn to the comparison of 
welfare effects under the two pricing policies. 
Within a nonspatial framework, Schmalensee 
(1981) has demonstrated that in a linear 
demand case, although total output remains 
unchanged, Marshallian welfare (consumers' 
surplus plus producer's profit) is reduced by 
allowing discrimination. Apparently, it is im- 
portant to see whether this result can be 
applied to the spatial economy. First of all, 
using the demand functions (1) and (2), we 
can compute the consumers' surplus at each 
market as follows: 


l/a 1 
(30a) CS,= 2B. pi} i= 3p 4^ 
30b) CS Pee : 2 
( ) =z p n]a7 376 


Next, the producer's profit accruing from 
market i is defined as 


(31) m-(m,—-c)q, for i=1,2. 


Using equations (30a, b) and (31), the wel- 
fare under simple spatial monopoly is deriv- ~ 


For any given x, it can be easily shown that Q? = 
s 
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FIGURE 2. WELFARE COMPARISON 


able as 
(32) WS=CS$+ CSE + m? mj, 
1 
— 2Bb(B- b) 


x (8*5) b(af + Blas 


C2Bb(af- a$)). 


where q? and q§ are defined in (10a, b). 
Similarly, the welfare under spatial price 
discrimination is given by 


(33) WP=CSP+CSP + nP 42, 


where q? and q? are defined in (19a, b). 
Comparing (32) and (33), we obtain 


(34) w?-ws20, 


as it depends on particular parameter values 
of demand, cost, and transport rate struc- 
tures. For example, let s=c=B=1, a=a 
=10 and b — 2.6, the relation in (34) can be 
graphed in Figure 2 in terms of transport 


HWANG AND MAI: EFFECTS OF SPATIAL PRICE DISCRIMINATION 573 


rate structure. It shows that WP 2 WS if 
r Z r* —1.1749. 


PROPOSITION 3: When the firm's location 
is treated as an endogenous variable, the wel- 
fare level under spatial price discrimination 
could be greater than that under simple 
monopoly even if both demand curves are lin- 
ear. 


This result weakens Schmalensee's conclu- 
sion derived from.a nonspatial framework 
that f.o.b. mill pricing is always preferred. It 
is also different from Holahan's (1975) claim 
derived from a spatial model that the welfare 
under discriminatory pricing is greater than 
that under.mill pricing. This difference is 
due to his assumption that the monopolist S 
location is fixed regardless of pricing struc- 
tures. 


HI. Conclusions 


This paper has endeavored to examine the 
effects of spatial price discrimination on out- 
put, welfare, and location of a monopolist. 
The main conclusions are provided as fol- 
lows: 


(1) When the two demand curves are linear, 
there is no possibility of an intermediate 
location. The monopolist will locate at 
the site of the market with steeper de- 
mand under discriminatory pricing, while 
at that of the market with flatter demand 
under f.o.b. pricing. 

(2) When the location is treated as a deci- 
sion variable and both demand curves 
are linear with the same quantity inter- 
cept values, then the total output under 
discriminatory pricing, is less than that 
under mill pricing, but the welfare effect - 
of spatial price discrimination is indeter- 
minate. These results are sharply differ- 
ent from the ones derived in previous 
studies. 


Although our analysis may be too simple 
to draw the conclusion that price discrimina- 
tion should be prohibited, it has at least 
demonstrated an example in a: spatial con- 
text that questions the blanket prohibition of 
price discrimination. 
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APPENDIX 


This appendix discusses the optimum location under 
discriminatory pricing when the two quantity intercept 
values are not equal. 

Let us first rewrite equation (24) in the text as 
follows: 


A M=a?(x=0)—7?(x=s) 
-2 fa- aye :je- p]: 0. 


If A7 0, then it implies that the firm will locate at 
market 1 (i.e, x — 0). The converse is true if A « 0. In 
what follows, let us consider the case of A » 0. Under 
such a circumstance, (Al) may be rearranged in the 
following form: : 


"n a (; 5 +(1-3) ae 
—>|—ļ- -—Jet-kM 
ue 8 AE | ru 

Note that (c+ rs) < min[a/B, a/b] for both mar- 
kets to be served under conditions of extreme location 
either at x — 0 or x = s. With this note in mind, we now 
consider the following two sets of conditions, both of 
which yield A > 0. . x 

Case (i): a/B > a/b and f» b. 

This case means that the less elastic demand (or 
equivalently, the demand with a smaller price intercept) 
is flatter than the more elastic demand as pictured in 
Figure Al. By noting a/b»(c-rs/2), this set of 
conditions guarantees condition (A2) above. 

Case (ii): «2 a and B « b. 

The set of conditions directly establishes A> 0 via 
(Al). These conditions also imply a/8 > a/b as shown 
in Figure A2. This states that even if the less elastic 
demand is steeper than the more elastic demand (i.e., 
a/B> a/b and B7 b), the sign of A is still positive 
provided that a> a. We then conclude that the demand 
elasticity alone matters in location decisions under dis- 
criminatory pricing, provided a2 a. This proviso re- 
quires that the less elastic demand be not smaller than 


JUNE 1990 





a 


s < da > 
>58 b an a 


ay 
8 


wy 


FIGURE A2. A> 0 


a 


,B<bbuta<a 


a o 
8 R^ 


v 





0 a a Q 
FIGURE A3. AZ 0 


the more elastic demand for any price P « a/b. In 
contrast to the elasticity criterion, this proviso refers to 
the quantity intercept values and therefore may be 
termed as the (demand) volume criterion, which re- 
quires the monopolist to locate in the market with 
greater volume of demand: 

Combining cases (i) and (ii), we may conclude that 
as long as this proviso holds, the elasticity criterion is 
sufficient to yield A> 2 regardless of the demand slope 
(i.e., no matter whether B > b or fi < b), that is, discrim- 
inatory monopolist will locate at the market with less 
elastic demand if this proviso holds. However, the sign 
of À is not definite if the proviso is violated as drawn i in 
Figure A3. a 
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On the Equilibrium Yen-Dollar Rate 


By HiROSHI YOSHIKAWA* 


This paper presents a definition of the 
long-run equilibrium exchange rate that em- 
phasizes the role of supply factors in addi- 
tion to the more traditional price differential 
variables. Using this definition, we estimate 
the long-run equilibrium yen-dollar rate dur- 
ing the period of 1973—87. This calculation 
demonstrates that supply factors are in fact 
very important determinants of the long-run 
movement of the exchange rate. 

Despite the central role the exchange rates 
play in the economy, economists seem to 
doubt their own ability to explain, let alone 
predict, exchange rate movements. The rea- 
son is that in many cases structural exchange 
rate equations do not significantly outper- 
form various other models that do not ap- 
peal to economic theory such as the random 
walk. (See, for example, Richard Meese and 
Kenneth Rogoff, 1983.) While we also share 
this pessimism concerning our ability to 
forecast exchange rates in the short run (in- 
tradaily to monthly or quarterly), we argue 
that it is possible to reasonably explain, if 
not predict, long-run movements. 

The long-run equilibrium rate is most of- 
ten thought to be determined by purchasing 
power parity (PPP), which tends to focus 
only on price differentials. Medium-run di- 
vergences of the exchange rate from the 
long-run equilibrium are therefore com- 
monly identified with fluctuations in the in- 
flation-adjusted real exchange rate. This pa- 
per demonstrates, however, that the simple 
PPP is unsatisfactory because it emphasizes 


*Faculty of Economics, University of Tokyo, Tokyo, 
Japan. I am indebted to Professors Tatsuo Hatta, 
Takatoshi Itoh, Masahiro Kuroda, Kenichi Ohno, Ku- 
nio Okina, Yoshiyasu Ono, Kazuo Sato, Kazuyuki 
Suzuki, Kazuo Ueda, and two anonymous referees for 
their helpful comments and suggestions. I also acknowl- 
edge with gratitude the excellent research assistance of 
Jennifer Holt Dwyer, Tohru Inoue, and Elizabeth 
Norville. The research reported here was supported i 
the Kikawada Foundation. 
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only the inflation differential. In the more 
elaborated theory of purchasing power par- 
ity, of course, the potential importance of 
supply-side real factors as critical determi- 
nants of the exchange rate has been well 
recognized. (See, for example, John M. 
Keynes, 1923; Rudiger Dornbusch, Stanley 
Fischer, and Paul A. Samuelson, 1977; Alan 
Stockman, 1980.) Nevertheless, in empirical 
applications and policy discussions these 
supply-side real factors have been curiously 
ignored: To our knowledge, an exception is 
David Hsieh (1982). The main point of this 
paper is to show that these supply-side fac- 
tors actually have been very important deter- 
minants of the yen-dollar rate over the pe- 
riod of 1973 to 1987. 


I. The Definition of the Equilibrium Rate 


In this section, we present a definition of 
the long-run equilibrium exchange rate. We 
start by considering the tradables sector of 
the Japanese economy. We assume that two 
inputs, labor and natural resources (includ- 
ing energy), are necessary for production. 
For simplicity, we also assume a fixed coef- 
ficient production function. Accordingly, the 
production process can be characterized by 
two parameters, a and b. Here a and b are, 
respectively, the labor input and the natural 
resources input necessary to produce one 
unit of the tradable good. 

Under the assumption of perfect competi- 
tion, we are led to the following relationship: 


(1) 


Here p and p, are the domestic (yen) 
prices of tradables and natural resources, 
respectively, and w is the nominal wage rate. 
For simplicity, we have assumed zero profits 
in (1), although this assumption is not essen- 
tial to our analvsis. If we allow for a normal 
rate of profit and call it r, we need only to 
add rp to the left-hand side of equation (1). 


wa + pgb — p. 
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As long as we hold the normal rate of profit 
r constant, this-inclusion will not affect our 
analysis. 

Assume that Japan exports the tradable 
goods and imports the natural resources that 
are necessary for the production of the trad- 
ables. In 1980, for example, 96: percent of 
Japan's exports consisted of manufactured 
products, whereas 70 percent of imports were 
raw materials including oil. We take as given, 
in dollars, both p*, the international price of 
the tradable good, and p£, the international 
price of the natural resources. Since most 
manufactured products are more or less dif- 
ferentiated, the “law of one price" does not 
exactly hold. We would therefore expect the 
following relation between p and p*: 


(2) p 7 ep*/e. 


Here e (yen/dollar) is the nominal exchange 
rate, and £ is random deviations due to such 
factors as changes in the relative demand for 
Japan's exportables. We assume that the ex- 


pected value of e, E(e) is one. For the 


natural resources, we have 


(3) e Pr=ePR- 
With (2) and (3), (1) can be rewritten as _ 


(4) wa + epxb = ep*/e. 


If we set p* at the U.S. unit costs, we have 


(5) . p*-w*a*-4 pxb* 


where a* and b* are the labor and energy 
coefficients and w* is the nominal wage rate 
in the United States. When (5) is substituted 
into (4), we obtain 


(6) e=e(w/w*) 
x[a/(1- yes (Te EL 


where ¢*= is the relative price of 


epi /p* 
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natural resources in terms of Japan's ex- 
portables. Since E(c) =1, we finally obtain 


(7) EG OU) 
x[a/(1- Je +. (78 e] 


for a long-run trend of yen-dollar rate. (7) 
defines the long-run equilibrium exchange 
rate. The equilibrium exchange rate depends 
on the following factors: 


(i) The relative nominal wage level in Japan 

and in the United States 

(ii) The terms of trade (the relative price of 
natural resources, such as oil, and of 
manufactured goods in the international 
market) 

(iii) The technological parameters of the 
Japanese export sector. 


The simple PPP tends to emphasize the 
price differential, which in this case is factor 
(i) or the nominal wage differential. In con- 
trast to factor (i), factor (i), the terms of 
trade, and factor (iii), the productivities, are 
real factors. 

At this point, it is appropriate to clarify 
the meaning of the above equilibrium ex- 
change rate. The long-run equilibrium rate 
defined by (7) is the exchange rate at which 
the profit rate of the Japanese export sector 
remiains constant, given exogenous move- 
ments of wages, productivity, terms of trade 
and export prices. It is certainly compatible 
with a variety of different models.! This def- 
inition of long-run equilibrium rate does not, 
however, take into account capital accumu- 
lation and intertemporal saving behavior or 
current account imbalances. In this sense, it 
is simply one version of PPP by which we 
can measure the relative importance of mon- 
etary and real factors in determining the 
long-run movement of the exchange rate. 

Given this equilibrium rate, we observe 
the following relationship between the do- 


14 classical model that produces the equilibrium rate ` 
defined by (7) is in Yoshikawa (1987). 


578 THE AMERICAN ECONOMIC REVIEW 


mestic and foreign prices of the tradables: 


(8) E(e) p*/p=1. 


This means that in order to hold constant 
the competitiveness of both foreign and do- 
mestic countries with regards to the trad- 
ables sector, the p adjusted real exchange 
rate must be one. 

However, the real exchange rate calcula- 
tion is often based on more broadly defined 
price deflators such as the GNP deflator or 
the CPI. In such instances, if we let p, p* 
represent the domestic and foreign deflators, 
respectively, (8) is violated for é, even though 
(9) holds: ; 


(9) ép* /p —1. 
To be more specific, the GNP deflators of 


home (Japan) and foreign (the U.S.) coun- 
tries are respectively defined as 


(10) p=8p+(1-8) py 
and 


(1) — j*-8*p*-(1—8*) pg, 


where py and px are the domestic and 

foreign prices of nontradables, and 6 and 6* 

are the expenditure shares of tradables in 

Japan and the United States, respectively. 
Under the condition 


PN 
(122) ê*+(1-8*)| — 
. p 
PĒ\ 
<8+(1-6 (=), 

(1-8) I: 
if (9) holds, we have 
(13) p<ép*. 
If 8* = 6, (12) boils down to 


* 
(14) ee 
Py PN 


Therefore, if high productivity in Japan 
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makes tradables relatively cheaper in Japan 
than in the United States, Japan’s competi- 
tiveness is increased even though the real 
exchange rate in terms of the GNP deflator 
remains constant. This point is also empha- 
sized by Richard C. Marston (1986) and 
Ken-ichi Ohno (1986). 


Il. The Estimation of the Equilibrium Rate 


In this section, we estimate the equilib- 
rium yen-dollar rate on the basis of (7). We 
adopt the two-country framework, using 
Japan as the home country and the United 
States as the foreign country. Our analysis is 
limited in the sense that we take into ac- 
count only one trading partner. Based on 
this assumption, we calculate (7). 

First, a and b are calculated from the 
input coefficients (annual data) of thirteen 
manufacturing industries as categorized by 
SNA, which are used in the Economic Plan- 
ning’ Agencys medium-term multisector 
model. We identify Japan’s export industries 


_ as fibers, chemicals, primary metals, general 


machinery, electrical equipment, transporta- 
tion equipment, and precision equipment - 
(i=1,2,...,7). Using the export volume of 
each industry, X, (i=1,2,...,7) the weights 
are calculated: : 


(5) 0,=X,/E2X, (i71,2,...,7). 


Here X; refers to exports (in dollar terms) 
after passing customs. Data are taken from 
the Ministry of Finance's Foreign Trade Out- 
look. Using these weights, 


(16) a= Y]o,a!, 


where a' is each industry's labor input co- ` 
efficient. 
Similarly b is calculated as follows: 


(17) b= Yio;[ bio + b; perro bperro,ol» 


where b, is the ith industry's input coeffi- 
cient for oil and natural gas. Here b, perro 
and bprrro,o are respectively the ith indus- 
try's input coefficient for petroleum products 
and the petroleum products sector's input 
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TABLE 1— LABOR AND ENERGY COEFFICIENTS IN 


JAPAN’S Export INDUSTRIES 
Labor Energy 
Coefficient Coefficient 
1973 0.089 0.026 
1974 0.081 0.030 
1975 0.087 0.024 
1976 0,079 0.020 
1977 0.074 0.018 
1978 0.067 0.016 
1979 0.061 0.017 
1980 0.058 0.017 
1981 0.056 0.012 
1982 0.054 0.010 
1983 0.053 0.009 
1984 . 0.049 0.008 
1985 0.049 0.008 
1986 0.044 0.006 
1987 0.041 0.004 


coefficient for oil and natural gas. Using this 
method, a and b are prepared annually. The 
annual series for 1973-87 are shown in Table 
1. We observe that in the 10 years-plus pe- 
riod after the first oil shock, the labor coef- 
ficient and the energy-natural resources co- 
efficient of Japan's exporting industry were 
down to about one-half and one-sixth, re- 
spectively. 

Next, 1* was calculated from the contract- 
based (mostly dollar-based) export price of 
manufactured goods and the import prices 
of oil, gas, and coal. The data source for this 
calculation is the Bank of Japan's Annual 
Statistics of Prices. Here w is the nominal 
wage in Japan's manufacturing industry, and 
is taken from the Bank of Japan's Monthly 
Economic Statistics. 

Finally, to obtain the equilibrium rate de- 
fined by (7), we must measure the U.S. unit 
costs of the products that correspond to 
Japan's exportables. To calculate these unit 
costs, we start with the following definition: 


w*L* + py M* 


(18) Unit Cost = o j 
where w* and př, are the nominal wage and 
price of materials and energy. L*, M*, and 
Q* are the U.S. labor, material inputs and 
output, respectively. The Annual Survey of 
Manufacturers, Statistics for Industry Groups 
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and Industries compiled by the U.S. Depart- 
ment of Commerce provides the necessary 
data. Using these data, we calculate the unit 
cost as follows: 


j w*L* 
(19) Unit Cost = p* x | —— 
P*Q* 








In this equation, w*L*, pf M*, and P*Q* 
are, respectively, the payroll of all employ- 
ees, the cost of materials, and the sum of the 
cost of materials and value added by manu- 
factures; p* in (19) is WPI. We prepared 
this unit cost on an annual basis for the 
above-mentioned seven industries, which 
correspond to Japan's export industries, and 
then with weights (15) calculated the econo- 
mywide unit cost. This is the unit cost w*a* 
+ PgE* in (7). 

To calculate (7), we need a base year. 
Here we consider two cases: The first case 
presumes that the actual exchange rate was 
close to (7) prior to the first oil shock (the 
first quarter of 1973), and the second case 
takes 1975 when Japan's current account 
was nearly in balance. 

The estimated long-run equilibrium rates 
are shown in Table 2. They correspond to 
two different base years. For the sake of 
comparison, we also show simple PPP rates 
based on economywide wages that are sup- 
posed to reflect general inflation differential 
but do not take into account any productiv- 
ity differential. Finally, we show the actual 
yen-dollar exchange rate. They are all shown 
in Figures 1 and 2. 

Our equilibrium rate differs substantially 
from simple PPP rate based on only econo- 
mywide wages. Since our equilibrium rate 
broadly tracks the trend of the actual ex- 
change rate, we argue that the supply-side 
factors such as labor and energy productivity 
are very important determinants of the ex- 
change rate in the long run. The results 
obtained are reasonably robust with respect 
to changes in the base year. 

We can measure the relative impact of 
each factor on the equilibrium rate by calcu- 
lating the hypothetical rate. In order to do 
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TABLE 2— WAGE-ADJUSTED RATES AND EQUILIBRIUM EXCHANGE RATES 


Base Year 1973 Base Year 1975 
Actual Equilibrium — Wage-Adjusted Equilibrium Wage-Adjusted 

Rate Rate Rate Rate ^ Rate 
1973 271.70 271.70 271.70 284.37 228.30 
1974 292.08 271.79 324.83. 284.46 272.95 
1975 296.79. 283.56 353.21 296.79 296.79 
1976 269.55 278.10 370.25 291.07 311.11 
1977 ^' 26851 269.02 373.39 281.57 313.75 
1978 210.44 240.43 367.82 251.64 309.08 
1979 219.14 217.15 361.96 227.28 304.15 
1980 226.74 199.70 359.84 209.01 302.37 
1981 220.54 182.93 348.47 191.46 292.81 
1982 249.08 176.63 347.12 184.87 291.68 
1983 237.51 175.68 ~ 339.78 183.87 285.51 
1984 237.52 166.04 337.53 173.79 283.62 
1985 . 238.54 156.58 33927 163.89 .285.08 
1986 168.52 148.13 342.18 155.04 287.52 
1987 144.64 140.53 . 339.95 147.08 285.65 
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FIGURE 1. EQUILIBRIUM AND ACTUAL EXCHANGE RATES (BASE YEAR, 1973) 


so, we assume that one factor remains con- 
stant at the base-year level. These hypotheti- 
cal rates for the constant a, b, the U.S. unit 
labor cost and energy cost, are shown to- 
gether with the equilibrium rate for the 1975 
base-year case in Table 3. We can see, for 
example, that the equilibrium ‘rate would 


have been 303.77 yen per dollar in 1987. if. 


the labor coefficient in Japan had remained 


unchanged at the 1975 level. As we can see 


in Table 1, in reality, the labor coefficient in 


Japan declined sharply between 1975 ‘and - 


1987, and therefore the equilibrium rate was 
147.08 yen per dollar. The divergence be- 
tween the equilibrium rate and the hypothet- 
ical rate for the constant b, the Japanese 
energy coefficient, is rather small, which 
means that the effect of changes in b on the 
equilibrium rate is not substantial Thus, 
overall, an increase in labor productivity in 


VOL. 80 NO. 3 





Actual Exchange Rites 
170}  ——----—— Epiliria idog bte 
160 ; 
naa ge Hj hu 
450 Vae juste Exchimge 
140 4 — 





YOSHIKAWA: YEN-DOLLAR RATE 581 








1973 1974 1975 1976 1977 1978 1979 1980 1981. 1982 1983 1984 1985 1986 1987 


FIGURE 2. EQUILIBRIUM AND ACTUAL EXCHANGE RATES (BASE YEAR, 1975) 


TABLE 3— THE HYPOTHETICAL EQUILIBRIUM RATE 


Constant 

Japanese 
Equilibrium Labor 

Rate Coefficient 

1973 284.37 27120 
1974 284.46 305.88 
1975 296.79 296.79 
1976 291.07 “321.82 
1977 281.57 331.04 
1978 251.64 324.64 
1979 227.28 323.48 
1980 209.01 311.55 
1981 191.46 297.84 
1982 184.87 297.64 
1983 183.87 300.94 
1984 173.79 308.33 
1985 163.89- 308.53 
1986 155.04 306.21 
1987 147.08 303.77 


Japan is by far the most important factor 
behind the long-run appreciation of the yen.” 


As John R. Norsworthy and David H. Malmquist 
(1983) show, the substantial increase in labor productiv- 
ity in Japan can be attributed to the concentration of 
capital investment in Japan’s manufacturing sector, par- 
ticularly in the machinery industries. This high concen- 
tration of investment since the first oil shock of 1973 
has led to the creation of what can be called the 


Constant 

Japanese Constant - Constant 
Energy U.S. Unit . U.S. Unit 

Coefficient Labor Cost Material Cost 
283.97 266.30 239.77 
283.45, 216.36 260.91 
296.79 296.79 296.79 
29172 29121 302.67 
282.56 283.68 305.75 
252.66 257.35 285.27 
228.37 235.92 271.47 
210.76 223.26 265.90 
194.34 208.09 | 257.47 
188.02 204.73 250.41 
187.09 202.75 253.93" 
176.73 189.43 246.44 
166.89 180.23 234.65 
156.87 171.04 222.42 
148.88 163.14 213.19 


Conversely, on the U.S. side, an increase in 
unit energy and material costs seems to be a 
more important determinant of the dollar's 
depreciation than is an increase in the unit 
labor cost. | 


* machinery block." See Hiroshi Yoshikawa and Fumio 
Ohtake (1987). 


) 
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III. Conclusion 


In this paper, we have proposed a defini- 
tion of the long-run equilibrium yen-dollar 
rate and attempted to estimate it by using 
Japanese and U.S. data. The long-run equi- 
librium rate defined in Section I is based on 
three factors: (1) the (wage) inflation differ- 
ential between Japan and the United States; 
(2) the real world price of natural resources, 
such as oil, in terms of manufactured goods 
(Japan's terms of trade); and (3) the labor 
and energy coefficients in both Japan and 
the United States for Japan's export sector. 
As one variation of PPP it allows us to 
explore the relative importance of monetary 
and real factors in determining the long-run 
trend of the exchange rate. 

From 1973 to 1987, the trend in the actual 
exchange rate broadly coincides with the 
movement of our equilibrium rate. On occa- 
sion, however, the actual exchange rate di- 
verges from its long-run equilibrium value 
for fairly long periods. For example, the yen 
was greatly overvalued in 1978 and the dol- 
lar was overvalued for more than four years, 
from 1981 through 1985. During most of the 
period of the dollar's overvaluation, from 
1981 through 1984, the equilibrium yen rate 
continued to appreciate but the actual yen 
rate depreciated. This divergence can only be 
explained by fiscal and monetary policies. 
The depreciation of the dollar from 1985, on 
the other hand, was a return to the equilib- 
rium value. 

Our analysis demonstrates that real fac- 
tors, most notably, changes in labor produc- 
tivity, have been much more important than 
the nominal wage differential in determining 
the long-run trend of the yen-dollar rate 
from 1973 through 1987. As mentioned 
above, most empirical works concerning the 
long-run exchange rate tend to focus on price 
differential and have largely ignored produc- 
tivity. Meanwhile, statistical evidence indi- 
cates that changes in both nominal and real 
exchange rates can be approximated by the 
random walk, and therefore that they tend 
to be nearly permanent. Because changes in 
real and nominal exchange rates are very 
highly correlated and have similar variances, 
it is consistent with the view that most 


changes in nominal exchange rates are due 
to real shocks with a large permanent com- 
ponent. Purely statistical analyses leading to 
this conclusion, however, cannot explain 
what kind of real shocks are in fact impor- 
tant. This paper has identified increases in 
labor productivity in Japan's export indus- 
tries as the most important factor to explain 
the long-run irend of the yen-dollar rate 
frcm 1973 thrcugh 1987. Given our findings, 
we hope that productivity will be given 
greater weight in future analysis of long-term 
trends of exchange rates. 
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On the Basing-Point System 


By Bruce L. BENSON, MELVIN L. GREENHUT, AND GEORGE NORMAN* 


Jacques F. Thisse and Xavier Vives ob- 
served in this journal (1988) that analysis of 
the base-point price (BPP) system "should 
consider its.role as a coordinating and collu- 
sive device" (p. 12) This conclusion has 
much in common with that of Fritz Machlup 
(1952) and Clair Wilcox (1955) and provides 
a sharply contrasting thesis from that of 
David D. Haddock, who had contended in 
(1982) that BPP systems are competitive. 
However, Thisse and Vives (henceforth TV) 
were not specifically, or even primarily con- 
cerned with base-point pricing. Rather, they 
focused attention on noncooperative strate- 
gies in spatial markets, deriving several con- 
clusions by approaching the choice of pric- 
ing policy from an explicitly strategic, 
game-theoretic viewpoint. While their results 
do imply that noncooperative base-point 


pricing is unlikely, they did not center atten- - 


tion on the conditions that must exist in 
order for a competitive base-point pricing 
system to arise. 

The present paper extends TV's theoreti- 
cal analysis by focusing exclusively on base- 
point pricing and by ascertaining the condi- 
tions that would be required for Haddock's 
noncooperative BPP system to arise. In par- 
ticular, the initial part of this paper utilizes 
the TV framework to identify the necessary 
conditions for a noncooperative BPP system. 
The TV framework leads us to a different 
interpretation than Haddock proposed. The 
second part of this paper adds, however, to 
the contention that the system could indeed 
have competitive origins; but we shall ob- 
serve that it limits such claim to certain 
highly restrictive conditions. This part of the 


*Department of Economics, Florida State Univer- 
sity, Tallahassee, FL 32306-2045; Department of Eco- 
nomics, Texas A&M University, College Station, TX 
71843-4228 and adjunct, University of Oklahoma, Nor- 
man, OK 73019; and Department of Economics, The 
University of Leicester, Leicester, England LE1 7 RH. 


paper also refers to the reality of BPP. A 
third and concluding section accounts for 
the “cooperative” use of the system when, in 
fact, other “collusive” delivered pricing sys- 
tems that would be more profitable than 
BPP can be conceived of. 


I. Theory 


For simplicity, assume that all consumers 
are identical, and are distributed over a line 
market in which there are two production 
sites J and JZ, with site I the lower-cost site 
(perhaps because of better access to raw 
materials). Assume that there are two firms, 
i= I, II selling a homogeneous product, with 
firm i located at site i. Transport costs per 
unit of output from site i to consumer loca- 
tion x are given by a nonnegative, increasing 
function 7,([; — x|), where |-| is the distance 
norm, and 7,(0) = 0. Figure 1 (see also Had- 
dock, Figure 1) illustrates such a case, with 
mc, denoting marginal production cost at 
site i, and m,(x) denoting marginal cost 
of production and transportation from site 
i to consumer location x: mj(x) — mc; * 
ti — xD = I, IT). 

Define the monopoly price pM(x) that 
firm i would charge to consumers at x as the 
price that would maximize profit to firm i 


‘from consumers at x in the absence of any 


competition (this price is derived from the 
standard MR — MC condition). TV show 
that if the duopolists simultaneously choose 
their pricing policy and price, the noncoop- 
erative equilibrium pricing strategy p*(x) 
for firm i is 


pix) if p(x) & m,(x) 
(1) pf(x) =m,(x) if p(x) > mj(x), 
and m,(x) <m,(x) 


m,( x) otherwise, 
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Distance 


FIGURE 1 


a form of price discrimination first discussed 
by Edgar M. Hoover (1937).! Assume with- 
out loss of generality that pM(x) everywhere 
exceed marginal cost of production and 
transportation for firm j (ji). Then the 
price equilibrium is illustrated in Figure 1 by 
the lines AB, CD for firm J and BC for firm 
II. Yf the pricing game is a two-stage game, 
in which firms first choose a pricing policy 
and then compete in prices contingent on 
the chosen pricing policies, TV show again 
that the discriminatory policy of equation 
(1) is the unique (Nash) equilibrium out- 
come. 

Is pricing policy (1) consistent with BPP? 
It is quite clear that firm IJ is following a 
BPP with site J as base, but firm I is also 
following BPP with site II as base. In other 
words, the only equilibrium is a -peculiar 
type of multiple BPP system in which con- 
sumers are charged a price less by the amount 
e than the marginal costs plus freight of the 
second-lowest cost supplier. 

Now consider what will arise if a second 
firm is introduced at site 7: if, for example, 
the producer at site JJ has a second plant at 
site 7. Using precisely the same argument, 
the resulting Nash equilibrium prices are 


‘This equilibrium assumes that individual demand is 
* well behaved," in that profit from consumers at x is 
quasi-concave in price. i 


now given by 


(2) př (x) =m,(x) 
* p m,(x) if m,(x) > my(x) 
Pir(x)= fae if m,(x) « my(x) } 


These are illustrated in Figure 1 by the lines 
EFB and CF for the firms at site J and BC 
for the firm at site JI. Pricing policy (2) is 
the policy derived by Haddock (although 
Haddock does so by requiring that produc- 
tion at site J be “competitive”): it is BPP 
with site I as base.” 


II. Conditions for Competitive BPP 
We can now identify the somewhat restric- 


tive condition under which this policy 
emerges. For a single-base basing-point pric- 


ing system to be the outcome of a competi- 


tive process, it is necessary that production 
at the basing point be in some sense compe- 


Recall that the pricing equilibria assume p(x) 
> m,(x)(j #i) for all x. Note that Haddock ignores 
the possibility that firm JI (the local monopolist) could 
have a sufficiently strong monopoly advantage near site 
II as to be able, and so willing, to charge the monopoly 
price. 
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titive? and production at distant sites be 
monopolized. Yt is further necessary that firms 
at the basing point always be either the lowest 
or the second-lowest cost suppliers to all con- 
sumers. Extending the analysis, a single-base 
BPP system will also arise from the configu- 
ration of suppliers in Figure .2a,. but not 
. necessarily from the configuration in Figure 
2b. Far from being competitive, the deliv- 
ered price system in Figure 2a derives from 
the exploitation of locational rents! . : 

A further peculiar result is readily evident 
with the BPP of equation (2). The low-cost 
producers just break even, but the high-cost 
nonbase-point firm that has a local monopoly 
position can earn supernormal profit or loca- 


3In the TV analysis, two competing producers will 
suffice. 


tional rents. Haddock's argument (p. 304) in 
this respect is in error. The locational rents 
arise because multiple-firm entry at the dis- 
tant site is barred by a combination of de- 
mand and cost conditions. In particular, 
sufficient demand and revenues exist to gen- 
erate returns for one firm that exceed oppor- 
tunity costs, but entry of another firm would 


mean that both sellers incur economic losses. 


These conditions are exogeneous to the pro- 
ducer at.the distant site. Such indivisibility 
has been discussed by Nicholas Kaldor 
(1935), Harold Demsetz (1964), and, more 
recently, B. Curtis Eaton and Richard G. 
Lipsey (1978). N 

One further concern of Haddock was to 
use BPP to present an efficiency argument 
for cross-hauling. But it is clear that cross- 
hauling does not require basing-point pric- 
ing. There is exactly the same form of cross- 
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‘hauling under the pricing equilibrium in 
equation (1), with firm. II shipping to con- 
sumers to its-left, and firm J shipping to 
consumers to the right of firm IJ. Simply 
put, differences in production costs and 
economies of long-haul transportation are 
sufficient to generate this kind .of cross- 
hauling.* 

In addition to the aforenoted ‘theoretical 
criticisms, the idea of a competitive BPP 
system is further subject to the ‘most basic 
criticism that it misinterprets the system. In_ 
particular, the BPP. equilibrium under com- ` 
petitive conditions is justified on the grounds 
that it will allow the firm at site IJ to survive 
when f.o.b. pricing would not.>. This result 
requires that firm JJ supply all demand in 
its natural market area: defined in Figure 1 


as the, area in which mg(x) <.m,(x). 


—region ee’. With such an assumption, BPP 
always generates greater total revenue than 
will f.o.b. pricing. There will, therefore, ex- 
ist some..range of cost and demand condi- 
tions such that BPP will allow firm II to'at 
least break even, while f.o.b. pricing, will not. 
But this is not.in the spirit of the BPP 
system. As Machlup (1949) states: “[the] 
basing-point, technique of pricing. makes it 


possible for any number of sellers, no matter ` 


where they åre located and without any com- 
munication with each other, to quote identi- 
- cal delivered prices for any quantity -of the 
product in standardized ‘qualities and speci- 
fications” (p. 7). 


Assume then that BPP leads to firm JI 


sharing its market with the firms at site J. 
We know there must be at least two produc- 
ers at site J for the subject BPP system to 


“Actually, even more extensive cross-hauling involv- 
ing simultaneous supply of an area by two or more 
firms is easily explained with spatial competition mod- 
els involving price discrimination (for example, John 
Greenhut and Melvin L. Greenhut, 1975). Base-point 
pricing is not a necessary condition for cross-hauling, or 
for freight absorption, so Haddock is clearly correct in 
emphasizing that these phenomena are not evidence of 


collusion. But significantly, they do not oe im- ` 


ply base-point pricing either. 
Actually, of course, the choice is more. likely to 
involve discriminatory pricing than f.o.b. pricing. — . 
With trivial exception of some peculiar demand 
functions. 
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‘emerge. It follows that firm II can at the ` 


most expect only one-third of the total sales 


.in the region ee’ under the BPP system. By 


contrast, f.o.b. pricing will secure all demand 
for firm JJ in a region somewhat smaller 
than ee’, As Melvin L. Greenhut (1956, pp. 

313, 314; 1970, Chapter. 7) points out, once 
such market sharing is considered, f.o.b. 

pricing -can dominate basing-point pricing 
and would if costs corresponded to those of 
Figure 1. Indeed, the classical form of spa- 
tial price discrimination by the distant firm, 
where its delivered price increases in each 
direction from its site, can dominate the f. o.b. 

pricing alternative; moreover, it would not 
appear.to be predatory, as is the case for the 
systematic lowering of delivered prices in 
region ee’ at some market points located at 
greater distances from the-seller than others. 


. MI. Conclusions 


In order for a competitive simple basing- 
point price system to hold, it is necessary 
that a single firm located at a distance from 
a competitive production center exploit its 
local market power, collect locational rents, 
and aloné seli over its. dominated market 


Space. This composite is a necessary condi- 


tion; without its fulfillment, the noncoopera- 
tive base-point price system would not exist. 
It follows in light of the requisite local mar- 
ket power at a distance that BPP is a non- 


. competitive system: Moreover, when such 


power exists, other delivered pricing forms 
will be used in the absence of collusion.. 
This paper thus contends that traditional 


' views on the subject are more appropriate. 


(See Greenhut, 1956, Appendix V.) When 
base-point pricing is followed, it is.at- 
tributable to (i) conscious parallelism of ac- 
tion, where some firms feel obliged (com- 
pelled) to adhere to the price schedule of 
others or (i) stems from an outright collu- 
sive conspiracy whose design is to maximize 
profits subject to cartel policing and enforce- 
ment costs. Such costs are an intrinsic part 
of the system since a cartel must be. orga- 
nized and its agreement enforced in the pres- 
ence of incentives to cheat. As many have: 
pointed out (for example, Hoover, 1937, 
p. 190; Machlup, 1949, pp. 131-136; George 


1 
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J. Stigler, 1979, pp. 1147-1148; Frederic M. 
Scherer, 1980, pp. 329-330), a base-point 


system is much less costly to police and . 
enforce than are other spatial pricing ar- 


rangements. This helps explain its use in- 
stead of other organized delivered price sys- 
tems that would be more profitable in the 
absence of policing and enforcement costs. 
The fact that base-point pricing generates 
locational rents for distant firms also indi- 


cates why this system might characterize a . 


cartel notwithstanding the existence of more 
profitable arrangements, such as f.o.b. pric- 
ing or spatial discriminatory pricing. These 
other more profitable systems may well in- 
duce competitive entry. It follows that if 
base-point pricing deters entry and in the 
process protects existing rents, it may actu- 
ally be the most desirable arrangement that 
the cartel can adopt. 
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Government Debt; Government Soetiding: and Private. 
iD . Sector Behavior Revisited: Comment 


"Perhaps: no issue has generated as much a 
‘controversy among economists in the:past - 


decade as the proposition that an increase in 
' of this ‘so-called: Ricardian equivalence 
‘ ‘proposition is central" to. whether budget 


` feasibility of. expansionary tax reductions, 
‘and to the effects of Social Security on pri- 


- vate saving and aggregate capital accumula- 


_ tion. 
" Although the basic idea that the Tatars tax 


. liabilities associated with government deficits . 
z and debt induce individuals to increase their = - 


saving’ has been around since the time of 
‘David Ricardo and was treated explicitly by 
‘Don Patinkin (1965), Martin Bailey (1971) 


~ and Levis Kochin (1974), the current debate `’ 

' was launched.by Robert Barro (1974). The - 
‘voluminous theoretical literature of recent ` 
years has shown that complete Ricardian ` 

'^ equivalence would be expected to prevail... 
only under very ‘ special - conditions; ` see - 
Douglas Bernheim (1987) for an. especially D 
useful survey and analysis. But the theoreti- - 


cal restrictiveness of the assumptions re- 
 quired for complete Ricardian: equivalence 
does not constitute a practical refutation: 


~ ‘Defenders of Ricardian equivalence ‘can ar- ^ 
` gue that the theory is. only an approximation . 


‘and can claim that, although the stringent 


- conditions required for complete Ricardian ' 


equivalence do not hold, the economy's be- 


-havior in practice is close to the predictions : 


of Ricardian equivalence. 


"Martin Feldstein is Professor of Economics at Har- 


vard University and President of the National Bureau 


" By MARTIN PoUux AND DoUGLAs y. A € 


There are two key empirical questions. 
The first is whether a higher level of taxes 


^O (with: governinent spending constant) in- 
` the government deficit induces an equal off- . ' 
` setting increase in private saving. The truth . - 


duces individuals to reduce their spending 
on consumption as ‘traditional theory holds 


< or has no effect i on consumer Spending as the 
li t : Ricardian equivalence proposition predicts. 
deficits reduce: capital accumulation, to the: ' ^ 


The second deals with the effect of govern- 


"ment outlays on goods and services. Al- 
though the absence of a negative effect on 


consumer spending of such government out- 


' lays is clearly contrary to the Ricardian 


equivalence propositión, the existence of a 
moderate negative , éffect of government out- 
lays ón: consumer spending is not in itself 


‘evidence in favor of the Ricardian equiva- 


lence proposition that individuals increase 
their saving to finance anticipated debt ser- 
vice. As Feldstein (1982) explained, con- 
sumers may correctly believe that a rise in 
current government spending is a good indi- 
cator of a higher level of future government 
spending. Once a program is launched or 


', budgets increased, the process is unlikely to 
-be reversed. An increase in current govern- 


ment spending is therefore a good indication 


- that future taxes will ‘have to be higher to 


finance a: higher level of future government 
spending. Individuals may rationally. reduce 
their own spending when government out- 


A lays increase without a. concurrent increase 
"| in taxes because they anticipate higher fu- 
. ture taxes to finance higher future govern- 
..ment spending even if they give little or no 
‘weight to the debt service implications of the 

d current deficit. 


"The strongest direct eviderice i in favor of 
Ricardian equivalence i is Roger Kormendi's 


. 1983 article in the American Economic Re- 
-. view. He „presents consumption regression 


. of Economic Research. Douglas Elmendorf is Assistant >` equations _ that relate an estimate of con- 


Professor of Economics at Harvard. University. We are * 
grateful to "Greg Mankiw and Lawrence Summers for 


sumption’ . to net. national Laie wealth, 


‘comments on an “earlier draft. The' research reported . dd 


here:is part of the NBER ‘study of the Government 


Budget and the Privaie Economy. 


! Kormendi defines consumption a as the sum ; of cur- 


rent expenditures on services and nondurables plus 10 


590 - THE AMERICAN ECONOMIC REVIEW 


government debt, government spending on 
goods and services, taxes, transfers, corpo- 
rate retained earnings, and government in- 
terest payments. His parameter estimates ap- 
pear to show that an increase in taxes does 
not affect consumption at all while an in- 
crease in government spending on goods and 
services does reduce consumption. Thus 
whatever the theoretical shortcomings of the 
Ricardian equivalence theory, it would ap- 
pear from Kormendi's results that in prac- 
tice consumers behave as the Ricardian 
equivalence theory predicts.? 

Kormendi's analysis has been criticized by 
James Barth et al. (1986) and by Franco 
Modigliani and Arlie Sterling (1986). Al- 
though we are unconvinced by Kormendi's 
analysis, we do not find that either of those 
comments is a persuasive refutation of the 
Kormendi study. While Barth et al. conclude 
that their results “raise. sufficient questions 
about the robustness and interpretation of 
Kormendi's original findings that more em- 
pirical work in this important research area 
is clearly needed" (p. 1165), their estimates 
generally support Kormendi's principal 
finding that consumer spending is sensitive 
to government outlays on goods and services 
but not to taxes. Their.analysis extends the 
Kormendi sample through 1983, separates 
federal government debt from state and local 
debt, estimates for alternative subperiods, 
and tries substituting the, par value of gov- 
ernment debt for the market value of the 





percent of current expenditures on consumer durables 
and 30 percent of the stock of consumer durables. 

. “There have of course been other tests of Ricardian 
equivalence. John Seater and Roberto Mariano (1985) 
interpret their evidence as supporting Ricardian equiva- 
lence while the evidence of Michael Darby et al. (1987) 
and of Feldstein (1982) is inconsistent with Ricardian 
equivalence. There have also been indirect tests of Ri- 
cardian equivalence based on exarmining the effects of 
government deficits on real interest rates and on the 
exchange rate. The results of these tests have been 
mixed. Some researchers, including Charles Plosser 
(1982) and Paul Evans (1985), found that budget deficits 
do not change interest rates or the value of the dollar 
while others, including Feldstein (1986) and Michael 
Hutchison and Adrian Throop (1985), have found the 
opposite. A discussion of these articles lies beyond the 
scope of this paper. 
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debt even though they recognize that the 
latter is the conceptually appropriate mea- 
sure. The only estimates in which the effect 
of government spending is not at least 
marginally significantly negative are in equa- 
tions estimated for the postwar period that : 
contain the theoretically inappropriate par 
value of the government debt. In no equa- 


. tion do taxes have a significant negative 


effect. 

The essential feature of the Modigliani 
and Sterling analysis is to replace the sepa- 
rate tax, transfer, and government interest 
variables with a combined “net tax” variable 
that is equal to government taxes net of 
transfers including government net domes- 
tic interest payments.. The sum of the 
distributed lag coefficients of this net tax 
variable is significantly negative in the 
Modigliani-Sterling consumer expenditure 
equations and is not significantly different 
from the sum of the lag coefficients of net 
national product. In this specification the 
coefficient of the government spending vari- 
able is small and not significantly different 
from zero. Modigliani and Sterling interpret 
their estimates as “strikingly and unmistak- 
enly consistent with a “Life Horizon"-Life 
Cycle approach to consumption behavior and 
equally inconsistent with the infinite horizon 
Ricardian Equivalence Proposition formula- 
tion" (p. 1178). 

Unfortunately, however, Modigliani and 
Sterling do not provide an explicit test of the 
effect of taxes per se on consumption but 
only of the combined “net tax” variable. 
Since the coefficient of the transfers variable 
in the original Kormendi analysis was posi- 
tive, large, and statistically significant, it is 
not surprising that the variable created by 


subtracting transfers from taxes has a coef- 


ficient that is negative, large and statistically 
significant. This should be expected regard- 
less of any additional changes in variable 
definitions or estimation procedures. Al- 
though the variable is correctly labeled as 
“net taxes,” its. coefficient is essentially an 
indication of the effect of transfers. Simi- 
larly, although Modigliani and Sterling do 
present two specifications that include trans- 
fers as a separate variable, in both of those 
equations they constrain the coefficient of 
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the net tax variable to equal the coefficient 
of net national product, so no separate esti- 
mates of the effects of taxes and transfers 
can be inferred. 

These objections do not detract from the 
force of the Modigliani-Sterling argument 
that transfers are negative taxes and that the 
two should therefore be treated symmetri- 


cally in any analysis of Ricardian equi-. 


valence. If this is accepted, Kormendi's 
own estimates of the effect of transfers on 
consumption provide a strong refutation of 
the Ricardian equivalence proposition. 
Kormendi's reply that transfers are received 
by a subgroup of the population that is 
liquidity constrained suggests at a minimum 
that not all taxes and taxpayers should sat- 
isfy the Ricardian equivalence proposition. 


We are nevertheless left without any direct : 


test of Kormendi's conclusion that govern- 
ment spending depresses consumer spending 
while taxes do not. There are two ways in 
which Kormendi's research could be subject 
to further analysis. The first would be to 
develop a more general model of which 
Kormendi’s is a special case and to evaluate 
the relative importance of government 
spending and taxes in that more general 
model. Such a model might include the real 
net interest rate, alternative measures of 
household wealth, the age distribution of the 
population, the income distribution, and 
other variables that could influence con- 
sumer spending but that are not part of the 
Kormendi analysis. The alternative and more 
modest but direct approach followed in the 
present paper is to see whether Kormendi's 
results remain when his equation is reesti- 
mated by different statistical techniques, us- 
ing different functional forms, and for peri- 
ods that exclude the World War II years, 
when consumer spending was constrained by 
Shortages and rationing. When this is done, 
the results are quite contrary to Kormendi's 
and reject the Ricardian equivalence propo- 
sition. : 
More specifically, the present paper shows 
that Kormendi's results are misleading and 
cannot be sustained when the war years 1941 
through 1946 are excluded from the sample. 
Omitting this period of wartime shortages 
and rationing and estimating for the period 


through 1985 reverses Kormendi's principal 
finding and shows that higher taxes depress 
consumption while an increase in govern- 
ment spending has no significant effect on 
consumption. This conclusion is confirmed 
when Kormendi’s procedure of estimating in 
first-difference form is replaced by a more 
appropriate estimation using a first-order au- 
toregressive transformation of an equation 
specified in levels.? 

The paper begins (Section I) by replicating 
Kormendi's estimates and extending the end 
of the sample period from 1976 to 1985. 
Section II then shows the critical importance 
of Kormendi's practice of including the war 
years. Section III examines the effect of ex- 
cluding state and local governments and fo- 
cusing on federal government spending, 
taxes, transfers, and debt. The fourth section 
respecifies the equation in ratio form; this 
reduces the problem of collinearity and pro- 
duces coefficients that imply stronger tax 
effects (with smaller standard errors) and no 
effect of government spending. The fifth sec- 
tion looks at estimates for the postwar 
period only, while the sixth section presents 
instrumental variable estimates that treat the 
current values of taxes, transfers, govern- 
ment spending, and NNP as endogenous. 
There is a final concluding section. 


I. Replication and Basic Variations 


Table 1 presents alternative estimates of 
Kormendi's basic specification relating con- 
sumer spending to net national product (Y,), 
lagged net national product (Y, ,), govern- 
ment spending on goods and services (GS,), 


3A proponent of Ricardian equivalence might of 
course say that this reversal of Kormendi’s empirical 
findings does not contradict Ricardian equivalence since 
current taxes may be a proxy for future government 
spending. This line of argument, carried to the extreme, 
would make it impossible to refute Ricardian equiva- 
lence with estimates of consumer behavior. There is also 
a conceptual problem since taxes finance not only gov- 
ernment spending on goods and services but also the 
empirically more important transfer payments and debt 
service. Moreover, regression equations estimated with 
our data (and similar equations estimated by Modigliani 
and Sterling) fail to find any predictive effect of current 
taxes on future spending. 


592 


THE AMERICAN ECONOMIC REVIEW 


JUNE 1990 


TABLE 1—REPLICATION AND BASIC VARIATIONS OF THE KORMENDI SPECIFICATION 


Equation (1.1) (12) (1.3) 
Sample 1931-76 1931-76 1931-76 
Data* > 1 2 ata 
Estimation FD FD FD 
Y, 0.29 0.30 0.33 
(0.04) (0.05) (0.06) 
Ya 0.07 0.11 0.08 
(0.02) (0.03) (0.03) 
GS, —0.23 —0.15 —042 
(0.02) (0.03) (0.03) 
TX, 0.07 — 0.08 —047 
(0.08) (0.10) (0.11) 
GB, —0.055 — 0.039 — 0.014 
(0.018) (0.029) (0.030) 
TR, 0.83 0.70 0.45 
(0.15) (0.21) (0.23) 
W, 0.025 0.021 0.033 
(0.008) (0.011) (0.013) 
RE, 0.10 —0.04 —0.13 
(0.11) (0.15) (0.18) 
GINT, 1.15 2415 1.71 
(0.91) (1.15) (1.16) 
R? 0.91 0.83 0.79 
SER 17.5 22.3 56.4 
p 2 id e 
DW Stat. n.a. 1.5 1.6 





(1.45 (1.5) (1.6) (1.7) (1.8) 
1931-85 1931-40 1931-40 1931-40 1931-85 
1947-85 1947-84 1947-85 
3 3 2 3 3 
FE FD FD ARI ARI 
0.30 0.31 0.30 0.31 0.30 
(0.05) (0.07) (0.06) (0.06) (0.07) 
0.02 0.08 0.09 0.09 0.11 
(0.03) (0.03) (0.03) (0.03) (0.04) 

—010 0.05 0.03 0.13 —0.11 
(0.03) (0.09) (0.09) (0.09) (0.03) 
-015 —0.17 ~0.13 —0.16 —0.07 
(0.11) (0.13 (0.13) (0.12) (0.12) 
—0.008  -0.0004 0.014 0.081 —0.010 
(0.026) — (0.050) — (0.029 (0.032) (0.030) 
0.65 0.66 0.76 0.84 0.83 
(0.21) (0.20) (0.20) (0.14) (0.19) 
0.015 0.014 0.016 0.020 0.024 
(0.009) — (0.00 (0.007 (0.006) (0.008) 
—0.03 0.02 0.05 —0.04 0.16 
(0.17) (0.18) (0.17) (0.18) (0.20) 
113 0.88 0.66 0.89 1.31 
(0.68) (0.63) (0.57) (0.50) (0.66) 
0.75 0.79 0.83 0.999 0.999 
61.4 57.4 24.7 48.0 62.2 
z > = 0.76 0.86 
17 17 1.6 1.8 1.7 


Note: 'The dependent variable is consumption. A constant term was included in the estimation but is not reported. 
Standard errors are in parentheses. See text for definition of vaziables. 


*Data: ‘Kormendi ?Pre-Benchmark ?Post-Benchmark 


total tax revenue (TX,), government debt at 
market value (GB,), transfer payments (TR,), 
private wealth excluding government debt 
(W,), corporate retained earnings adjusted 
for capital consumption allowances and in- 
ventory valuation (RE,), government inter- 
est payments to persons and businesses 
(GINT,), and a constant term. Standard er- 
rors of the parameter estimates are shown in 
parentheses beneath each coefficient. 
Kormendi's original article presents de- 
tailed definitions and sources for these sta- 
tistics. Since our purpose is to assess the 
sensitivity of Kormendi's conclusion to the 
inclusion of the World War II years, we have 
tried to stay as close as possible to his speci- 
fication and his definitions in constructing 
the regression variables for our analysis. As 
far as we can tell, there are only two small 
differences between our definitions and 
Kormendi’s. First, we use the personal con- 
sumption price deflator for all variables while 


Kormendi uses different price deflators for 
different variables. Second, we use the Fed- 
eral Reserve Flow of Funds Balance Sheets 
for the U.S. Economy, 1946-85 as the source 
of data on "domestic net assets" to measure 
private nonhuman wealth while Kormendi 
rescales estimates of several types of wealth 
presented in the Survey of Current Business; 
the Federal Reserve data have the virtue of 
being directly comparable to the estimates 
by Raymond Goldsmith used by Kormendi 
and by. us for the earlier years of the sample. 
Kormendi's series on the market value of the 
government debt is extended using the 
method developed in James Butkiewicz 
(1983). A complete listing of the data is 
available on request. 

Equation (1.1) of Table 1 reproduces the 
parameter estimates reported in Table 5 of 
Kormendi. This is the only equation re- 
ported by Kormendi that includes both the 
tax variable and the government debt vari- 
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able. Note that it is estimated in first-dif- 
ference form for the entire sample from 1931 
through 1976, including the years of World 
War II. In this specification an extra dollar 
of government spending on goods and ser- 
vices reduces consumer outlays by 23 cents. 
By contrast, an extra dollar of tax revenue 
has no statistically significant effect; its co- 
efficient is actually positive but less than its 
standard error. The coefficient of the govern- 
ment debt variable also has an implausible 
negative sign but is actually statistically sig- 
nificant. Although this might have been in- 
terpreted as evidence that the equation is 
misspecified or misestimated, Kormendi just 
notes that the negative coefficient is clearly 
contrary to the traditional theory that im- 
plies that government debt is a form of 
wealth that should have a positive effect on 
consumption. 

Equation (1.2) represents our attempt to 
replicate the original Kormendi estimation 
for his sample period of 1931-1976. Since 
Kormendi’s equation was estimated with the 
data available before the major 1985 bench- 
mark revision of the national income ac- 
counts, we used pre-benchmark revision data 
to estimate equation (1.2). Although the re- 
sults are not identical to equation (1.1) (pre- 
sumably because of earlier revisions in the 
national income and wealth statistics and 
perhaps because of inadvertent differences in 
the way that the data are constructed), the 
coefficients are quite similar to Kormendi’s 
results in equation (1.1). In particular, the 
government spending coefficient is —0.15 
with a standard error of only 0.03, implying 
a significant effect of government spending 
on consumption, while the tax coefficient is 
only —0.08 with a standard error of 0.10. 
The coefficient of the debt variable remains 
negative and larger than its standard error. 


Equation (1.3) repeats this estimation for 
the same sample period but with the most 
recent available data (as of July 1987). The 
primary effect of this data revision is to 
increase the coefficient of the tax variable to 
— 0.17 with a standard error of 0.11, imply- 
ing that the hypothesis that the true tax 
coefficient is zero or positive can be rejected 
at a 7 percent significance level. The coeffi- 


cient of the government debt variable is still 
negative but drops to less than half its stan- 
dard error. The government spending vari- 
able is reduced in size but is still substan- 
tially greater than its standard error. 

Extending the sample period through 1985 
(equation (1.4)) leaves these key parameter 
estimates essentially unchanged. The coef- 
ficient of the government-spending variable 
is significantly negative while the coefficient 
of the tax variable, although absolutely larger 
than the government-spending coefficient, is 
only significant at the 10 percent level. 


H. Excluding the War Years 


A crucial feature of the Kormendi esti- 
mates is that his sample includes the war 
years, when consumption was reduced by 
widespread rationing, by shortages of con- 
sumer durables and other consumer goods, 
and by patriotic appeals to purchase saving 
bonds. The personal saving rate jumped from 
4.0 percent in 1940 to 10.9 percent in 1941 
and to more than 20 percent in each of the 
next three years before subsiding to 19.2 
percent in 1945 and 8.6 percent in 1946. The 
war years were also a time in which govern- 
ment spending rose much more than tax 
revenue. Real government spending on goods 
and services in 1982 dollars jumped from 
$150 billion in 1940 to $484 billion in 1942 
and $791 billion in 1944, a level that has 
never been reached again. By contrast, real 
tax revenue only rose from $126 billion to 
$264 billion. The evidence presented below 
shows that the strong correlation between 
wartime government spending and the high 
saving rate caused by shortages and ra- 
tioning causes a spurious negative relation 
between government spending and personal 
consumption in the sample as a whole. 

Because of the very unusual nature of the 
consumer goods markets during the World 
War II period and the intense patriotic ap- 
peals for increased saving, the World War II 
years should be excluded in any regression 
analysis of saving behavior. Although Kor- 
mendi does present some estimates that ex- 
clude the war years, those equations never 
contain both the tax and government debt 
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variables and therefore do not provide an 
explicit test of the Ricardian equivalence 
proposition.* 

Equation (1.5) shows that when the. six 
war years 1941 through 1946 are omitted, 
the remaining 49. observations tell a very 
different story. In particular, the coefficient 
of the government spending variable be- 
comes very small and only about half as 
large as its standard error. In contrast, the 
coefficient of the tax variable is — 0.17 with a 
standard error of 0.13 that implies that the 
null hypothesis of a zero or positive effect 
can be rejected at the 10 percent level. It is 
clear that including the war years produces 
very misleading results. 

As a check that the reason for the very 
different conclusions implied by equations 
(1.1) and (1.5) is due to omitting the war 
years and not to the recent benchmark data 
revision, we have reestimated equation (1.5) 
using the data available before the 1985 
benchmark revision. The results, presented 
in equation (1.6), are quite similar to those 
of (1.5) and indicate that it is the exclusion 
of the war years rather than the data revision 
that is critical. 

Kormendi explains that he estimates the 
equations in first-difference form to reduce 
the risk of the spurious results that Clive 
Granger and Paul Newbold (1974) have 
shown can occur when the equations are 
estimated in level form and there is substan- 
tial serial correlation of the residuals. If the 
variables in levels are nonstationary, the hy- 
pothesis tests will be at a level different from 
that intended. However, using first-dif- 
ference estimation is less efficient than esti- 
mation in level form with an autoregressive 
transformation. Moreover, estimation by 
first-differencing has some further disadvan- 


*Kormendi does present one equation in Table 4 
without the war years and with both government spend- 
ing and taxes but without government debt. We have 
followed Kormendi and reestimated this equation in 
first-difference form but, unlike Kormendi, we did not 
obtain a significant effect of government spending. Our 
estimated coefficient of government spending was 0.02 
with a standard error of 0.08; by comparison, the tax 
variable had a coefficient of —0.18 with a standard 
error of 0.12. The difference in our finding may well 
reflect revisions in the national income statistics. 
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tages. If variables are measured with error 
the use of first difference estimation in- 
creases the errors in variables’ bias. If the 
response of consumers to an explanatory 
variable is not immediate, the use of first- 
difference estimation can cause a substantial 
underestimation of its true effect. The re- 
maining equations of Table 1 therefore pre- 
sent estimates in level form after an ARI 
transformation based on the estimated auto- 
correlation coefficient. 

Equation (1.7) is estimated after an ARI 
transformation with an autocorrelation co- 
efficient of 0.76. The results again indicate 
that taxes have a negative effect while gov- 
ernment spending has a positive coefficient. 
The coefficient of the government debt vari- 
able has the correct positive sign and is more 
than twice its standard error; if anything, the 
coefficient is implausibly large. Equation 
(1.8) shows the effect of including the war 
years with the ARI estimation; once again 
government spending becomes significant 
while taxes and government debt are in- 
significant. It is clear that the choice between 
first-difference estimation and an autoregres- 
sive transformation does not affect the con- 
clusion that the evidence in favor of Ricar- 
dian equivalence rests on including the six 
years of World War II and that when these 
years are excluded -Ricardian equivalence is 
clearly rejected. 


III. Total Government or Federal Government 


Federal taxes and spending are very dif- 
ferent from the taxes and spending of state 
and local governments. Individuals can in 
principle avoid a very large part of the state 
and local taxes that they pay by moving to a 
different jurisdiction where they would also 
forego the benefits that higher tax dollars 
purchase. Moreover, while approximately 75 
percent of federal government spending on 
goods and services is for national defense, 
the goods and services spending of state and 
local governments is for education and other 
personal services of the local voters. State 
and local debt is also different in kind from 
federal debt since the value of such area- 
specific debt will tend to be reflected in local 
property values. 
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"TABLE 2—ToraL GOVERNMENT VERSUS FEDERAL GOVERNMENT ONLY 


SELECTED COEFFICIENTS" x 
Equation - Sample Estimation .' Govt GS, TX, GB. . R. SER p DWS 
21 1931-40 First All 005 © —017 —0.0004 0.79 -:..574 - 1.7 
; .1947-85 Difference ': . (009) (013) (0.030) » 
22 1931-40 First . © Fed >` 002  -022 0018 0.79 . 57.4 - 1.8 
1947-85 Difference. ^ — - (009). (014) (0.034) ^ c 
2.3 1931-40 | ARI All 013. ~-0.16 0.081 0.9995: . 622 0.76 1.8 
. 1947-85 . (0.09) -.-(012) (0.032) 
24 1931-40 ARI Fed 013 . -031 0.150 0.9995.. 48.33 0.84 2.0 
^ 1947-85 . <a, 7(009) — (013) ^ (0044) 
2.5 E 1931-85 . First ` All  -010 --015. -0008 .` 0.75 -61.4 - 1.7 
i Difference: (0.03) (0.11) (0.026) : 
2.6 1931-85 First Fed | —0.00 | —0.30 0015 0.75 60.4 - 18 
Difference (0.03) ' (012) (0028) - ; 


*The estimated equations include ali of the :variables presented in Table 1; standard errors are shown in 


parentheses. 


The equations presented in. Table 2 com- 

pare the key fiscal coefficients based on 
Suntan using the spending, taxes, and debt 
of all governments with the corresponding 
coefficients based on equations using federal 
government spending, taxes, and debt. Al- 
though only the coefficients of the three key 
fiscal variables are shown, they are obtained 
from estimates of the full equations of the 
form reported in. Table 1; the full set of 
coefficients is available on request. Separate 
estimates are reported for- the first difference 
and ARI estimates. 

Equations .(2.1) and (2. 2), estimated in 
first-difference form, indicate that federal 
taxes have a more powerful and statistically 
more significant effect on consumption than 
the taxes of state and local governments. The 
other coefficients. are similar with insignifi- 
cant effects of government spending and ` 


government, debt.. In the ART estimates — 


reported in equations .. (2. 3) and (2.4) the 
principal difference is again that the tax co- 
efficient is larger and statistically more sig- 
nificant in the federal: specification than in 
the equation. for all governments. This is 
even true when the war years are included 
(equations (2.5) and (2.6)); the coefficient ‘of 
taxes in the federal. equation is —0.30 (with 
a standard error of 0.12) and therefore about 
three times as large as the coefficient of the 
government-spending variable. Thus’ when 
attention is. restricted to the federal fiscal. 
variables, taxes ‘are important even when the 


~ war years are nid slinodth government 
. spending is important only when the war 
years are included. 


IV. A Ratio: Specification 


One of the problems in making precise 


. inferences about the coefficient values is the 
collinearity among net national product and 


the various fiscal variables. The equations in 
Table 3 present an alternative specification 


. that reduces the problem of collinearity by 


dividing each of the variables by the current 
value of net national product. These equa- 
tions are estimated only for, the Speritication 


. Without the war years. 


Comparing the. coefficients of Tables, 2 
and 3 shows that the coefficient of the tax 
variable is much larger both absolutely and 
relative to its standard error, in the ratio 
specification than in the linear specification. 


|. ‘The coefficients of the government spending 
: variable generally remain positive and: not : 


statistically.significant. The coefficient of the 
debt variable is positive, larger than its stan- 


. dard error, and generally of a plausible mag- 
nitude. - 


. The ratio specification thus provides even 
stronger evidence against the. Ricardian 
equivalence . proposition. than the - linear 
equations of Tables 1 and 2. The estimated 
autocorrelation: parameters, also show that 
there is less autocorrelation in the ratio form 


: than in the linear form. 
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TABLE 3— VARIABLES AS RATIOS TO NNP 
1931-40, 1947-85; SELECTED COEFFICIENTS* 
Equation Estimation Govt GS, TX, GB, R? SER p DWS 
31 First All 0.004 —0.51 0.045 0.91 0.010 - 2 
Difference (0.11) (0.19) (0.032) 
3.2 First — ' Fed —0.02 —0.77 0.058 0.93 0.009 - 2.5 
Difference (0.10) (0.16) — (0.032) l 
3.3 ARI All 0.13 —0.69 0.039 0.996 0.007 0.49 1.9 
(0.09) (0.08) (0.024) 
3.4 i ARI Fed 0.16 —0.69 0.079 0.997 0.007 0.37 1.9 
(0.08) (007) — (0.015) 
*The estimated equations include all of the variables presented in Table 1; standard errors are shown in 
parentheses. i 
TABLE 4— POSTWAR SAMPLE: 1951—85 
SELECTED COEFFICIENTS* 
Functional 
Equation Form Estimation Govt GS, TX, GB, R? SER p DWS 
4.1 Level ARI All 0.02 —049 0.078 ' 0.999 472 0.52 1.8 
. (0.13) (0.17) * (0.043) ` ' ' 
4.2 Level ARI Fed | 002 —0.36 0113 0.999 464 0.53 2.0 
(0.14) (017) (0.051) 
43 Ratio ARI All 0.06 —0.62 0.136 0.95 0.006 0.51 2.0 
(0.14) (0.15) (0.042): 
44 Ratio AR1 Fed 0.08 — 0.60 0.190 0.96 . 0.005 0.53 2.0 
= (0.12) (0.16) | (0.034) . i i 


*The estimated equations include all of the variables presented in Table 1; standard errors care shown in’ 


parentheses. 


V. The Postwar Period 


Combining the prewar and postwar years 
provides a sample of 49 usable observations 
and substantial variation in government 
spending, taxes, and national debt. It is nev- 
ertheless interesting to look at a more recent 
period that avoids the special conditions as- 
sociated with the depression, the war, and 
the immediate postwar years. Table 4 pre- 
sents estimates based on the 35 years from 
1951 through 1985. The four equations in- 
clude the level and NNP-ratio specifications 
and are estimated for the federal government 
only as well as for all governments com- 
bined. The coefficients are estimated with an 
ARI transformation; the estimated autocor- 
relation coefficients are all approximately 
0.50. 

In all of the estimates, the coefficient of 
the government spending variable is small, 
positive, and much less than its standard 


error. In contrast, the coefficient of the tax 
variable is negative and larger than its stan- 
dard error. With the federal fiscal variables 
the tax coefficient is quite large and more 
than twice its standard error. The coeffi- 
cients of the government debt variable are 
always positive and generaly more than 
double their standard errors but also typi- 
cally larger than theory would suggest. 

The estimates based on -postwar data are 
therefore strongly contrary to the predictions 
of the Ricardian equivalence hypothesis. 


VI. Instrumental Variable Estimation 


The parameter values reported by Kor- 
mendi were all estimated without any at- 
tempt to deal with the problem of the endo- 
geneity of net national product and of the 
fiscal variables. That is also the approach 
that has been followed until this point in the 
present paper. It is easy to believe, however, 
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TABLE 5—-INSTRUMENTAL VARIABLE ESTIMATES FOR FULL SAMPLE 
1934—40, 1947-85 FOR FAIR’S METHOD; 1935—40, 1947-85 FoR First DIFFERENCE 
Selected Coefficients* 
Functional - : ] 

Equation Form Estimation Govt GS, TX, GB, R? SER p DWS 

51 Level First All 0.22 0.05 0.059 — 0.56 66.0 - 24 
Difference (0.21) (0.25) — (0.039) : 

5.2 Level Fair's All 0.26 —0.13 0.039 - 0.999 562 0.84 1.6 
Method (0.10) (0.18) — (0.028) 

5.3 Ratio First All —0.11 —0.39 0.053. - 0.80 0.012 - 17 
Difference (0.22) (0.35) (0.043) 

54 Ratic Fair's All 0.15 —0.36 0.080 | 0.99 .0.007 — 0.98 21 
Method (0.09) (0.15) (0.023) 

5.5 Level First Fed 0.17 0.10 0.053 0.34 80.9 - 2. 
Difference (0.25) (0.31) (0.056) 

5.6 Level Fair's Fed 0.19 —0.46 0.052 0.999 59.0 0.88 1.9 
Method (0.10) (0.16) — (0.033) 

5.7 Ratio First Fed | —029 —0.51 0.065. 0.78 0.012 - 1.7 
Difference (0.27) (0.42) (0.047) 

5.8 Ratio Fair's Fed . 0.07 —0.70 0.084 0.99 0.007 0.52 22 

' Method (0.07) (0.08) (0.015) 





*The estimated equations include all of the variables presented in Table 1; standard errors are shown in 


parentheses. 


that the current values of NNP and the fiscal 
variables will be correlated with the error of 
the consumption equation. A surprisingly 
large level of consumer spending would 
probably raise NNP and taxes and might 
reduce transfers and countercyclical govern- 
ment spending. To the extent that this is 
true, the ordinary least squares estimates 
would be biased: and inconsistent. 

An instrumental variable estimation pro- 
cedure can provide consistent and asymptot- 
ically unbiased estimates in this context. The 
practical problem is to find satisfactory in- 
strumental variables that are uncorrelated 
with the current disturbance to consumer 
spending but highly correlated with the en- 
dogenous explanatory variables. Variables 
like population that satisfy the first criterion 
completely generally do. poorly by the sec- 
ond criterion. In the present study we have 


used as instruments the past values of the. 


endogenous variables lagged two, three, and 
four years; that is NNP- NNP_;, 
NNP -4 GS, .,, GS, 4, GS, ,, TX, ,. etc. 
These variables are clearly correlated with 
the fundamental movements and short-term 
trends in the corresponding variables but 
will only be. correlated with the disturbance 
in the consumption equation to the extent 


that those disturbances have a high degree of 
serial correlation. While the instruments are 
not perfect, the use of instrumental variable 
estimation provides a check on the general 
qualitative properties of the ordinary least 
squares estimates. 

Table 5 presents instrumental variable es- 
timation for the entire sample with the war 
years omitted. In order to obtain the lagged 
values needed as instrumental variables it 
was necessary to drop the first four years 
from the sample; the sample therefore be- 
gins with 1935. In addition to instrumental 
variable estimates in first-difference form, we 
have also used Fair’s method to combine 
instrumental variable estimation and a con- 
sistently estimated first-order autoregressive 
correction.* Although the instrumental vari- 
able estimates inevitably appear less precise 
than ordinary least squares estimates, the 
implications of Table 5 are very similar to 
those of the previous ordinary least squares 


5Since our computer program could not apply Fair's 
method to a sample with a gap in tlie data, we have 
applied Fair's method to estimate our equation for the 
entire period from 1935 through 1985 but with individ- 
ual dummy variables for each of the six war years. 
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TABLE 6— INSTRUMENTAL VARIABLE ESTIMATES FOR POSTWAR SAMPLE: 1951-85 


Functional : 
Equation Form Estimation Govt GS, 
6.1 Level First ` All 
Difference (0.21) 
62 Level Fair's All . 0.07 
Method (0.10) 
6.3 , Ratio First All 0.21 
Difference (0.18) 
6.4 Ratio Fair's All. 0.11 
- Method (0.09) 
6.5 Level First ` Fed 
Difference (0.23) 
6.6 Level Fairs Fed 0.14 
Method (0.09) 
6.7 Ratio First Fed 0.22 
-- Difference (0.16) 
6.8 Ratio Fair's Fed 0.15 
Method (0.08) 


Selected Coefficients* 
TX, GE, R? SER » DWS 
0.24  —0.008 0.53 67.6 - 13 


(0.30) — (0.081) 
-011 0.000 0.999 577 0.996 16 


-076 0422 082 0008 - 22 
-066 0113 0.95 0.006 082 23 

0.20 -OD17 048 708 - 2 
—0.33 OD 0999 561 0997 19 
-0.81 0.137 084 0.008 - 24 


— 0.60 0.164 0.95 0.006 0.86 2.4 


*The estimated equations include all of the variables presented in Table 1; standard errors are shown in 


parentheses. 


estimates. The coefficient of government 

. Spending is generally positive and insignifi- 
cant while the coefficient of the tax variable 
is generally negative and larger than its stan- 
dard error. The coefficient of the government 
debt variable is always positive, generally 
greater than its standard error and of a 
roughly appropriate size. The use of Fair's 
method to correct for autocorrelation is gen- 
erally helpful in obtaining more precise and 
more stable coefficients. As we noted with 
the OLS estimates of Tables 2 and 3, the 
results are generally stronger for the federal 
government fiscal variables and for the ratio 
specification. . 

Table 6 presents instrumental variable es- 
timates for the postwar sample. The pattern 
of coefficients is again incompatible with the 
Ricardian equivalence proposition: generally 
positive and insignificant coefficients on gov- 
ernment spending, negative and generally 
Significant coefficients on the tax variable 
(with particularly strong effects in the ratio 
specification), and positive effects of the gov- 
ernment debt. 


VII. Concluding Comment 
Because of the restrictive assumptions re- 


quired to establish the theory of Ricardian 
equivalence, its relevance in practice is es- 


sentially an empirical question. Roger 
Kormendi’s paper appeared to provide 
strong empirical support for Ricardian 
equivalence by showing that increases in 
government spending on goods and services 
depress consumer spending while changes in 
tax receipts have no effect on consumer 
spending. 

The present study shows that Kormendi's 
results are a misleading implication of the 
experience during World War IL, when 
shortages, rationing, and patriotic appeals to 
self-restraint caused an abnormally high rate 
of saving at the same time that the govern- 
ment deficit-financed a uniquely massive in- 
crease in defense spending. When those years 
are excluded from the sample, Kormendi’s 
results are reversed. 

The'estimates presented here show that in 
the equation specified by Kormendi, but with 
the years 1941 through 1946 excluded, in- 
creases in tax receipts have had.a substantial 
negative effect on consumption while in- 
creases in government spending on goods 
and services have had essentially no effect on 
consumption. This evidence is exactly the 
opposite of the implications of Ricardian 
equivalence. This conclusion is robust with 
respect to a variety of modifications in the 
way that the basic equation is estimated: 
using an AR1 correction to deal with serial 
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correlation; limiting the analysis to the fed- 
eral government's fiscal variables; respecify- 
ing the variables as ratios to net national 
product to reduce collinearity; estimating for 
the most recent 35 years instead of for the 
period since 1931; and using an instrumental 
variable procedure to reduce the problem of 
endogeneity. In each of these specifications, 
the results indicate that taxes depress con- 
sumer spending while government outlays on 
goods and services have either a smaller or a 
totally insignificant effect. 

The present study has been limited to an 
analysis within the specification used by 
Kormendi. A different or more general spec- 
ification might lead to different conclusions. 
But the present study has purposely been 
restricted to the Kormendi formulation be- 
cause of the importance that has been at- 
tributed to Kormendi's evidence. The only 
proper inference that can be drawn from the 
present study is that Kormendi's own con- 
clusion is wrong and that, within his own 
specification, the evidence decisively contra- 
dicts the Ricardian equivalence proposition 
and supports the conventional view that 
higher taxes reduce consumption and that 
budget deficits caused by tax reductions 
therefore depress national saving. 
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Government Debt, Government Spending, 
and Private Sector Behavior: A Further Comment 


By FRANCO MODIGLIANI AND ARLIE G. STERLING* 


The response of Kormendi and Meguire 
(1986) to our earlier comment on Kormendi 
(1983) raises several issues which we feel 
need further clarification. They make three 
points: Ak i 


i) we impose restrictions on the way trans- 
fers and interest enter the consumption 
function and, when those restrictions are 
released, our results regarding the Ricar- 
dian Equivalence Proposition (REP) are 
reversed; 

i) differencing is to be theoretically and 
empirically preferred in estimating the 
consumption function and 

iii) that pre-War data add further support to 
their conclusions. l 


A. The Role of Temporary Taxes 


KM base their conclusions on results us- 
ing regression specifications similar to ours 
in all but one crucial respect, namely failure 
to take into account the role of temporary 
taxes. This specification can in no way be 
justified, either on theoretical or empirical 
grounds. Failure to take temporary taxes 
into account biases the results against ‘the 
LCH, by lumping together taxes that should 
have a large effect on consumption with oth- 
ers that should have a small effect, even 
under the LCH.! 


*Sloan School of Management, Massachusetts Insti- 
tute of Technology, Cambridge, MA 02139, and Mar- 
soft Incorporated, Cambridge, MA 02139. 

lOne may ask why we attribute such an important 
role to transitory taxes and not to the transitory compo- 
nent that undoubtedly exists in other variables such as 
income or government expenditure. The answer is that 
the temporary nature of certain tax measures—espe- 
cially those of 1969— was explicitly stated by the gov- 
ernment, was widely understood, and the amounts in- 
volved were very large. We have endeavored to allow 
for transitory components in other variables through 
the device of distributed lags. Equivalently we assume 


We test for the effect of temporary taxes 
by adding a variable * TEMPTAX" which is 
equal to the temporary taxes in every quarter 
in which such taxes were levied, and zero 
elsewhere. Since a temporary tax should re- 
duce consumption by less than a permanent 
one, we should expect the coefficient of 
temptax to be significantly positive, though 
less than the absolute value of the coefficient 
of T. (Note that the variable T includes the 
transitory component.) 

Once temporary taxes are taken properly 
into account, their role is found to be highly 
significant and the value of their coefficient 
is consistent with the hypothesis outlined 
above. Furthermore, the REP is decisively 
rejected. The results shown in the table illus- 
trate our points. We begin in columns 1 and 
2 by duplicating, for reference, the specifica- 
tion used in our earlier comment. We in- 
clude for generality a correction for auto- 
correlation, though the coefficient (rho) is 


` insignificant.? As in our earlier comment all 


«nn 


the flow variables take the form of an Almon 
polynomial distributed lags of first degree, 
including the current year and the four pre- 
ceding ones (hence each variable uses two 
degrees of freedom). Definitions of the vari- 
ables are given in the notes to the table. 

The results regarding the REP are essen- 
tially identical to those obtained before: 
consumers ignore government spending in 
evaluating their disposable income and 
wealth. This conclusion holds whether or not 
one imposes the “consistency condition" — 





that the (expected) permanent values of the variables 
can be approximated by a weighted average of current 
and previous values, with weights determined by the 
regression equation. 

2We follow Kormendi and Meguire and ignore si- 
multaneity issues. We note, however, that Sterling (1985) 
and Feldstein and Elmendorf (1990) treat the issue 
explicitly and find that an instrumental variable ap- 
proach does not alter the conclusions. 
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TABLE 1 — THE CONSUMPTION FUNCTION IN LEVEL FORM, 1952-84 


(1) (2)* (3) (4)* (5) (6)* 














Const. —043 -044  -004  -000  -—049  -039 
(005 ^ (006 (010 (012 (0.09) (0.09) 
A 0.022 0.021 0.018 0.019 0.007 0.013 
(0.004) — (0.005) (0.004) (0.004) (0.006) (0.006) 
G 0.096 0.093 0.050 0.065 0115  '0147 
(0.021) — (0.081) (0.022) (0.030) (0.030) (0.027) 
Y 0.93 0.94 0.79 0.74 0.97 0.77 
(0.02) (0.07) (0.04) (0.06) (0.03) (0.07) 
TEMPTAX 0.63 0.69 1.26 1.33 
(019 (035 (0.22) (0.35) 
DEF 0.19 0.09 —0.62 
(0.07) (0.17) (0.19) 
T -110 —0.78 —0.14 
(0.15) (0.22) (0.15) 
GS 0.13 012 —0.28 
(0.14) (0.18) (0.19) 
TR 0.54 0.63 0.29 0.49 
(008) — (011) (0.11) (0.11) 
RGINT 0.32 0.38 1.70 1.65 
(087 (043) (0.46) (0.37) 
rho 0.04 007 ~052  -054  -010 -030 
(020) (021) (0.18) (0.19) (0.21) (0.21) 
Se 0.00123 0.0127 0.0078 0.0078 0.0114 0.0098 


Like 103.46 103.93 121.58 123.01 107.49 114.13 


*In the columns marked with an asterisk Y is defined as net national product. In the 
remaining columns Y is defined as net national product less net taxes, as defined in the 
notes to the table. * 

The consumption function is in level form and all flow variables are estimated using 
a first degree four quarter (plus contemporaneous term) Almon lag. Rho. is the 
autocorrelation correction factor, calculated using TSP's maximum likelihood method. 
Se is the standard error of the equation. Like is the value of the log-likelihood function. 
In significance tests twice the difference between the values in the constrained and 
unconstrained specifications is distributed chi-squared with degrees of freedom equal to 
the number of restrictions. 

Variable Definitions: 


Y represents income (see the asterisk note to the table). 
T represents net taxes—total taxes less transfers and real net interest pay- 
ments. 


TEMPTAX represents temporary taxes, from the Economic Report of the President of 
1969 and 1975, as defined in our earlier comment. 

GS is government spending on goods and services. 

TR is gross transfer payments by the government sector to the private sector 
(excluding interest). 

RGINT is real net interest payments on general government debt, calculated by 
deducting the product of actual inflation and outstanding debt from 
nominal interest payments. 


DEF is government spending on goods and services minus net taxes. 

A is net worth of households (including government debt). 

G represents the sum of federal, state, and local net financial liabilities at 

book value. 

that the sum of the coefficients on govern- B. Interest and Transfers 
ment spending and taxes be equal and oppo- 
site to the propensity to consume from in- In addition to dropping temporary taxes, 
come before taxes. (That constraint is im- the KM specification adds transfers and in- 


posed in column 1 and released'in column 2, terest to equations (1) and (2). These vari- 
and the consistency condition is not rejected.) ables are already included in net taxes, T, 
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but we can see no objection to testing 
whether they might have an effect larger or 
smaller than that of other components of net 
taxes. Note in particular that under the REP 
hypothesis all transfers should have the same 
effect on consumption as all taxes, namely 
no effect at all. 

Thus our analysis proceeds by keeping 
temporary taxes in the specification but 
adding real net interest and transfers, as 
shown in columns (3) and (4). 

Comparing column (1) with (3) and (2) 
with (4), it is seen that, with or without the 
consistency condition (which is again not 
rejected by the data at the 5 percent level), 
there is a very large increase in the likelihood 
ratio. This evidence is consistent with the 
KM hypothesis that these variables may have 
effects on consumption different from that of 
taxes (though the coefficient of interest is not 
significant). But the results in columns (3) 
and (4) in no way support the REP. In (3) 
the deficit is positive—though insignificant 
and in (4) government spending receives a 
positive coefficient (but again completely in- 
significant). Thus KM’s claim, that our con- 
clusions regarding REP are reversed once 
the constraints we imposed on interest and 
transfers are released, is contradicted by 
these results. 

As a comparison of columns (3) and (4) 
with columns (5) and (6) makes clear, KM 
came to their incorrect conclusion because 
they overlooked the role of temporary taxes. 
If temporary taxes are not included, the 
deficit and tax coefficients estimates could 
appear to support REP. But that omission is 
clearly rejected by the data—temporary taxes 
have a t-ratio of 5.7 in (3) and 3.8 in (4). 

One final comment is in order with regard 
to the role of transfers. According to (3) and 
(4), the effect of transfer payments would be 
extraordinary indeed, amounting, according 
to equation (3), to 0.79+0.09+ 0.54, or 1.42 
dollars of consumption per dollar of trans- 
fer, with a hefty level of significance. If 
taken at face value, this estimate constitutes 
one more impressive piece of evidence reject- 
ing REP, according to which the effect of 
transfers should be zero. 

But we must point out that this estimate is 
absurdly high, both in level and relative to 
the estimated nronensitv ta consume. In ad- 
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dition, these variables also result in an 
equally improbably high estimate cf the co- 
efficient of temporary taxes. We are there- 
fore inclined to regard equations (3) and (4) 
as a Statistical freak and to put credence in 
our original specifications (1) and (2). But, 
whether one prefers one specification or the 
other, REP stands soundly rejected. 


C. Differencing 


Kormendi and Meguire report that the 
residuals of their specifications—estimated 
in level form—show evidence of autocorrela- 
tion. As shown by the row labeled “rho” we 
also find some evidence of autocorrelation 
with the Kormendi and Meguire specifica- 
tion of columns (3) and (4). Strikingly, how- 
ever, the residuals are negatively serially cor- 
related and the value of rho is —0.52 and 
—0.54, respectively. In all cases the data 
reject the hypothesis that rho could be con- 
strainted to equal 1.0, the value implicit in 
the differenced specification. In columns (5) 
and (6) the point estimate of rho is more 
than eight standard deviations from unity. 

Engle and Granger (1987) show that, for 
the level form maximum likelihood methods 
we have used to be appropriate, it is suffi- 
cient to show that consumption and the ex- 
planatory variables used in our regressions 
are cointegraied. All the equations we report 
pass their first test for cointegration: the 
Durbin-Watson statistic of the untrans- 
formed equation is positive and so large (for 
example, in the untransformed version of 
equation (2) it is 1.90) that cointegration 
cannot be rejected. These results are consis- 
tent with those of Engle and Granger in 
their analysis of per capita consumption of 
nondurables and disposable income (in the 
work cited in Section 6). 

Given the strong rejection of the differ- 
enced specification it is no surprise that the 
coefficients reported by Kormendi and 
Meguire are different from those obtained 
using the level specification shown here. 


D. The Irrelevance of Tests Including the 
Prewar Data 


In the tests reported above we have fo- 
cused exclusivelv on the postwar span from 
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1952 to 1984, on the grounds that this sam- 
ple is quite large and that no useful informa- 
tion can be obtained by reaching back to the 
years 1931 to 1951. The reason is that the 
bulk of that period— well over half—con- 
sists of (i) the chaotic years of the great 
depression, and (ii) the war years and the 
immediately surrounding ones. The depres- 
sion period is clearly most unsuited to learn 
something about normal consumer behavior. 
And, as for the war years, it is well known 
that saving then was exceptionally high. This 
is likely to reflect patriotic motivations as 
well as the unavailability of many commodi- 
ties. At the same time, that period is also 
characterized by a huge deficit, the largest in 
American history. 

This combination clearly has the effect of 
biasing the coefficient estimates in favor of 
REP. Therefore one might be justified in 
including at most a handful of years, say 
1940—41. and 1948-51. However the data 
that are available for the postwar period do 
not exist for the earlier period. Some infor- 
mation can be pieced together for these ear- 
lier years but it is clearly less reliable and 
not directly comparable with the data for the 
postwar period. Thus, in order to add the 
prewar years, KM ended up by not using the 
more reliable data available for the postwar 
period. We conclude therefore that estimates 
based on the combined period 1931-1984 
should be disregarded. 


E. Conclusions 


We conclude that there are no grounds for 


changing our original proposition that the 
postwar data provide no support for the. 


hypothesis that consumers consider govern- 
ment spending or the deficit when making 
their consumption decisions. Kormendi and 
Meguire arrive at a contrary conclusion by 
overlooking the theoretically and empirically 
important effects of temporary taxes, using 
an inefficient differenced specification, an 
unrepresentative sample, and less reliable 
data. 
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Government Debt, Government Spencing, and Private 
Sector Behavior: Reply and Update 


By RoGER C. KORMENDI AND PHILIP MEGUIRE* 


The preceding articles by Martin Feldstein 

and Douglas Elmendorf and by Franco 
Modigliani and Arlie Sterling give us a wel- 
come opportunity to return to the effects of 
fiscal policy on private consumption. At stake 
in this debate, we believe, is a potential 
paradigm change—from what Kormendi 
(1983) termed the Standard Approach, which 
bases private consumption on disposable 
. personal income, to what he termed ‘the 
Consolidated Approach, which bases con- 
sumption on aggregate income, government 
spending, and transfer payments, each with 
separate effects. The Standard Approach ex- 
‘cludes Ricardian equivalence a priori. The 
Consolidated Approach not only incorpo- 
rates Ricardian equivalence but, in its aug- 
mented form, allows one to nest the various 
hypotheses associated with the two .ap- 
' proaches. 


Feldstein and Elmendorf argue that Kor-. 


mendi’s (1983) results (and implicitly those 
of Kormendi and Meguire, 1986), which re- 
ject the Standard Approach in favor of .the 
Consolidated /Ricardian alternative, are not 
robust to the exclusion of data from World 
War II and other specification changes. 
Modigliani and Sterling argue that account- 
ing for “temporary taxes” reverses our re- 
buttal of their 1986 comment. We first take 
up the challenge of Feldstein and Elmendorf 
before turning to Modigliani and Sterling. 
We then assess the validity of our preference 
for estimating in differences, by testing 
whether consumption, income, and the fiscal 
variables are cointegrated. Finally, we sum- 
marize what can be learned from the debate. 


"The University of Michigan, School of Business 
Administration, Ann Arbor, MI 48109-1234, and Mid 
America Institute for Public Policy Research, Chicago, 
IL 60615. 
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I. Feldstein and Elmendorf 


Feldstein and Elmendorf focus on “two 
key empirical questions” in Kormendi (1983), 
namely, the relative effects of taxation and 
government spending on private consump- 
tion. Under Ricardian equivalence, (a) taxa- 
tion should have. no effect on private con- 
sumption, and (b) government spending, 
however, should have a. negative effect, since 
spending summarizes the true resource bur- 
den of the government sector on the private 
sector. Under what Feldstein and Elmendorf 
term “traditional theory,” taxation should 
have a negative effect and government 
spending should have no effect. These di- 
chotomous predictions’ allow one to distin- 
guish between Ricardian and traditional the- 
ories. Testing these predictions was central 
to Kormendi (1983) and Kormendi and 
Meguire (1986), whose results were strongly 
consistent with the Consolidated Approach 
in general, and with the above Ricardidn 
predictions in particular.! 

Feldstein and Elmendorf contend that 
Kormendi’s (1983) results are not robust 
when data for World War II are omitted 
from the sample, while their results—strong 
negative effects of taxation on private con- 
sumption but no effects on government 
spending—are robust to a number of alter- 


By allowing transfer payments and government 
spending to have consumption effects that depend upon 
their composition, tae Consolidated Approach of Kor- 
mendi (1983) is mere general than Ricardian equiva- 
lence. While a “strong” form of Ricardian equivalence 
implies that transfers should not affect the time path of 
aggregate consumption, Kormendi and Meguire (1986, 
p. 1186) show how the Consolidated Apprcach can 
reconcile the Ricardian and Life Cycle hypotheses via a 
suitable understanding of the role of transfers. See also 
Kormendi and Meguire (1989), who explore this recon- 
ciliaton using a functional decomposition of transfers. 
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_ native specifications. In this section, we show 
that the results of Kormendi (1983) and 


Kormendi and Meguire (1986) are in fact ~ 


fully robust to all of the alternative specifi- 
cations proposed by Feldstein and Elmen- 
dorf. We then show how to reconcile our 
results with their apparent challenge. 


A. Two Theoretical Issues 


Feldstein and Elmendorf make two theo- 
retical assertions that we wish to address 
before turning to their empirical work. First, 
they state that “...the absence of a negative 
effect of government outlays on consumer 


spending is clearly contrary to the Ricardian . 
equivalence proposition.” While this may be. 


true under a strong version of Ricardian 
theory, in which government spending is im- 
plicitly either dissipative or utility-separable, 
it would not necessarily be true under the 
more general Consolidated Approach of Kor- 
mendi (1983). In particular, if the bulk 
of government spending were devoted to 
nondissipative investment projects that sub- 
stituted for private investment but not for 
private consumption, then the Consolidated 
Approach would predict a zero effect of such 
spending on private consumption. Alterna- 
tively, if government spending were on goods 
and services that were at least partially com- 
plementary to (instead of substitutable for) 
private consumption, then currént period 
government spending could have zero or 
positive effects on private consumption. 
Second, Feldstein and Elmendorf assert 
that the existence of a “ 
effect of government outlays on consumer 
spending is not in itself evidence in favor of 
Ricardian equivalence," but is. possible un- 
der "traditional theory' as modified by 


Feldstein (1982). We ads Feldstein and | 


Elmendorf: 


..consumers may correctly believe that 
a rise in current government spending 
is a good indicator of a higher level of 
future government spending. Once a 
-program is launched or budgets in- 
creased, the process is unlikely to be 
reversed. An increase in, current govern- - 
ment spending is therefore a good indica- 
tion that future taxes will have to. be 


..moderate negative - 
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higher to finance a higher level of future 
government ‘spending. Individuals may 
rationally reduce their own spending 
when government outlays increase 
without a concurrent increase in taxes 
because they anticipate higher future 
taxes to finance higher future govern- 
ment spending. |emphasis added] 


Ironically, this quotation embodies a line of 
reasoning virtually identical to that underly- 
ing the Ricardian equivalence proposition, 
namely, that the private sector correctly ac- 
counts for the future taxes implied by cur- 
rent fiscal policy. 


B.A Brief Review of eee s 
Empirical Results 


. Feldstein and Elmendorf present in their 
Table 1 regressions that attempt first to 
replicate, and then.to rebut, Kormendi's 
(1983) results while staying “...as close as 
possible to his specifications and definitions 


in constructing the regression variables." 


While they claim that their equation (1.2) 
replicates reasonably well Kormendi's (1983) 
equation (5.1), we point out that Barth, Iden, 
and Russek (1986, Table 1) achieve a much 
closer replication. Feldstein and Elmendorf 
then update their data base, incorporating 


the 1985 benchmark revisions of the national 


income accounts. Their equations .(1.3) and 
(1.4) yield marginally significant negative co- 


-efficients for taxes while reducing the signif- 


icant negative coefficient for government 
spending. In (1.5) which omits. the years 
1941-46, the negative effect of government 


. spending vanishes, although the tax effect of 
: —0.17 has a t-statistic of only — 1.3. 


In their Tables 1 through 6, Feldstein and 
Elmendorf explore the consequences of vari- 
ous combinations of four main specification 
changes: (1): estimating in levels with an 
ARI correction, (2) using, fiscal variables 
measured only over the federal government, 
(3) specifying all variables as ratios to NNP, 
and (4) estimating via instrumental vari- 
ables. They. also explore the effects of re- 
stricting the sample to the period 1951-85. 
The apparent force of Feldstein and Elmen- 
dorf's argument emerges only when two or 
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TABLE 1— UPDATED KORMENDI SPECIFICATION, WITH ALTERNATIVES 
SUGGESTED By FELDSTEIN AND ELMENDORF 





Equation (1.1) (1.2) (1.3) 
Sample 1931-85 1931-40 1930—40 
Period 1947-85 1947-85 
Level Data 
Alternative Omit with ARI 
Specification War Correction? 
Y, 028 0.30 0.29 
(0.06) (0.06) (0.06) 
Yu 0.07 0.09 0.09 
(0.03) (0.03) (0.03) 
GS — 0.26 — 0.24 — 0.24 
(0.03) (0.07) | (0.08) 
TX 0.06 0.05 0.14 
(0.10) (0.12) (0.12) 
TR 0.73 0.85 1.10 
(0.21) (0.21) . (0.17) 
RGINT 0.03 0.31 : 037 
(0.07) (0.10) (0.09) 
GB — 0.035 0.034 0.058 
(0.024) (0.032) (0.033) 
Ww 0.012 0.013 0.018 
(0.008) (0.008) (0.007) 
RE 0.17 0.07 0.09 
. (0.17) (0.17) (0.17) 
R? ` : 0.766 0.815 0.976 
SER 0.0285 0.0264 0.0260 
DW 14: 1.5 1.6 
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(1.4) (1.5) (1.6) (1.7) 
1931-40 1931-40 1935-40 1951-85 
1947-85 1947-85 1947-85 
Federal 
Data Ratio Form Instrumental Postwar 
Only with 1/Y, Variables® Data 
0.32 zE 0.22 0.31 
(0.06) (0.10) (0.11) 
0.07 0.05 0.06 0.12 
(0.03) (0.02) (0.05) (0.05) 
— 0.24 — 0.25 — 0,25 —0.22 
(0.07) (0.06) (0.11) (0.11) 
—0.03 —0.12 0.17 0.00 
(0.13) (0.12) (0.22) (0.16) 
0.78 0.68 0.67 0.81 
(0.24) (0.14) (0.40) (0.29) 
0.39 0.09 0.29 0.44 
(0.12) (0.06) (0.19) - (0.30) 
0.074 —0.021 0.059 0.052 
(0.034) (0.025) (0.043) ` (0.045) 
0.016 0.017 0.017 0.006 
(0.007) (0.007) (0.010) (0.010) 
0.00 — 0.06 0.07 0.06 
(0.16) (0.11) (0.23): (0.24) 
0.822 0.967 0.680 0.706 
0.0259 0.0057 0.0278 0.0266 
1.4 1.8 1.5 1.7 


Note: All variables, including the dependent variable consumption, are in real per capita terms. We derive 
consumption, Y, and GS directly from real data; we obtain the other variables by dividing nominal data by the NNP 
deflator. RGINT is real government interest payments. All data are post the 1985 benchmark revision of the NIPA. 
For additional details on the data and variables, see the Data Appendix here and in Barth et al. (1986). - 

Standard errors of estimated coefficients are shown in parentheses. All regressions estimated with a constant (not 
reported). R? is not adjusted for degrees of freedom. SER = standard error of the regression; DW = Durbin-Watson 
statistic. Except for (1.3), all regressions were estimated over differenced data. In (1.4), we compute GS, TX, TR, 
RGINT, and GB from data for the federal government alone, ignoring grants-in-aid to state and local governments. 


In (1.5), we express all variables as ratios to NNP, with 1/Y, included among the regressors. 
* Estimated (Cochran-Orcutt) residual first-order serial correlation coefficient in 0.88. 
Coefficient of 1 / Y, is 0.77 (0.12). The constant is 0.0025 (0.0012). 

“Instrument list: constant, W, GB, RGINT, RE, and lags 2 to 4 of Y, GS, TX, and TR. 


more of these alternative specifications are 
combined. Whereas the tax coefficient is neg- 
ative but only marginally significant in their 
Table 1, that coefficient is often large and 
negative in Tables 2 through 6. 

Taken as a whole, Tables 1 through 6 in 
Feldstein and Elmendorf seem to contain 
strongly anti-Ricardian estimates. If these 
results were to stand up to detailed scrutiny, 
we would gladly yield to the weight of the 
evidence. ' 


C. The Robustness of the Consolidated 
Approach to the Feldstein-Elmendorf 
Specification Changes 


Table 1 presents the Feldstein and Elmen- 
dorf alternative specifications estimated us- 
ing our own data. In particular, we follow 
Barth, Iden, and Russek (1986) and Kor- 
mendi and Meguire (1986) in our variable 
definitions, and apply these to the revised 
national income accounts. Further details on 
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the data and the construction of the regres- 
sion variables are given in the. Data Ap- 
pendix to this paper. 


Equation (1.1) in Table 1 reestimates the - 


canonical Augmented Consolidated -specifi- 
cation of Table 4, column 2, in Kormendi 
and Meguire (1986), over the 1931—1985 pe- 
riod. The results are very close to those we 
published in 1986; if anything, they provide 
even stronger evidence in favor of Ricardian 
equivalence and the Consolidated Approach. 
Note especially that the spending and tax 
coefficients in our (1.1) are —0.26 and 0.06, 
respectively, whereas these coefficients in 
Feldstein and Elmendorf's (1.4) are —0.10 
and —0.15. 

In equations (1.2) through (1.6), we delete 
the World War II years, first for the canoni- 
cal specification (1.2), and then for Feldstein 
and Elmendorf’s four main alternative speci- 
fications taken one at a time (1.3-1.6). Equa- 
tion (1.7) is (1.1) estimated over the 1951-85 
period. These estimates all bear out the Con- 
solidated Approach, that is, large, negative 
and significant effects of government spend- 
ing combined with small and insignificant 
effects of taxes? 


D. Reconciling Feldstein- Elmendorf 's 
Results with Those of Table 1 


The results in Table 1 above are both 
dramatically different from those reported 
by Feldstein and Elmendorf and fully con- 
sistent with our earlier results. In Table 2, we 
show the steps that bridge the gap between 
their results and ours. We report the govern- 
ment spending and tax coefficients* (along 


See also the meticulous Data Appendix of Barth, 
Iden, and Russek (1986). 

IIn specifying their ratio-to-NNP specification (1.5), 
Feldstein and Elmendorf do not include 1/Y,, and 
hence fail to take the constant into account. The :-sta- 
tistic of 1/Y, is 6.5, and hence we include it in all the 
ratio form results reported here. In only a few cases, 
however, does failing to include 1/Y, in (1.5) have 
much effect on the coefficients of either geverment 
spending or taxes. ; 

*In the regressions underlying Table 2, the coeffi- 
cients for government debt were generally insignifi- 
cantly different from zero, xo as many negative esti- 
mates as positive ones. 
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with their standard errors) for the five key 
specifications, each estimated with and with- 
out the war years: the “basic” differenced 
specification of Kormendi (1983), level data 
with an ARI correction, federal fiscal vari- 
ables only, ratio-to-NNP form, and instru- 
mental variables. Under each specification, 
we show the number of the corresponding: 
Feldstein-Elmendorf regression. 

Column 1 of Table 2 presents our attempt 
to replicate Feldstein and Elmendorf. With 
the exception of the instrumental variables 
(IV) estimates, our replications are very close 


(and our IV replications are generally more 
. favorable to their hypotheses than their own 


IV results are). Taken as a whole, however, 
the results in column 1 show that, contrary 
to Feldstein-Elmendorf's claims, there is in 
fact little material difference between esti- 


.mates for the full period and those with 


World War II omitted. Both samples consis- 
tently yield negative coefficients for the tax 
variable. Although the government spending 
coefficients are significantly negative in the 


full sample, they are nearly always smaller in 


magnitude than the tax coefficients. More- 
over, as discussed above, Feldstein and 
Elmendorf themselves assert that negative 
government spending coefficients are not 
necessarily evidence in favor of Ricardian 
equivalence. 

Hence Feldstein and Elmendorf's anti- 
Ricardian results are not the result of omit- 
ting the war years from the sample.) Rather, 
their results stem from two seemingly incon- 
sequential choices in the construction of their 
data. 

We now address these choices in turn. 
First, Feldstein and Elmendorf’s defiate 
nominal data by. the consumption expendi- 
tures deflator. In making this choice, they 
overlooked the availability of published real 


5To those who have read Kormendi (1983), Kor- 
mendi and Meguire (1986), or Barth, Iden, and Russek 
(1986) carefully, it should come as no surprise that the 
war years are not the key factor driving Feldstein.and 
Elmendorf's results. These three papers investigated in 
detail the sensitivity of estimates to various subperiods, 
including ones that did not include the war years, and 
found no material evidence of inhomogeneity, especially 
for the govérnment spending and tax coefficients. 
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TABLE 2—RECONCILING FELDSTEIN, AND ELMENDORF'S GS AND TX COEFFICIENTS WITH TABLE 1 
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, (1) (2) 
Author: FE ` 
Improved Variable 
. Definitions No No 
Use Published 
Real Data No Yes 
Deflator for Consumption Consumption 
Nominal Data Expenditures Expenditures 
Specification . 
(Number of 
Corresponding . 
Estimate in FE) GS TX GS TX 
1931-85 E 
Basic —010 —0.14 —021 0.01 
'(1.4) (0.03) (0.11) (0.03) (0.10) 
ARI -011 —0.09 | —024* 016* 
(1.8) (0.03) (0:12) (0.03) (0.11) 
Federal only  —0.09 —028 — -021 -010 
(2.6) .(0.03) (0.11) (0.03) (0.11) 
Ratio? —0.10 —0.35 | -023 -015 
(NR) (0.03) (0.11) (0.02) (0.09) 
Instrumental —0.07 —0.18 —022 :—-0.01 
Variables (NP) — (0.06) (0.30) (0.06) (0.25) 


1931—40, 1947-85 
. Basic 


0.05 —016 


—017 004 


(1.5) (0.09) (0.3) (008) (013) 
ARI 0.11 -013 ^-016 015 
. (1.7) (0.08) (0.12) (0.08) (0.11) 
Federal only 0.03 —0.32 -019 —0:09 
(22) (0.09) (0.14) (0.08) (0.15) 
Ratio? —0.05 —0.47 —024 .-0.19 
(NR) (0.08) (0.13) . (0.06) (012) 
Instrumental 0.01 —0.03 — 0.24 0.06 


Variables (5.1) 


(0.16) (0.22) 


(0.18) (0.24) 














Q) (4) (5) 
KM. 
Yes Yes Yes 
No Yes Yes 
Consumption Consumption NNP 
GS TX GS TX GS TX 
—022 0.02 —0.26 0.15 —0.26 0.07 
(0.03) (0.10) (0.03) (0.10) (0.03) (0.11) 
—0.23 0.06 —0328 0.27 —0.30 0.16 
(0.03) (0.11) (0.03) (0.10) (0.03) (0.12) 
020 —0.04 0.25 0.08 —0.25 —0.05 
(0.03) (0.12) (0.03) (0.13) ,(0.03) (0.11) 
—020 —0.12 —027 0.01 =0.26 —0.10 
(0.02) (0.10) (0.02) (0.10) (0.02) (0.09) 
—0.26 —0.04 —0.28 0.13 -0.26 0.01 
(0.08) (0.28) (0.08), (0.30) (0.06) (0.27) 
—0.08 —0.03 —023 0.10 —023 0.06 
(0.09) (0.12) (0.09) '(0.12) r (0.08) (0.12) 
—0.02 —0.01 —0.19 - 0.20 -0.23 0.16 
(0.08) (0.11) (0.09) (0.11) (0.08) (0.12) 
—(.0 —0.07 —024 0.01 —0.3 -0.03 
(0.08) (0.14) (0.08) (0.15) (0.07) (0.14) 
0.08 —022 0.26 —0.08 —024 —017 
(0.07) (0.13) (0.06) (0.13) (0.06) (0.12) 
—0.13 . 0.08 —0.24 0.22 —026 0.17 


(015) (0.24) 


(0.18) (0.27) 


(0.12) (0.23) 





Note: Standard errors of estimated coefficients are shown in parentheses. We will make available upon request the 
complete regression corresponding to any entry in this table. FE = Feldstein and Elmendorf (1990); KM = Kormendi 
and Meguire (1986); NR = not reported by FE. ` 

In (1), we used FE's actual values (courtesy of Douglas Elmendorf) for consumption (the dependent variable), 
population, GB and W. We use their values for W throughout this table, so that the results under (5) do not 
correspond exactly to those in Table 1. For further details on the data and computation of the variables, see the Data 
Appendix and the note to Table 1. 

Except for the data differences described in the preceding paragraph, the Easic specification is the same as (1.1) 
and (1.2) in Table 1. We compute ARI estimates from levels data with a correction for residual serial correlation, 
either maximum likelihood (1931-85) or Cochrane-Orcutt (otherwise). All other specifications are estimated using 
OLS over first differences and are described in the note to Table 1. 

* Maximum likelihood algorithm failed to converge. 


5 With 1/Y,. 


(constant dollar) data for net national prod- 
uct, government spending, and the compo- 
nents of private consumption, assuming ex- 
plicitly that doing so involved only “small 
differences." Moreover, since.the dependent 


warinhla i6 nat a mancura af enncunmntian 


expenditures but rather of consumption flow, 
the implicit deflator for the latter would be 
the more consistent choice. Whether to use 


‘the available real data or data deflated by 


some other deflator is an instance of the 


elacciaal “ainday numbhar nrahlam 7 and ae 
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such cannot be resolved apriori.Ó However, 
in light of Feldstein and Elmendorf’s stated 
purpose “...to assess the sensitivity of Kor- 
mendi’s conclusion to the inclusion of the 
World War II years, [while attempting] to 
stay as close as possible to his specification 
and his definitions” (emphasis added), we 
now consider the sensitivity of their results 
to deviating from our earlier work in this 
respect. 

To this end, we present in column 2 of 
Table 2 estimates identical to those in col- 
umn 1, except that Y, GS, and consumption 
are constructed from published real data. 
The results for all specifications are now 
fully consistent with the Consolidated Ap- 
proach, whether or not the World War II 
years are excluded. Government spending 
has significant negative effects of about the 
magnitude uncovered by Kormendi (1983), 
Barth, Iden, and Russek (1986), and Kor- 
mendi and Meguire (1986), while taxes have 
small and insignificant effects of mixed sign. 

Second, Feldstein and Elmendorf do not 
implement the improved data and variable 
definitions that emerged from the 1986 de- 
bate. For example, Barth, Iden, and Russek 
(1986) meticulously reconstructed, correct- 
ed, and extended Kormendi’s (1983) origi- 
nal data and variable definitions. Further, 
Modigliani and Sterling (1986) argued cor- 
rectly that the Fisherian inflation premium 
should be netted from nominal interest pay- 
ments to obtain a measure of real gov- 
ernment interest payments, RGINT. In 
Kormendi and Meguire (1986), we acknowl- 
edged these improvements and added some 
of our own, including incorporating Cox’s 
(1985) new series on the market value of 
federal debt outstanding into our govern- 
ment debt and real government interest vari- 
ables. For these reasons, one should not 
proceed from Kormendi (1983), as did Feld- 
stein and Elmendorf, but rather from the 
data definitions emerging from the 1986 de- 
bate. 


SReal data are not available for financial variables 
such as taxes and transfers. Hence these must be de- 
flated by some deflator. Kormendi (1983) chose the 
NNP deflator out of consistency with his measure of 
income. The: implicit deflator for consumption is a 
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Columns 3 through 5 of Table 2 show the 
effects of incorporating these data improve- 
ments. In column 3, we grant Feldstein and 
Elmendorf their choice of using only nomi- 
nal data deflated by a uniform deflator. 
However, since consumption expenditures is 
not the dependent variable in either Feld- 
stein and Elmendorf’s regressions or ours, 
we use the deflator implied by the actual 
dependent variable, consumption flow. We 
then return to the real data on national 
product and its components, while deflating 
the financial variables by either the implicit 
consumption flow deflator (column 4) or the 
NNP deflator (column 5) as in Kormendi 
(1983). Except for granting Feldstein and 
Elmendorf their data on private wealth (a 
variable not at issue), column 5 returns to 
the data choices underlying Table 1. 

In almost all cases, the results in columns 
2 through 5 support the Consolidated Ap- 
proach. In almost every case, the tax coeffi- 
cients are insignificant, and in the vast 
majority of cases they are close to zero. 
Furthermore, there are as many positive tax 
coefficients as negative ones. In almost all 
cases, the government spending coefficients 
are negative and highly significant. Only the 
estimates in column 3 without the war reveal 
an insignificant effect of government spend- 
ing. Even in these cases, however, Chow tests 
do not reject the hypothesis that the coeffi- 
cients estimated without the war are equal to 
those for the full sample.’ OR 

The implications of Table 2 are easily 
summarized. Feldstein and Elmendorf's re- 
sults obtain only as the joint consequence of 
(1) ignoring available real data, and (2) fail- 
ing to incorporate state-of-the-art data and 
variable definitions. Had they chosen other- 
wise in either respect, they would have ob- 
tained results consistent with Kormendi 
(1983) and Kormendi and Meguire (1986). 
Moreover, and contrary to their claim, nei- 
ther their results nor our own depend in any 
fundamental way on the inclusion or exclu- 
sion of data for World War II. 


7For example, the F-statistic for the homogeneity of 
the coefficients of the Basic regression is 1.69 with 
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TABLE 3— CONFIRMING THE TESTS OF KM (1986) WITH TEMPORARY TAXES 


Test 


1. Are Net Tax and Consistency 
Restrictions Accepted Over 
1952-84? (KM, Table 2) 


2. Adding 1933-51 to: 
as PDL/Levels With Net Tax and 
Consistency Restrictions 
(KM, Table 3) 
b. PDL/Levels Without Net Tax and 
Consistency Restrictions í 
(KM, Table 4, col. 1) 


3. Adding 1931-51 to the Augmented 
Consolidated Specification Estimated 
in Differences (KM, Table 4, col. 2) 


F-Statistic F-Statistic with 
from Temporary Taxes (TT) 
KM (1986) Included in Regression 
(DOF) (DOF) 
MSL MSL 
5.9 6.8 
(6,20) (6,18) 
0.001 < 0.001 
1952-83 1952-85 
6.8 12.6 
(19,24) (19,24) 
« 0.001 « 0.0001 
43 3.3 
(19,18) (19,18) 
0.002 0.007 
11 1.55 
(21,22) (21,23) 
0.41 0.15 


Note: MS = Modigliani and Sterling (1990). KM = Kormendi and Meguire (1986). 
Under the null, the statistics shown have F sampling distributions with (numerator, 
denominator) degrees of freedom (DOF). MSL = marginal significance level. PDL = 
first-degree polynomial distributed lags. The regressions underlying (2a) and (2b) differ 
from (1) and (4) in Table 1 of MS only in that they were estimated using the data 
described under “Data for Tables 3 and 4” in the Data Appendix. The regressions 
underlying (3) are similar to (1.1) in our Table 1. 


BH. Modigliani and Sterling 


In our 1986 paper, we showed how to nest 
the restrictions implicit in Modigliani and 
Sterling’s (MS) concept of “net taxes” (i.e., 
total taxes net of transfer payments and real 
government interest) into an unrestricted 
specification and so test the validity of the 
MS restrictions. We then established three 
findings: (1) granting Modigliani and Ster- 
ling their sample period and specification 
(polynomial distributed lag over levels data, 
henceforth *PDL levels"), their implicit re- 
strictions were rejected by their own data; 
(2) granting Modigliani and Sterling their 
PDL/levels specification but extending the 
sample period back to 1933 (using variable 
definitions as close to theirs as possible), 
their specification failed a test of homogene- 
ity over the full period, whether or not the 
MS restrictions were imposed; and (3) in 
contrast, Kormendi's Augmented Consoli- 
dated specification was temporally homoge- 
neous, yielding coefficients for government 
spending and taxation that rejected the MS 


specification in favor of the consolidated 
Approach. 

'In their rejoinder, Modigliani and Sterling 
(1990) claim that "[fJailure to take tempo- 
rary taxes into account biases the results 
against the Life Cycle Hypothesis." In 
Table 3, we present the F-statistics for the 
four tests undertaken in Kormendi and 
Meguire (1986). Column 1 contains for refer- 
ence the results obtained in our earlier pa- 
per. Column 2 contains the same tests with 
"temporary taxes" àdded to the regressions. 
The inclusion of “temporary taxes" has no 
material effect on these tests. The net tax 
and disposable income restrictions in MS's 
preferred specification are still rejected. 
Whether or not these restrictions are im- 
posed, the MS specification still fails a test 
of homogeneity when the data are extended 
back to 1933. Meanwhile, the data still sup- 
port the homogeneity of the Augmented 
Consolidated specification, although not as 
strongly as before. 

In Table 4, we present the government 
spending and tax coefficients from the eight 
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TABLE 4— GOVERNMENT SPENDING (GS) AND TAX (TX) COEFFICIENTS 
WITH AND WITHOUT TEMPORARY TAXES (TT ): MS Net Tax 


S - AND CONSISTENCY RESTRICTIONS RELEASED ] 


Number i Sample GS Spe PE TT 
MS Specification: Levels Data, Polynomial Lags : . 
a) - 1952-84 —0.32 —0.12 - 
(0.20) (019 
a^ ; 1952-84 —0.03 —0.63 0.60 
(023) (0.32) (0.37) 
Q) . . ' | 1933-85 —0.17 0.23 - 
i (0.06) (0.26) 
(25 l 1933-85 — 0.24 0.29 ' 1.69 
(0.04) : (0.20) (0.44) 
KM Specification: Differenced Data, No Tags - ; 
(3) 1952-85 —026 0.00 = 
(0.14) (0.16) 
^ 1952-85 —0.27 —0.16 0.69 
: MAU (0.13) (0.16) (0.31) 
(4) 1931-85 2026: 0.06 DOM 
(0.03) (0.10) " 
(4^) 1931-85 —0.25 0.01 0.62 ' 
(0.03) (0.10) (0.28) 


Note: KM = Kormendi and Meguire (1986). MS — Modigliani and Sterling (1986). TT 
for 1931—47 and 1985 is 0. When TT appears in a regression, T'X is net of TT. Data are 
as follows: (1) and (1^, MS Table 3; (2) and (2^), same as those used to estimate col. 3 
of Table 2 except for the addition of EQ; (3) through (4), same as those used to 


estimate Table 1. Also see “Data for Tables 3 and 4" in the Data Appendix. 


specifications that embody the three essen- 
tial differences between us and Modigliani 


and Sterling. These coefficients are from un- - 


restricted specifications estimated (1) with 


and without the. period 1931-51, (2) in. 


PDL/levels versus in differences without 
lags, and (3) with and without "temporary 
taxes." With one exception, all specifications 
yield significant negative government spend- 
ing coefficients and insignificant tax coeffi- 
cients (of mixed sign), results fully consistent 
with the Consolidated Approach. The one 
exception, (1^), is essentially regression (4) in 
Modigliani and Sterling. 

But Modigliani and Sterling reject (4) as 
unreasonable a priori, because the effect of 
“temporary taxes" on consumption is posi- 
tive and “improbably high.” We fully con- 
cur, noting that the coefficients on tempo- 
rary taxes (TT ) in Table 4 are always large 
and positive. In the face of these positive 
coefficients, however, Modigliani and Ster- 
ling argue for a return to restricted specifi- 
cations such as (1) and (2) in their Table 1, 
in which the net tax restriction works to 
“tame” the temporary tax coefficient so that 


its estimate is (in their judgment) reasonable. 
But, as we show in Table 3, whether “tem- 
porary taxes" are included or not, the MS 
restrictions are resoundingly rejected by the 
data. 

Thus, we prefer to remain ‘agnostic as. to 
what, ‘if :anything, their “temporary . tax" 
variable is capturing and reemphasize the 
results in. Tables 3 and 4.8 Seven of the eight 


* While we area in principle sympathetic to a tempo- 
rary/permanent decomposition of taxes and other 
variables (for example, Seatér and Mariano, 1985), 
Modigliani and Sterling’s temporary tax variable (TT) 
clearly does not yield reasonable results. This may stem 
from certain problematic aspects of the way they mea- 
sure TT. First, note that temporary taxes (TT) —0 
except for the years 1968—70 and 1975. In this regard, 
TT does not include the tax on excess corporate profits: 
that was in force until 1954. Second, during much of the 
period 1968-70, when TT was positive, the business 
Cycle was at a peak. Similarly, 1975 was a recession year 
and TT was negative. Hence TT possibly proxies for 
procyclical determinants of consumption not captured 
by the other explanatory variables. Third, TT is an 
estimate (which we were unable to replicate from origi- 
nal sources), derived under static assumptions, of the 
temporary component of federal income taxes alone for 
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unrestricted specifications shown in Table 4 
firmly reject Modigliani and Sterling’s pre- 
diction of significant negative tax effects and 
zero government spending effects. The one 
unrestricted case that supports their predic- 
tion is not, as they claim, due solely to 
including temporary taxes in the regression. 
Rather, it is the joint consequence of tempo- 
rary taxes, a postwar sample, and PDLs ap- 
plied to levels data. When any of these three 
specification choices.are relaxed, their results 
fail to hold. Moreover, even in the one case 
that appears to support their prediction, the 
temporary tax coefficient is “implausible,” 
causing Modigliani and Sterling to retreat to 
their restricted specification, which we have 
shown to be firmly rejected by the data. We 
do not believe this evidence supports their 
position. 


III. Cointegration : 


Both of our critics take exception to our 


preference for a differenced specification. The . - 


substantive question underlying the choice 
of levels or differences for estimating a time 
series regression is whether its variables are 
cointegrated. We propose to resolve this 
matter by the following Monte Carlo exer- 
cise, which extends the methods.of Engle 
and Granger (1987) and may prove of inde- 
pendent interest. 

. We simulate. (1.1) under both cointegra- 
tion and noncointegration so that our infer- 
ences do not depend on which of the two 
. hypotheses is taken as the null? We take the 





` the years 1968-1970 and 1975. Even granting that the 
specific legislation.in question was intended to be tem- 
` porary, it is far from clear that there were no counter; 
vailing and offsetting: factors in the overall budget pro- 
cess. Did agents really expect that revenues from the 
income tax surcharge of 1968 would not be followed by 
some other "revenue enhancing" legislation when it 
expired? Did the 1975 income tax "rebate" in effect 
offset the windfall profits tax on oil, also enacted around 
1975? Whatever the answer to these questions, we can- 
not seriously entertain a positive coefficient for “tem- 
porary taxes" as measuring the.effect on consumption 
of any tax. r 
?For a discussion of the importance of simulating the 
distribution of test statistics under both the null and the 
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regressors as given,’ and therefore only sim- 
ulate the dependent variable PC under each 
hypothesis. To simulate PC under cointegra- 
tion, we add AR(1) disturbances generated 
from iid Gaussian noise to the fitted values 

from (1.1) estimated in levels form with a 
maximum likelihood. AR(1) correction. We 
simulate PC under noncointegration in a 
similar way, except that we use the fitted 
values. from (1.1) estimated in differenced 
form, again with an AR(1) correction, and 
cumulate the resulting simulated changes in 
PC to obtain a simulated levels series. We 
then regress (OLS) each simulated PC series 
on the actual regressors, both in level and 
differenced form and record the resulting 
Durbin-Watson statistic. Finally, we draw 
inferences by comparing the Durbin-Watson 
statistic computed from the actual data to 
the simulated sampling distributions. 

The results of this simulation exercise are 
shown in Table 5, which shows that the 
observed Durbin-Watson value of 0.69 com- 
puted from the level data is close to the 
simulated median under noncointegration 
(0.63). In contrast, 0.69 is fairly far in the 
left tail of the sampling distribution under 
cointegration, rejecting the null at the 10 
percent level. Hence the observed Durbin- 
Watson value is much more consistent with 
noncointegration. Even more striking, the 
observed Durbin-Watson value of 1.34: de- 
rived from the differenced specification 
is fully consistent with noncointegration 
(p = 0.62), and inconsistent with cointegra- 
tion ( p = 0.001). 

- To summarize, we find no evidence that 
the variables in (1.1) are cointegrated. This 
result contrasis with empirical findings 
(surveyed in Stock and Watson, 1988) that 
income and consumption:are cointegrated. 
Such findings are typically derived. from 
specifications that use (a) disposable income 





alternative in the context of testing for a unit root in 
rea] GNP,.see Kormendi and Meguire (1990). . 

- V We omit retained earnings (RE) from the simu- 
lated regressions because RE appears in the Augmented 
Consolidated “specification only through the definition 
of disposable personal income. The RE coefficient in 
(1.1) is also insignificant and uncontroversial. 
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TABLE 5—SIMULATED DISTRIBUTION OF DURBIN-WATSON STATISTIC (DW) 


UNDER COINTEGRATION AND NONCOINTEGRATION. 





'* Augmented Consolidated Specification 
Sample Period 1930-85 





2000 Monte Carlo Replications 
Estimation Actual ` Percentiles of Simulated Distribution 


Method DW P ^ :Min 5 .10 -. 25 50 75 >° 90 -95° Max 


: i 2 . Simulate Under Cointegration — , E 
Levels . 0.69 010 ` 025 0.62 0.69 0.80 0.93 108 .124 1.34 1.88 


Differences 1.34 0.001 ` 1.26 1.64 174 ° 186 2.03 2.19 ` 234 243 2.82 
E Simulate Under Noncointegration — - i e 
Levels `- 0.69 0.59 0.24 0.35 0.40 0.50 -0.63 0.80 . 0.95 1.05 1.54 
Differences . 134 0.62 . 0.59 0.88 096 110 , 127, 145.. 1.63 1.71 2.23 
Note: P = fraction of simulated values « actual DW. Low values of DW favor noncointegration. "Actual DW” is 


estimated from OLS run on the actual data. The simulated DW are from OLS estimates of (1.1) in levels as well as in 
differences, but with simulated instead of actual consumption and without retained earnings. To simulate consump- 
tion, we first estimate (1.1) over the actual data (in levels and in differences) with a maximum likelihood correction 
for AR(1) residuals, and recover the fittéd values, p and o (standard error of the estimate) from the estimate. We then 
add. z;= pz, it to the fitted values of these regressions, where e, = N(0,o), with (o, c) = (0.798, 0. 0299) under 
cointegration, and (0.492,0.0271) under noncointegration. The first simulated value of level consumption is set equal 
to its actual value in 1930. Each simulated z, series is 155 observations long, with the first 100 values being discarded 


to eliminate the effect of an arbitrary starting value of 0. All calculations were performed using version 3.0 of RATS. 


instead of unrestricted fiscal variables, and 
(b) postwar data only, rather than the longer 
data starting in 1929. Contrary to the claims 
of our critics, our use of a differenced speci- 
fication is supported by the data." 


UModigliani and Sterling argue that differencing 
is inappropriate because their PDL/levels estimates 
*,..reject the hypothesis that rho could. ..equal 1, the 
value implicit in the differenced specification.” How- 
ever, the sampling distribution of test statistics for 
cointegration, including p ‘and the Durbin-Watson 
statistic (DW), are not known when these are computed 
from multiple regressions with PDL variables, estimated 
over small samples. In particular, contrary to Modigliani 
and Sterling’s claim, neither the simulations nor the 
results of Engle and Granger (1987) apply here because 
these were derived for bivariate regressions of non- 
durable consumption on disposable income alone. 
Moreover, the negative first-order autocorrelation in 
Modigliani and Sterling’s equation (4) is an artifact of 
their use of PDLs and the small number of degree of 
freedom: A -Monte ‘Carlo exercise (available upon re- 
quest) analogous to Table 5 shows that adding simu- 
lated white noise to the fitted values of (4) and estimat- 
ing with PDL/levels yields a mean DW of 2.86, the 
value obtained from ‘an OLS estimate of (4). This per- 
haps surprising result is consistent with Johnston (1984, 
equation (8—63)), who shows that given n observations, 
K regressors (with each PDL variable yielding 2 regres- 
sors) and white noise disturbances, E( DW) = 2[(n — 
1)/(n-—k)), or 3.56 in the case of (4). This negative 


` IV. Conclusion 


Feldstein and Elmendorf's results obtain 
only.as the joint result of (a) not using 
available real data on consumption, income, 
and government spending, and (b) failing to 
implement the improved variable definitions 
that emerged from the 1986 debate. More- 
over, and contrary to their claim, neither 
their results nor ours depend in any essential 
way on whether data for the period 1941—46 
are included in the sample. None of our 
1986 findings are overturned in any way by 
including Modigliani and. Sterling’s. pro- 
posed measure of temporary taxes in the 
regressions. Furthermore, the coefficient of 
temporary taxes in unrestricted specifica- 
tions is large and positive. Finally, simula- 
tion tests extending Engle and Granger 
(1987) show that our Augmented Consoli- 
dated specification is not likely to be cointe- 
grated, which supports our choice of estima- 


autocorrelation is also consistent with the overfitting 
suggested by the failure of e to validate PUSH 
(Table 3, row 2b). 
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tion in differences. The results in Kormendi 
(1983) and Kormendi and Meguire (1986), 
that is, significant negative effects of govern- 
ment spending with no significant effects of 
taxes, remain robust to the challenges posed 
by the critics. 

After two rounds of debate, we wish to 


highlight certain methodological contribu- n 


tions of the Consolidated Approach. Under 
what we term the Standard Approach, con- 
sumption is primarily a function of dispos- 
able personal income, that is, Y — TX + TR 
+ RGINT — RE. This approach restricts. the 
effect on consumption of the components of 
disposable income to be the same (except for 
sign) and allows no role for government 
spending. Under the pure Ricardian Ap- 
proach, consumption is a function of “total 
disposable income,” that is, Y — GS, where 
government spending is assumed to repre- 
sent the true resource burden of the govern- 
ment sector on the private sector. Under the 
Consolidated Approach, consumption is a 
function of Y, GS, and TR, each with possi- 
bly distinct effects, thus allowing govern- 
mient spending to substitute for private con- 
sumption and government redistribution of 
income to affect the aggregate propensity to 
consume. 

The key to the tests developed in Kor- 
mendi (1983) was to nest these and other 
hypotheses. by releasing all restrictions on 
the fiscal variables in the consumption func- 
tion. Doing so yields the Augmented Consol- 
idated Approach, in which consumption is a 
function of Y, GS, TX, TR, and RGINT. 
This approach is equivalent to including the 
right-hand-side variables from the govern- 
ments budget constraint (Deficit = TX — 
GS —TR — RGINT) as explanatory vari- 
ables in the consumption function. The esti- 
mated coefficients ‘of the fiscal variables 
can then be interpreted as the effects on 
consumption of deficit- financed marginal 
changes in these variables.!? 

The importance of releasing implicit re- 
strictions on the fiscal variables transcends 


See Charette (1986) for further tests of the stock- 
flow aspects of the Government Budget Constraint in 
the context of Hayashi's (1982) model applied to Cana- 
dian data. 
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the particular time series aggregate con- 
sumption function context in which it was 
originally proposed. In virtually any context 
in which the relation between private con- 
sumption and income is addressed, whether 
it be Euler equations, cointegration tests, 
vector autoregressions, or tests of the sensi- 
tivity of consumption to income innovations, 
one can no longer use disposable personal 
income without question. At minimum, one 
must test, using an unrestricted specifica- 
tion, whether aggregate income, government 
spending, taxes, and transfers each have dis- 
tinct effects on consumption. 


DATA APPENDIX 


This appendix updates and extends the Data Ap- 
pendix in Barth, Iden, and Russek (1986) (henceforth 
BIR) and should be read in conjunction with it. In 
particular, it defines (a) the variables used in (1.4) and 
the rows labeled “Zederal only” in Table 2, and (b) new 
measures of the market value of government debt and 
of real interest paid by the government. All data are 
rounded to the nearest $U.S. 100 million, and real data 
are rescaled into 1972 prices. As POP is rounded to the 
nearest 100,000, all variables are denominated in thou- 
sands of $U.S. per capita at 1972 prices. Unless other- 
wise stated, all da:a are in current prices, and all out- 
standings are measured as of the end of the preceding 
year. Data series or variable names ending in ^F" (real) 
or “FN” (nominal) are measured only over the federal 
government. 


The following abbreviated references are used below: 


BS Business Statistics, a biannual supplement to 
the SCB. 

ERP 1988 Ecoromic Report a the President to the 
Congress. 


FF Flow of Funds Accounts, Financial Assets, and 
; Liabilities, Year-end 1963-86 (Z.1), Table 

“State and Local Government— General 
Funds.” 

FERTW Fixed Reproducible Tangible Wealth in the US: 

1925-85. 

National Income and Product Accounts of the 

United Stetes: Historical Tables, 1929-82, up- 

dated to 1985 using the July 1987 SCB. The 

notation “x.y.z” refers to line number z in 

NIPA Table x. y. 

SA Statistical Abstract of the United States, 1984 
issue. 

SCB Survey of Current Business. 


NIPA 


Data Differing from BIR only by Updates and Revisions: 
CDR, CNDR, CSR, DURR, GSR, NNPR, PNNP 
(latter three variables now NIPA series 1.2.18, 1.9.5, 
and 7.7.3), REN, TRN, and TXN are as defined in BIR, 
except revised and undated as ner NIPA. Population 
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(POP) for 1928-38 is from SA, Table 2, column “Resi- 
dent Population,” and for 1939-85, from ERP, Table 
B-30, col. 1. Data for KN, the total net stock of fixed 
private capital used in the computation of W, are from 
FRTW, Table A13, p. 243, col 2, with updates 
for 1984-85 from SCB, August 1987, p. 100, Table 2, 
row 1. - 


Data for Deflators other than PNNP: Let CDN, CNDN, 
CSN (NIPA, 1.1.3-5) be consumption expenditures in 
current prices on durables, nondurables, and services. 
Let DURN (FRTW, Table A18, Net stock, “Total, all 
types," col. 2, with updates from August 1987 SCB, 
Table 20, col. 1) be the net stock of consumer durables, 
likewise in current prices. Then 


PPC = (0.3CNDN +0.1CDN + CNDN + CSN)/ 
(0.3CNDR +0.1CDR + CNDR +-CSR). 


Also, 


PPCX = Implicit Deflator for Personal Consumption 
Expenditures ( NIPA, 7.4.2) 


Feldstein and Elmendorf’s Data: Actual values for GB, 
GBF, PC, W, and FEPOP (resident population) kindly 
provided by Douglas Elmendorf. We rescale FEPOP 
into 100,000s, and the other data into $1000 at 1972 
prices. Unlike POP, FEPOP does not include Armed 
Forces personnel stationed overseas. 


Data for Tables 3 and 4 

1952-84: Table 3, Modigliani and Sterling (1986). 
Data for government interest (RGINT) 
and the 1948 values of all variables were 
kindly provided by Arlie Sterling. 

The correspondence between variables 
defined in this Appendix (KM) and the 
variables appearing in Modigliani and 
Sterling (1990) (MS) is as follows: 


1931-85: 


MS: A C DE G RGINT T T* TR 
KM: W, EQ PC DEFGS GB RGINT TX-—T* TT TR 


where 
DEF =GS+TR+ RGINT — TX 
EQ = MVLSN/SDIVI 


MVLSN - Market value at year-end of stocks listed on 
the New York Stock Exchange (BS, various 
editions, esp. 1967 (p. 108) and 1986 (p. 77), 
far right column on page). 


SDIVI is defined in the Note to Table A1. 


TT =0 (1930-67, 1971-74, 1976-85); 
=T* from Table3in MS — (1968-70, 1975). 


When TT is included in the regression, TX is net 
of TT. 
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Data for GB, Market Value of Government Debt 

Data with sources other than FF are December values 
for the preceding year. GBN3 (through 1976) and GBN1 
are exactly as described in BIR. 


CMISLN  =Par Value of Credit Market Liabilities 
Issued by State and Local Governments 
(FF, row 13). 

GBN2 — Market Value of Privately Held Gross 


Federal Debt (Cox, 1985, Table 2). 

MBP — Average Market Price of Municipal 
Bonds per Dollar of Principal, 1977-85 
(data courtesy of Douglas Elmendorf). 

SHLTEON =-Par Value of State and Local Holdings 
of Tax-Exempt Obligations ( FF, row 9). 
Assuming these securities to be mostly 
short term, then market and par values 
are approximately equal. 


SLBPN = Par Value of Long Term Tax-Exempt 
State and Local Debt ( FF, row 13). 
GBN4 = CMISLN — SLBPN(1— MBP) 
— SHLTEON (1976-85) 
GBN3 —GBNA(GBN34,/GBNA4,) (1977-85) 
GBN = GBN2+ GBN3 (1943-85) 
= GBN1(GBNy/GBN143) (1929-42) 


Data for Government Interest, RGINT 


FRPROFN = Federal Reserve Bank Profits paid to 
the U.S. Treasury (NIPA, 3.2.7). 
GDIV = Corporate Dividends received by 
F Government ( NIPA, 3.1.18). 
GINTN — Net Interest paid by Government 
f (NIPA, 3.1.13). 
"GINTFN = Net Interest paid by Federal Govern- 
ment ( NIPA, 3.2.22). 
GINTFORN = Government Interest paid to For- 


eigners ( NIPA, 3.1.16). 

P, — Value in period t of whichever of 
PPCX, PPC or PNNP we use to 
construct DIVI. 


PI, = InP + E)/OL Pv) 
DRB = PI X GBN 
DRBF = PI(GBN — GBN3) 


Other Fiscal Data 

GSN = Government Purchases of Goods and 
Services (NIPA, 1.1.18). 

— State and Local Government Pur- 
chases (NIPA, 1.1.22). 

=State and Local Government Pur- 
chases in constant prices (NIPA, 
1.222). 

— Government Subsidies to Businesses 
Net of Current Surplus of Govern- 
ment Enterprises ( NIPA, 3.1.19). 

— That part of NSUBN attributable 
to the federal government (NIPA, 
3.2.27). 

= Federal Government Transfer Pay- 
ments to Domestic Persons (NIPA, 
3.2.19). 

= Government Transfer Payments to 
Foreigners ( NIPA, 3.1.12). 

= Federal Tax Collections ( NFPA, 
3.2.1). 


GSSLN 


GSSLR 


NSUBN 


NSUBFN 


TRFN 


TRFORN 


TXFN 
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TABLE AL —SUMMARY DEFINITIONS OF VARIABLES USED IN TABLES 1 AND 2 
































Column Number from Table 2 
Variable (2 D 6) (4) 
DIVI PPCX x FEPOP Same PPC x POP Same 
GB FE Same GBN/SDIVI Same 
cpm E S GBN — G3N3 

ame SDIVE Same 

GINTN# GINTN# — FRPROFN — DRB— GDIV 
RGINTF ——- Same Same 
DIVI DIVI 
GINTFN# GINTFN# — FRPROFN — DRBF 
RGINTF ————— Same Same 
DIVI DIVI 
GS GSN/DIVI GSR/FEPOP (GSN/DIVI)- GS# (GSR /POP) + GSH 
GSSL* GSSLN/DIVI GSSLR/FEPOP GSSLN/DIVI GSSLR/POP 
PC (0.3DURN+0.1CDN+ PCR/FEPOP PCR/POP Same 
CNDN + CSN)/DIVI 

TR TRN/DIVI Same (TRN + NSUBXA)/DIVI Same 
TRF TRFN/DIVI Same (TREN + NSUBFN)/DIVI Same 
TX TXN/DIVI Same (TXN — FRPROFN)/DIVI Same 
TXF TXFN/DIVI Same (TXFN — FRPROFN)/DIVI Same 
Y NNPN/DIVI NNPR/FEPOP NNPN/DIVI NNPR/POP 


*GSF = GS — GSSL 


Note: The variable definitions under (4) also hold for col. 5 and all of Table 1, except that DIVI = PNNP x POP, and 
W in Table 1 is defined by (A8) in BIR. RE is always REN/DIVI. Deflators are normalized so that 1972 =1. 


Same = same formula as the entry immediately to the left, except that DIVI (SDIVI ) is taken from the 
column in which “Same” appears. 

FE = Feldstein and Elmendorf's actual data, rescaled to a 1972 base; also the case for PC in (1), for 
FEPOP in (1) and (2), and for W in Table 2. 

GINTN# = GINTN — GINTFORN 

GINTFNit: =GINTFN — GINTFORN 

GSN3t = (TRFORN + GINTFORN)/DIVI 

NNPN = Net National Product ( NFPA, 1.9.5). 

PCR =0.3DURR +0.1CDR + CNDR + CSR. 

SDIVI, = (DIVI, — DIVI, .,)/2. 

Charette, Michael F., *Canadian Tests of Di- 
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Tobin's q and the Structure-Performance Relationship: 
Comment 


By JERRY L. STEVENS* 


The .exchange of comments between 
William Shepherd and Michael Smirlock, 
Thomas Gilligan, and William Marshall (this 
Review, December 1986) raised two key 
points that remain unresolved. The first point 
is whether Tobin's q ratio, a firm's financial 
market value divided by replacement cost of 
its assets, is a better measure of firm perfor- 
mance than accounting rates of return. The 
second point of contention is whether supe- 
rior performance, however measured, can be 
attributed to efficiency rather than market 
power. This paper offers further clarification 
_ on both of these points. 

In Section I the performance measure 
choice is shown to be influenced by fun- 
damental, differences between finance and 
economics. In Section II, the structure-per- 
formance model employed by Smirlock, 
Gilligan, and Marshall (hereafter, SGM) and 
Shepherd is shown to be a special case of a 
more general model allowing for a depen- 
dence of the market-share-performance rela- 
tionship on the concentration ratio. The same 
data from the original study by SGM (1984) 
are used in Section III to provide a compari- 
son of SGM’s findings with empirical results 
from an alternative specification of the struc- 
ture-performance model. This comparison 
suggests that attributing superior firm per- 
formance exclusively to efficiency is not well 


founded. Concluding remarks are found in . 


Section IV. 
I. Tobin's q and Accounting Rates of Return 


Shepherd and SGM debated the merits of 
Tobin's q ratio as an alternative to more 


* Associate Professor of Finance, E. C. Robins School 
of Business, University of Richmond, VA 23173. I am 
indebted to William Shepherd for providing data and 
helpful advice. I also thank Manuel Jose and Len 
, Nichols for their comments and encouragement. 
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traditional accounting rates of return, such 
as net income to sales, as a measure of firm 
performance.! Objections to accounting rates 
of return have focused on accounting mea- 
surement errors from arbitrary depreciation 
schedules and expensing of intangible assets 
(for example, advertising and research and 
development).? The truth is that both q and 
accounting rates of return are subject to 
many of the same measurement problems to 
some degree. To test the sensitivity of q and 
accounting rates of return to measurement 
errors, Henry McFarland (1988) used Monte 
Carlo experiments to determine which ac- 
counting mezsure provides the best approxi- 
mation to its * true" measure and found that 
q estimates have smaller average errors than 
accounting rate of return measures. In addi- 
tion, the q ratio was found to have a much 
higher average correlation with its true mea- 
sure. 

A fundamental issue in the q versus ac- 
counting return debate is that profitability 
and value are often used interchangeably as 
performance standards even though they 
each represent a different phenomenon. 
Much of the disagreement on this point re- 
flects a difference between finance and eco- 
nomics perspectives. Research questions for 
economists tend to revolve arouad prof- 
itability and price-costs measures, while re- 


‘Empirical studies in industrial organization tend to 
measure performance with net income divided by sales. 
Earnings before interest and taxes would be more ap- 
propriate than net income for price-cost margins studies 
since differences in financing, depreciatior, and tax 
effects unrelated to pricing and efficiency of production 
affect net incomc. 

For a discussion of accounting rates of return as 
measures of firm performance, see Franklin M. Fisher 
and John J. McGowan (1983), Ira Horwitz (1984), 
William F. Long and David J. Ravenscraft (1984), 
Michael van Breda (1984), and Franklin M. Fisher 
(1984). 
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search questions in finance tend to revolve 
around valuation, where value is determined 
by what investors are willing to pay for 
claims against the firm.? Shepherd has ar- 
gued for accounting rates of return, consis- 
tent with the traditional concern in indus- 
trial organization economics for the firm’s 
ability to extract higher price-cost margins 
from sales. SGM, trained in finance, advo- 
cate Tobin’s g ratio as a performance mea- 


sure since q reflects ex ante financial market . 


valuation of all rents to the firm including, 
but not limited to, both the level and risk of 
future profitability. 


Shepherd found q ratios to be conceptu-: 


ally debatable because, unlike accounting 
profit, q is a phenomenon of capital market 
valuation, not of the firm. But in finance, 
the objective of the firm's management is 
to maximize financia] market valuation of 
the firm. Extensive documentation of the 
semistrong form of efficient security pricing 
suggests that accounting data is fully re- 
flected in stock price valuation.* Clearly, the 
debate is driven by differences in objectives 
represented by profits as opposed to value. 
While the debate tends to break down 
along disciplines, there are reasons why in- 
dustrial organization economists might use q 
in market power studies. Since market struc- 
ture has been shown to affect the risk of cash 
flows (see, for example, Marti Subrah- 
manyam and Starvos Thomadakis (1980) and 
Manuel L. Jose and Jerry L. Stevens (1987)), 
the structure-q relationship. represents a 
broader set of market power influences. For 
example, a firm may use market power to 
lower risk rather than increase the price-cost 
margin. If so, the market value of the firm 
would be enhanced even though there was 
no significant relationship between market 
power and accounting rates of return for 
that period.) Measures of q also discrimi- 


“These differences between finance and economics 
are discussed by Lawrence Summers (1985). 

For a reference to empirical verification of semi- 
strong efficient market pricing, see Thomas E. Copeland 
and J. Fred Weston (1983). 

5Richard Schramm and Roger Sherman (1974) offer 
a more complete discussion of how market power can 
increase the value of the firm through lower risk rather 
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nate well on the basis of profitability. Using 
Monte Carlo techniques McFarland (1988) 
found that q estimates were neither consis- 
tently better nor consistently worse than ac- 
counting rates of return in. detecting supra- 
competitive profits. 


i I. Specification of the Structure-Performance 
Hypotheses 


The following regression model was used 
by SGM and a number of prior researchers, 
including Shepherd (1972) and Bradley T. 
Gale and Ben S. Branch (1982), to analyze 
performance of the ith firm: 


(1) Performance; 
=a + bMS, + cCR, 
+dMSG, + eMBE, 


‘+ fHBE, + derror term,, 


where MS represents firm market share, CR 
is the four-firm concentration ratio, MSG is 
market share growth, MBE is a medium-bar- 
rier to entry dummy variable, and HBE is a 
high-barrier to entry dummy variable. SGM 
found that market share dominates the con- 
centration ratio (b is significant and high 
compared to c) and used this finding to 
conclude that “efficiency,” rather than “col- 
lusion,” is fesponaible for superior perfor- 
mance. 

Shepherd (1986) questioned SGM's con- 
clusion that a finding of significant positive 
market-share-g and insignificant concentra- 
tion-q relationships reflected cost efficiency 
rather than price collusion.’ Shepherd ar- 





than higher profits. Also, see William G. Schwert (1981) 
for a discussion of the potential use of financial valua- 
tion concepts in economics. 

Shepherd and SGM both found that substituting q 
for accounting profits did not change the basic findings 
of market share dominance. 

The SGM assumption that market share’s influence 
on firm performance is due exclusively to efficiency is 
only one of three different interpretations found in the 
literature. At the other extreme, Stephen Rhoades (1985) 
explains positive market-share-performance findings as 
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gued that concentration is not the exclusive 
form of market power and that market share 
“dominance” advantages affect prices. Un- 
fortunately, Shepherd’s attention was di- 
verted away from modeling the joint influ- 
ence of market share and concentration to a 
concern for collinearity between market 
share and concentration. SGM’s (1986) re- 
sponse showed that their results were not 
sensitive to collinearity. 

Rather than focus on collinearity between 
market share and market concentration, the 
sensitivity of SGM's findings to the specifi- 
cation of the market-share-q relationship 
should be tested. SGM's model specified a 
constant market share coefficient as if a firm's 
market share has the same relationship with 
q in a highly concentrated market as in a 
less concentrated market. An alternative 
specification of equation (1) that allows for 
an interaction of market share and concen- 
tration provides a test for the assumption 
that the concentration-q relationship is cap- 
tured exclusively by the concentration ratio 
coefficient of equation (1). 

There is good reason to expect a depen- 
dence of market-share-q relationships on 
market concentration. Dennis C. Mueller 
(1986) has argued that market dominance 
advantages for a firm depend on the firm's 
market share relative to the distribution of 
shares by its rivals. For example, if market 
concentration offers price collusion, ineffi- 
cient firms survive under the pricing um- 
brella and higher profitability will corre- 
spond with higher market shares. In this case 
both collusion and efficiency compete to ex- 
plain the same positive market-share-q ‘phe- 
nomenon even if they are not collinear. De- 
pendence of the market share-performance 
relationship on concentration 0q/0MS de- 
pends on the level of CR. According to this 





rents from lower elasticity of demand due to inherent 
product differentiation. A middle ground is provided in 
the work of Roger Clarke, Stephen Davies, and Michael 
Waterson (1984), where a variety of alternative phenom- 
ena are used to explain market share-performance find- 
ings. : 
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“collusion” argument, the interaction coef- 
ficient would be positive reflecting higher 
premiums for market share when collusive 
pricing in a concentrated industry is present. 
However, more than one interaction hypoth- 
esis is plausible. An “efficiency” interaction 
hypothesis suggests that a firm with high 
market share attracts a premium due to ef- 
ficiency, but that the premium will be lower 
if there is oligopolistic competition with other . 


- large market share firms. This view is consis- 


tent with a positive-market-share-q coeffi- 
cient and a negative coefficient for the (MS: 
X CR) interaction variable. 

The “collusion” interaction hypothesis is 
consistent with Shepherd’s objection to 
SGM's conclusions. Individual coefficients of . 
market share and concentration in the SGM 
model would be biased due to the restriction 
of a zero interaction effect as both market 
share and concentration compete to explain 
the same phenomenon. The "efficiency" in- 
teraction hypothesis ‘is that the market- 
Share-g correspondence would show even 
stronger efficiency effects if market. concen- 
tration's downward bias on the estimated . 
relationship were recognized. These compet- 
ing hypotheses are tis in the following 
model: 


2) q,=a+bMS,+cCR,+dMSG, 
( di i i i 
+ eMBE, + / HBE, 
+ g(MSXxCR); 


+ error term,, 


where all variables are defined as before. The 
SGM specification is a special case of equa- 
tion (2) where the ‘correspondence between 
market share and firm performance is con- 
strained to be a constant with respect to the 
concentration ratio. If the SGM specification 
is correct, the t-test for the interaction coef- 
ficient (g) will not be statistically significant. 
The collusion (efficiency) interaction hypoth- 
esis is supported by a statistically significant 
positive (negative) sign for the market- 
share-concentration (MSXCR) interaction 
coefficient. 
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IH. Reproduction of SGM’s Empirical Work 


To determine whether interaction effects 
alter SGM’s conclusions, regression results 
for equation (1) were obtained from the same 
Shepherd (1972) data panel employed by 
SGM. Tobin’s q was constructed for the 
firms in Shepherd’s data using the proce- 
dures of Eric Lindenberg and Stephen Ross 
(hereafter, L&R) (1981) and SGM (1984).® 
To validate the q-program, constructed q 
values were compared to published q values 
from the L&R paper for the same firms over 
the same time period. Tests for differences in 
means and variances were conducted and the 
null hypothesis of equal means and vari- 
ances could not be rejected. A correlation of 
a 0.98 existed between computed q values 
and the L&R values. These findings verified 
the consistency of procedures used to con- 
struct the q values of this study. 

Table 1 provides a comparison of pub- 
lished regression results from the SGM pa- 
per for equation (1) and the identical model 
estimated from the reconstructed data set 
(RDS).? Results from the reconstructed data 
set are similar to the SGM findings with 
market share dominance over concentration 
when the traditional model of equation (1) is 
employed. 

-Regression results for equation (2) are also 
reported in Table 1. The high level of signif- 
icance for the positive interaction term sup- 
ports the “collusion” hypothesis and mis- 
specification of the SGM model.? A level 


*vValues for q had to be constructed because SGM’s 
data were not made available when requested. However, 
SGM also used the Lindenberg and Ross procedures so 
a validation of the q program was possible. Additional 
summary measures of the reconstructed data set and 
comparisons with the SGM data are available from the 
author upon request. 

?The diagnostics described by David Belsley, Ed- 
ward Kuh, and Robert Welsch (1980) were employed to 
evaluate Shepherd’s concern over potential degrading 
collinearity between variables in the SGM model. Re- 
sults from the PROC REG procedure of SAS (Statisti- 
cal Analysis System software package, version 5) veri- 
fied the SGM conclusion that degrading collinearity was 
not present. 

10The interaction term (MSXCR) is collinear with 
MS and CR, causing the t-tests to be lower on the 
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TABLE 1— SGM AND RECONSTRUCTED DATA SET - 








(RDS) REGRESSION RESULTS? 
Equation (1) Equation (2) 
Independent SGM RDS RDS 
Variables Results Results Results 
Intercept .-2600 | —2010 2.480 
(3.3)? — (193) (1.63) 
MS 0.055 0.070 —0165 
(361? . (5315 (— 2.64)? 
CR 0.009 0.013 — 0.049 
(1.05) (1.15) (—2.52)* 
MSG 2.670 1.270 1.060 
(450^ — (142) (1.27) 
MB . 0261 —0.046 0.429 
(0.90) — (—0.13). (0.429) 
HB 0.380 0.336 0.910 
(0.95) (0.64) (1.78) 
(MSxCR) 0.003 
: (3.84)° 
R? NR 0.437 0.517 
F 17.06 13.99 15.88 





"The dependent variable is Tobin’s q ratio. t-statis- 
tics appear in parentheses. 
Coefficient significant at the 5 percent level. 
*SGM did not report (NR) the R? values for their 
regressions. 


of market concentration greater than 55 
percent, compared to a sample mean of 
65.14 percent, is required before a positive 
market-share-q correspondence is obtained 
(989/9MS —- 0 if —0.165 + 0.003CR > 0). 
Market concentration has a positive influ- 
ence on 4 for firms with market shares above 
16.33 percent below the market share sample 
mean of 23.51 percent (09/0 CR » 0 if 
— 0.049 + 0.003MS > 0).! 

Since multiple regressors appeared in the 
significance tests for concentration and mar- 
ket share, F-tests of vectors of coefficients 
were also conducted. The F-test results sup- 
port the significance of both market share 
and concentration with a computed F of 





individual MS and CR coefficients. Even so, both MS 
and CR coefficients remain significant. 

“David J. Ravenscraft (1983) employed an interac- 
tion term in his empirical work with line of business 
data. Differences in data and variable measurement 
prevent a comparison of Ravenscraft’s results with those 
from Shepherd’s data panel. 
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8.21 for the hypothesis that 0q/0 CR —0 
and a computed F of 23.76 for the hypothe- 
sis that 0g/2 MS — 0, both compared to a 
critical F value of 4.88 at the 1 percent level 
of significance. These results along with the 
significance of the interaction variable in 
Table 1 modify the SGM conclusions. Con- 
centration is no longer “dominated” by mar- 
ket share since a statistically positive mar- 
ket-share-g relationship depends on a high 
level of concentration. It would be wrong to 
conclude that market share dominates when 
rents due to market share cannot be sepa- 
rated from rents due to concentration. The 
SGM “rents to efficiency” conclusion ap- 
pears to rest on a misspecification of the 
structure-performance model. 


IV. Conclusion 


The controversy surrounding the findings 
and methods of Smirlock, Gilligan, and 
Marshall has led to a productive exchange 
of ideas. Differences between single-period 
profits and a financial valuation view of. per- 
formance have been identified and addi- 
tional paths of influence of market power on 
firm performance may now be investigated. 
For example, it would be interesting to know 
how the interaction between market share 
and concentration affects q through system- 
atic risk. If firms with higher market share in 
less concentrated industries are less risky, 
lower q values for such firms would be.due 
to lower returns rather than higher risk. Such 
research may narrow the gap between the 


views of firm performance im economics and 


finance. 

' While Smirlock, Gilligan, and Marshall 
presented their results as proof of the domi- 
nance of efficiency over collusion, the sensi- 
tivity of their results to the specification of 
the regression model has been demonstrated. 
Specifically, positive returns to market share 
have been shown to depend on a high level 
‘of market concentration, suggesting that col- 
lusion and efficiency explanations are not 
clearly separated. Attributing superior per- 
formance to efficiency alone is not substanti- 
ated given the interdependence found be- 
tween, the market-share-q relationship and 
the. concentration ratio. 
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Cooperation, Harassment, and Involuntary 
Unemployment: Comment 


! 


~ Recently A. Lindbeck and D. Snower pub- 
,.lished an interesting paper in the American 
Economic Review where they try to show 
that cooperation and harassment activities of 
insiders toward entrants cause an underbid- 
ding failure. This means that although the 
utility of a job is higher than the utility of 
. unemployment, firms and unemployed work- 
ers are not able to sign a contract which 
stipulates that a particular job is performed 
at less than the prevailing wage. Since insid- 
ers have the power to withdraw cooperation 
from entrants, they are able to make them 
less productive. By harassment activities they 
increase the disutility of work and the reser- 
vation wage of entrants and make them more 
expensive. Withdrawal of cooperation and 
harassment reduces the (potential) prof- 
itability of unemployed workers and may 
allow insiders to enforce nonmarket clearing 
wages. 

In this comment! I will show (i) that they 
apply incompatible assumptions. As a conse- 
quence all insiders are replaced by outsiders 
if they threaten to harass entrants and if 
they set their wages according to the wage 
rule of Lindbeck and Snower. (ii) If harass- 


*Department of Economics, University of Technol- 
ogy, Vienna, Argentinierstrasse 8/175, Vienna, Austria. 

The same notation as in Lindbeck and Snower is 
used. A represents effective work force and € stands for 
the utility ofa worker. f(A) = f(arLr + ag Lg) denotes 
output, a, is the level of cooperation among insiders, 
Ly (Lg) is the number of employed insiders (entrants), 
ag Stands for the level of cooperation between insiders 
and entrants, hg is the harassment activity of an in- 
sider, Hy denotes the aggregate level of harassment 
against an individual entrant. There are upper and 
lower bounds on Hy and a; (ag): 0x Hg < H,1x ay, 
Gg € A. R;(R p) stand for reservation wages of insiders 
(entrants) and R; equals 1 whereas Rg =1+ Hg <1+ 
H. W; (Wg) denotes insider (entrant) wages. Through- 
out this comment it is assumed that the potential work 
force in efficiency units $ is higher than labor demand; 
that is, there is always some voluntary or involuntary 
unemployment. 


By ERNST FEHR* 


ment activities h, are utility decreasing for 
insiders, (80 /8h, « 0) hg > 0 is no credible 
threat and in equilibrium A, =0 prevails. 
(ii) There exist Pareto-improving contracts 
which eliminate harassment and induce in- 
siders to cooperate with entrants. Hence, in 
equilibrium only voluntary unemployment 
prevails. 

If the number of insiders (m in each firm) 
is so large that their marginal product ( Af") 
is below their reservation wage R,-1 
(scenario 1), the work force is reduced until 
Af' — R, (see Figure 1). In this scenario un- 
employed workers are never profitable for 
the firm even if full cooperation and no 
harassment were to occur. Therefore, nonco- 
operation and harassment do not increase 
the market power of insiders; the equilib- 
rium wage is equal to R, and only voluntary 
unemployment may exist. The effective equi- 
librium work force of each firm A, is given 
by the equation Af'( Am) = Af’(A,) =1. 

In case of an intermediate number (m « m 
« m) of insiders (scenario 2) their marginal 
product is higher than their reservation wage: 
Af'( Am) > R; at m. With full cooperation 
and no harassment, outsiders would be 
profitable. In scenario 2 withdrawal of coop- 
eration and harassment activities ensure that 
entrants are never profitable, that is, their 
reservation wage Rẹ (>R,=1 because of 
harassment) is above their marginal product 
agf'(Am) —1f'( Am). Since insiders are able 


_ to make entrants unprofitable, individualistic 
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wage setting by insiders will force the firm to 
pay 


(1) R, «W,—- A'(X) = Af'( Am); 


that is, the profit of the marginal incumbent 
worker is zero. 

It is clear that a wage above Af' would 
induce the firm to fire the insider. Lindbeck 
and Snower denote this no-firing condition 
(W,x Af’) as absolute profitability con- 
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scenario 1 
(n«n«i) (fism) 


scenario 3 scenario 2 


(0«msm) 


3 = An 2 = An 1 = AD 


' FIGURE 1 


straint (APC). In this scenario the effective 


equilibrium work force is given by A, = Am, 
for example À, = Am in Figure 1. 

If the number of insiders is very low (m < 
m, scenario 3) the marginal product of en- 
trants is high enough to render them prof- 
itable despite withdrawal of cooperation and 
harassment’ activities. The marginal insider 
must make himself at least as profitable as 
the marginal entrant; otherwise he will be 
replaced. Lindbeck and Snower denote this 
requirement the relative profitability con- 
straint (RPC). The insider wage according to 
the RPC is lower than Af’ but higher than 
Ry 


(2 R,«W;-A(1-* Hg) < Af'( Am). 


It is obvious that the higher Hj the higher 
W,. Therefore, Hj = H. Entrants receive a 
compensation equal to 


(3  W-Rg-R;tHg-l-H. 


Equation (3) implies that outsiders are 
hired up to the point where f’(A) = f’(A3) = 
1+ H. Thus, A, = Am. In case of m « m the 
number of entrants employed is given as 
Am — Am — agLg — Ly because insiders do 
not cooperate with entrants. In scenarios 2 
and 3 unemployment is involuntary because 
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if outsiders received full cooperation (ag = 
a,= A) and no harassment occurred (i.e., 
identical conditions of employment, ICE), 
they would be willing to work for less than 
the prevailing wage: 


(4a) R,(H,=0) =1<W, (scenario 2) 
(4b) ' Rg(Hg-0) =1<W, (scenario 3) 
I. The Total Profitability Constraint (TPO) 


The TPC requires that it is not profitable 
to replace all insiders. Lindbeck and Snower 
deal with this possibility in fns. 12, 15, and 
17. Henceforth I call this scenario zero. They 
assume in fn. 17 that the upper bound on 
Hy is itself constrained to be below HC: 
0x Hg x H x HC. The value of H is given 
by the requirement that the maximized profit 
in scenario 3, 7;*, is equal to the maximized 
profit in scenario 0, m. From the fact that 
52;*/8H, <0 and 7;*(H°) =a" they de- 
rive the “no-replacement condition" Hg x 
HC. In this section it will be shown that 
v. = v only holds if Hg = H€— 0; that is, 
in the case of Hj» 0 it is profitable to 
replace all insiders. 

According to the assumptions of Lindbeck 
and Snower, entrants are unable to raise 
their productivity through cooperation and 
always receive Ry. Therefore, if only en- 
trants are employed, ag -1, Hj —0, Wg = 
R,=1. Profits are maximized at A) = g(1) 
with g= (f^^. From this, m is given by 


(5) apt = f [801)]- 802). 
Notice that A, is in the range of an interme- 
diate incumbent work force: 4m < Ào < Am 
(see Figure 1). From (2) and (3), it follows 
that À, = Am=g(1+ H) and Lg — Am — 
Am. Therefore, m*¥ can be written as 
(6) mf*-f(Am)-Wimn -WgLgy 

- f [g(1- H)] - A(1*- H)m 

— (14 H)( Am — Am) 
= f(4m) -(1* H) Am 


=f[g(1+ H)]-(1+ H)g(1+ H). 
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Differentiation of m with respect to H 
shows that maximized profit is a decreasing 
function of H: 


(7) Sm*/8H-g|f'- (1 H)] 


— g(1- H) 2 — g(14 H) <0. 


From (5), (6), and (7), it follows that m 
>m if H0. Therefore, in scenario 3 all 
insiders will be replaced.if they set their 
wages according to the rule derived by 
Lindbeck and Snower and if H, =H > 0. Or 
put differently: The. assumption of H, = H 
= HC» 0 (together with monopolistic * wage 
setting by insiders) is not compatible with 
the assumption of 7;* 2 m. 

. The same conclusion emerges for scenario 

2 if the incumbent work force is small enough 
to ensure Am <A, = g(1). Since 7* is given 
by 


(8 /— sf-f(Am)- Af(CAm)m. 
= f(A) - f'(33)À5, 


it is increasing in Àj. If A, = g(1), vj Perros 
but for A, < g(1), m; is lower than m. 
One way out of this dilemma would be the 
. formulation of a different wage determina- 
tion process. Suppose for example that the 
wage is determined in a Rubinstein-bargain- 
ing process (Rubinstein, 1982) between indi- 
vidual workers and the firm. The outcome 
of such a process can be approximated by 
the asymmetric Nash-Solution if the time 
period between offers and counteroffers 
becomes small. (See Binmore 1987; Binmore, 
Rubinstein, and Wolinski, 1986; Hoel, 1986.) 


In scenario 3, f'( Am) 21+ H=W,,; that. 


is, in general it is profitable to employ en- 
trants. Therefore, in equilibrium (with L, > 
0) the marginal product of insiders is equal 
to Af'(Am). According to the asymmetric 


Nash-Bargaining Solution, the division of: 


this cake results from 
(9)  Wr[Af(Am)-W;]' * maxW, 
st. W,>R,=1. 


Insiders want to maximize W, whereas em- 
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ployers maximize the marginal profit of an 
insider, but both are constrained by their 
respective bargaining powers [0 < a <1 resp. 
(1— a)? The outside-option principle 
(Sutton, 1986) requires that the reservation 
wage R, is not considered as status quo-point 
but as a lower bound on W,. 
According to (9) W, is given by 


| lif wAf’( Am) <1 
ae Sen otherwise. 


It is obvious that the RPC is met because 
aAf'(Am) < Af'(Am) = A(1+-H). If the 
TPC is also met, Am insiders and (4m — 
Am) = Ly entrants are employed. 

Applying the asymmetric Nash-Bargain- 
ing Solution to scenario 2, (m >.m) gener- 
ates 


: _ [if aAf'( Am) «1. 
Uo re Sn Am) otherwise. 


The APC is always met; that is, W, is below _ 
Af'(Am) and if the TPC holds, all insiders 
but no entrants are employed. 

If insiders differ with respect to their bar- 
gaining power a, (10) or (10^) determines a 
different wage for each of them. For simplic- 
ity I assume now that a is the same for all 
insiders. It is, however, important to keep in - 
mind that there is no coordination among 
them and that each bargains individual- 
ly. with his. employer. This bargaining 
structure which underlies the analysis of 
Lindbeck and Snower may lead to a viola- 


` tion of the TPC. In scenario 3 the TPC 


requires 
f(Am) 


Using the third line of (6), (10) and 
f'( Am) =1+ H yields 


—Wm — (1 H)( Am Am) > 1. 


(11) (1-a) Af'( Am) m z (mf — v). 


? The value of a (resp. (1 — a)) may be derived from 
the time preferences of the bargaining parties or from 
asymmetries in the bargaining procedure. (See Binmore, 
1987 and Binmore, Rubinstein, Wolinski, 1986 or Hoel, 
1986.) 
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If (11) is violated the firm replaces all insid- 
ers. Since (m — m ) is positive it is easy to 
see that for sufficiently large values of æ (11) 
does not hold. The same argument applies to 
scenario 2 if Am < Ag? 


II. Disutility of Harassment Activities 


According to Lindbeck and Snower, “it is 
usually safe to assume that harassment activ- 
ities are disagreeable to the harassers" (p. 

.171, fn. 7). Nevertheless they assume that 
harassment does not affect the well-being of 
insiders because this assumption “has self- 
evident implications” for their results. I think 
it would have been better to describe these 
self-evident consequences in detail in order 
to enable the reader to make a better judg- 
ment of the relevance of their results. 

If harassment vis-à-vis entrants decreases 
the utility of insiders (8Q//6h', < 0) employ- 
ment and wage decisions change for two 
reasons: (i) Since the contribution of each 
harasser (h/,) to the aggregate level of ha- 
rassment (H, 1.) is likely to be small, it is in 
general not utility maximizing to choose the 
upper bound value Ag max- Even hi, = 0 may 
be an individually rational outcome. (ii) If 
workers cannot commit themselves to Ai > 0, 
they wil not threaten to harass entrants 
because they (and their employers) know 
that it would not be in their interest to carry 
out the threat. 

Suppose that Q' = Q'(W, hg) is the utility 
function. of an insider and that Qi = 
8Q1/8Wi> 0, Qi = 80'/5h',<0.4 The ag- 
Te level of harassment Hg is given as 

25 e: 

pcs that the wage is determined by 
(10) or (10^) and that a’ = a, all i W/ can be 
written as W, = aAf'(Am + Lz). The level 


"In ‘scenario 2 the TPC requires f(Am)— 
aAf'(Am)m z my. Adding Af’(Am)m on both sides 
and using (8), results in (1 — a) Af'(Am)m > (ag* — v). 
(mgt — a) is positive if Am < Ag. 

In order to simplify the presentation, Q} is not 
dependent on the riumber of entrants. It seems plausible 
that (— Q5) increases with Lg. This would strengthen 
the argument below. 

For the rest of this comment it is assumed that the 
TPC is met and that W; = aAf'(A) exceeds Rz. 
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of Lg is derived from f’(Am+ Lg) =1+ 
Hy. Therefore, L, is a function of Hj and 
6L, /8H, = (1/f"). From the maximization 
of Q'[aAf'( Am + Lg( Hg)), hg] with respect 
to h follows 


(12) iaA 4- Qi — 0, 


in case of an interior solution for hê. It is 
obvious that the assumption of Qj —0 im- 
plies hi, = h'g max In general, however, hi, < 
he Em - holds and if Q5 is large enough, even 

=0 may be an individually rational out- 
come: 

.If we allow for binding threats, (12) deter- 
mines the optimal threat level of hù. But if 
we stick to Lindbeck and Snower's assump- 
tion of a noncooperative game, we cannot 
allow for such commitment possibilities. In 
the absence of self-binding threats, hi, > 0 is 
not credible because it can only be imple- 
mented after the firm has chosen to employ 
some entrants. 

Suppose that the employer has chosen Lg 
70 such that À — A, This decision fixes 
the wage at a. Af (As Given this level of A, 
it is never optimal for an insider to execute 
his threat because A.» 0 does not change 
W, but reduces Q'. Since a rational employer 
recognizes the incredibility of hj, > 0, he will 
always choose Ly such that A = 9" and since 
a rational worker can foresee this, he will not 
threaten hj, > 0. 


Ill. Pareto Improving Contracts 


Our argument in Section II implies that if 
harassment is costly for the insiders, unem- 
ployment caused by harassment will vanish. 
In this section we show that even if harass- 
ment is nót costly for insiders the firm can 
always "persuade" insiders to refrain from 
hg > 0. Moreover the same type of contract 
that does away (in equilibrium) with A» 0 
is also capable of inducing insiders to coop- 
erate fully with entrants. 

Let us now consider these contracts in 
more detail: (i) The first contract guarantees 


®This argument holds for any level of Lp 0 such 
that À < Ag. 
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insiders (in a legally enforceable way) a 
slightly higher time-rate wage than they 


would have received through individual bar- 


gaining and demands that they cooperate 
fully with and do not harass entrants. (ii) 
The second contract assumes that the firm 
can observe cooperation and harassment with 
some positive probability v. It makes the 
payment above the level of wages that can 
be achieved through individual bargaining 
(Le., the wage premium) contingent on the 
behavior of insiders; that is, if the firm dis- 
covers noncooperation or harassment, the 
wage premium is not paid. 

We analyze first contract (i). Suppose that 
without this contract a scenario 3 equilib- 
rium prevails,’ that the number of insiders 
is m’<m, and that each insider gets 
aAf’(Am) > Rz. In the case of Hy = 0, each 
firm would employ (A4 — Am’) entrants in- 
stead of (4m — Am’). This would generate 
additional profits of 7,. In Figure 1-7, is 
given by the area (abcd). Applying contract 
(i) the firm guarantees insiders a wage of 

= [aAf'(.Am)- w,] where the premium 
w, satisfies 0 < w,-m' « m,. In exchange for 
w, it demands ki — 0. Under this contract 
. each insider and the firm is obviously better 
off. 

Since W;* is legally znforseablé: the firm 
cannot deviate from this contract and for the 
same. reason insiders have no incentive to 
deviate. In the Lindbeck/Snower model 
(with wage bargaining) insiders want to raise 
their marginal product as much as possible 
because their wage depends on A-f’. The 
lower the number of entrants, the higher is 
aAf’, But with contract (i), wages are in 
some sense independent from Af’. Although 
the actual marginal product at A, is given 
by Af'(A9), insiders can be sure of getting 
W,*. They have no reason to restrict Lg 
through A40. It is important to notice 
that the feasibility of this contract does not 
depend on the observability of harassment 
by the firm. Since the firn knows that 
6Q/8hi. = 0, it also knows that insiders have 
no incentive for harassment. EC 


"The following argument applies also to scenario 2 as 
long as Am < Ap. 
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Deriving the implications of contract (i) 
for the cooperation behavior of insiders is 
now .straightforward. The fitm could make 
additional profits sç (compared with a situa- 
tion where az —1, A= Ào, and H,=0) if 
each insider cooperates fully, that is, chooses 
ai, = ai aae In Figure 1 my is given by the 
area (a'b'cbc^). 

It guarantees to pay [aAf '(Am)-- Wa + 
Wc], where vc» 0, w-m’ = (w, + We): m< 
7, - 7.5 Óm and demands at = at. AS 
before the firm cannot deviate from this 
contract and insiders have no incentive to do 
so. The resulting level of the equilibrium 
work force is then Am. 

One might argue that although contract (i) 
gives insiders no incentive to deviate from 
A, max and hi, = 0, it provides also no-incen- 
tives to stick to these levels. The resulting 
equilibrium will, therefore, be weak.’ This 
problem can be overcome if there exists a 
positive probability + that an insider who 
chooses hi, > 0 or a «a max loses w. And 
that is exactly what can be achieved through 
contract (ii)? Starting from the same sce- 
nario 3 equilibrium as above, the firm offers 
a wage W,* = aAf’(Am)+ w,0 < wm’ < ôr. 

But now insider behavior is observed at no 
cost to the firm with probability 7 > 0" and, 


Sap max is the maximum level of cooperation of an 
individual insider toward entrants. Of course if ai, = 
A, max: all i, ag =A follows. 

"According to Harsanyi (1986, p. 104), an equilib- 
rium is called strong if each layers s (ie., insider’s and 
firm’s) equilibrium strategy is the only best reply to the 
other players’ equilibrium strategies. Otherwise it is 
called weak. Under contract (i) any level of aj, and h'g 
with 0 < a£ € al maxs OS h'g < hb max is a best reply; 
hence the equilibrium i is weak. 

OF course, if ai. and hi, are paih observable 
and verifiable in court and the firm can make the 
payment of w contingent on af = ab max: hg = 0. But 
contract (i) assumes neither perfect observability nor 
verifiability in court. 

The employer's getting this information costlessly 
(with some minimum probability) does not seem to be a 
strong assumption. In most (hierarchical) production 
processes a minimum of vertical supervisioa arises as a 
joint product. Probably there is some degree of horizon- 
tal supervision because entrants obviously dislike ha- 
rassment and prefer to work in a cooperative envi- 
ronment. If thev are harassed or do not receive full 
cooperation, they may complain about it. All that is 
needed for the validity of the argument below is 7» 0 
and w » 0. t and w may be very low. 
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therefore, if af «ak. or hz»0, the 


expected income of an insider is given 
by (raAf'(Am) * (1 — r)|aAf'(Am)-* w]) 
= aAf'(Am)- (1— r)w. In the case of full 
cooperation and no harassment, income is 
aAf’(Am)+ w. It is obvious that each in- 
sider prefers a no-harassment/full coopera- 
tion strategy under this contract. Since w > 0 
they accept such a contract and since da — 
wm’ > 0 firms will offer them. 

There exists an argument against the im- 
plementability of contract (ii), namely, that 
firms may cheat insiders and claim falsely 
that ab < ak max or hi;>0 has been ob- 
served. In order to remove this incentive to 
cheat, the contract may stipulate that any 
premium which is not paid because of non- 
cooperation or harassment has to be dis- 
tributed to other insiders. 

Contract (i) and contract (ii) do away with 
noncooperation and harassment; that is, in 
equilibrium there is no incentive for a < 
Gg, max OF h'g > 0. This induces firms to choose 
the maximum level of A that is compatible 
with their production possibility frontier and 
the disutility of work (without harassment): 
Af(A)=1 of A=Am. Although in this 
model we can never get more employment in 
efficiency units than Am (in this sense À = 
Am is a full employment equilibrium) and 
although in a contract ()—or contract 
(ii) equilibrium any outsider can get a job 
at his reservation wage, we may have invol- 
untary unemployment if we apply the defi- 
nition of Lindbeck and Snower. According 
to their definition, a worker without a job is 
involuntarily unemployed if his reservation 
wage in efficiency units and under identical 
conditions of employment (RIf*/alC*) is 
strictly below the efficiency wage of insiders 
(W,/a,). Assuming for simplicity that a1 
= A, we have RICE/aiC* —1/4. In our con- 
tract (1)—or contract (ii)—equilibrium W, is 
always above 1 because insiders can always 
enforce at least a wage of R,;=1 through 
individual bargaining and get in addition 
w > 0. Therefore, we have W,/a,=W,/A> 
1/A, and hence any worker without a job is 
involuntarily unemployed,’ In our view it is 


™21¢ we assume alCF < A, we may still have W,/A > 
1/aK, 
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more appropriate to compare RÍCE/a ICE = 


1/A with the efficiency wage of entrants, 
Wyz/ag. Without harassment entrants get 
W,=1 and with full cooperation ag A 


(see fn. 8). Thus, implementing contract (i) 


or (ii) leads to an equilibrium with no invol- 
untary unemployment.” 


13We want to stress that our arguments in Section III 
are also applicable to union models of unemployment 
(for example, McDonald and Solow, 1981; Johnson and 
Layard, section 5.1, 1986). These models are implicitly 
or explicitly based on the assumption that all workers in 
a firm have to be paid the same nonmarket clearing 
wage. It is, however, not clear why insiders (union 
workers) should object to the employment of some 
outsiders at a market clearing wage if they are guaran- 
teed their jobs and get a wage premium which gives 
them a higher total income than they would have re- 
ceived through collective bargaining. 
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: Cooperation, Harassment, and Involuntary 
l Unemployment: Reply .. 


i By ASSAR LINDBECK AND DENNIS J. SNOWER* ` 


The basic idea of our (1988) article is that 
involuntary unemployment can occur be- 
cause (a) the experienced, incumbent em- 
ployees, “insiders,” have an incentive to keep 


their wages above the market-clearing level. 


and (b) these insiders are able to achieve this 
(within limits) by imposing their own prefer- 
ences on their firms and on the other work- 
ers, “outsiders,” through the threat of ha- 
rassing and withdrawing cooperation from 
underbidders. The outsiders are involuntar- 
ily unemployed in the sense that their inabil- 


ity to find jobs is due to the circumstance ` 


that they face less favorable working condi- 
tions (in terms of cooperation and harass- 
inent) than insiders do. 

The involuntary unemployment persists 
- because the discriminatory conditions facing 
. the outsiders are not eliminated through ün- 


. For clarity and brevity, we used a particu- 
larly simple model to show how cooperation 
and harassment activities can give rise to 
involuntary unemployment. ‘Our model is 
appropriate to the problems we chose to 
analyze; clearly, if the domain of problems is 
broadened, the model must be extended. The 
simplicity of our analytical framework in- . 
evitably leaves some potentially interesting 
questions unanswered, and Fehr's comment 


raises three of them. Let us consider each in 


turn. 


I. Replacement of Insiders 


|f insiders protect their positions by withhold- 


ing cooperation from underbidders and harass- 


` ing them, then why does the firm not replace _ 


derbidding. In particular, the outsiders may ` 


be disinclined to engage in underbidding 
because they know that if they did so, the 


insiders would “harass” them, thereby rais- . 


ing their. reservation wage. Moreover, firms 
may be unwilling to replace some of their 
insiders by underbidding outsiders, because 


they know that if they did so, the remaining `- 


insiders would not “cooperate” with the un- 
derbidders in the production process, thereby 
reducing the underbidders’ productivity. 

It is also. worth noting that firms may. be 
disinclined to replace -all their insiders by 
outsiders. In our (1988) article we ‘simply 
took this for granted, but Ernst Fehr’s com- 
ment points to the need to provide specific 
reasons why this may be so. As we will show, 
these are not hard'to come by. ': 


*Institute for International Economic Studies, Uni- 
versity of Stockholm, S106 91, Stockholm, Sweden, and 
Department of Economics, Birkbeck College, Univer- 
sity of London, .7-15 Gresse Street, London WIP IPA, 
England, respectively. : 
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all its insiders by outsiders? 


In practice, firms rarely replace their en- 
tire insider work forces, presumably because 
they expect this action to be extremely costly. 
A potentially important reason for this is 
that (i) firms face labor turnover costs, which . 
means that employment activities are associ- 
ated with economic.rent and (ii) the insider 
wage is the outcome of negotiations in which 
insiders capture only some of this rent. Then 
the wage cost of a firm's insider work force 
exceeds that of an entrant work force by no 
more than the labor turnover costs associ- 
ated with the replacement of the insider work 
force. By implication, the firm would not 
find it profitable to implement the “replace- 
ment strategy," whereby all insiders are re- 
placed by new entrants. 

The firm's turnover costs from the irae 
ment strategy may take many forms, but the 
following are particularly relevant to our 
(1988) article: 


— The new work force may be less produc- 
tive than the one- it replaces, because 
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newcomers generally have less on-the-job 
training; for example, they usually have 
not had the opportunity to acquire the 
skills required to cooperate with one an- 
other in team production. 

— Cooperation and on-the-job training are 
often acquired by learning from the ex- 
isting insiders; but if all insiders have 
been fired, the learning process may well 
be lengthy, uncertain, and expensive. 

— The insiders’ harassment activities may 
be expected to continue even if all insid- 
ers have been fired, as fired employees 
are generally capable of performing such 
activities outside the firm. These activi- 
ties may take the form of picket lines 
and social ostracism, as well as other 
forms of industrial and social unrest, 
perhaps even sabotage and violence. They 
drive the reservation wages of the under- 
bidding entrants above those that the 
insiders had when they were employed. 

— There may be a loss of customer good- 
will (associated with a loss of firm rev- 
enue), following the adverse publicity 
which the replacement of entire work 

` forces is likely to attract. 


In addition, of course, firms often face a 
wide variety of labor turnover costs stem- 
ming from job security legislation and union 
activities (such as strikes and work-to-rule 
actions). 

In our (1988) article we attached little 
importance to these considerations, since tlie 
possibility of replacing entire insider work 
forces—especially under conditions of team 
production— struck us as remote. Rather, we 
implicitly assumed that impediments to such 
replacement exist, and we concentrated on 
what seemed to be a practically more essen- 
tial. question, namely, whether firms may be 
unwilling to replace even some of their in- 
siders by underbidding entrants due to the 
insiders' cooperation and harassment activi- 
ties. 

Fehr's comment points to the need for 
giving the replacement strategy some formal 
attention. Our model shows that, under indi- 
vidualistic bargaining, the insiders seek to 
achieve the highest possible wage subject to 
the constraint that no insider becomes un- 
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. profitable to the firm (the "absolute prof- 


itability constraint") and that each insider is 
at least as profitable as the marginal entrant 
(the *relative profitability constraint"). Fehr 
suggests a third constraint, namely, that the 
entire insider work force remain at least as 
profitable as a work force of entrants (the 
* total profitability constraint"). 

Fehr's analysis implies that the total prof- 
itability constraint on the insider wage will 
be binding if all of the following conditions 
are met: (1) fired incumbents do not harass 
the entrants who replace them, (ii) the firms’ 
incumbent work forces are sufficiently small, 
(iii) the insider wage is set unilaterally by the 
insiders, (iv) entrants’ productivity is inde- 
pendent of the number of insiders in the 
firm, and (v) there are no rent-related 
turnover costs other than those associated 
with the insiders' cooperation and harass- 
ment activities. Although this result is cor- 
rect, it is quite unlikely for all these condi- 
tions to be met simultaneously in practice. 
To incorporate the replacement strategy into 
our analysis, these conditions clearly need to 
be relaxed. 

First, as noted, the mere fact that all the 
insiders have been fired does not preclude 
them from harassing the workers who have 
ousted them. Accordingly, let us suppose 
that the new entrants will be harassed by the 
fired insiders,’ so that the entrant wage is 
Wy (1+ H) (where H is the disutility from 
being harassed and the reservation wage in 
the absence of harassment would be Wg —1). 
Then the firm's profit under the replacement 
strategy is 


Q). mt-/[g(14H)] 
—(14*- H):g(3* H).. 


This is equal to the firm's profit (7,*) under 
Scenario III (in which all insiders are re- 
tained and some entrants are hired) and is 


1We did not make this assumption in our (1988) 
article, as the replacement strategy plays no role in our 
formal analysis. In making this assumption now, we 
show that the replacement strategy is unprofitable, while 
our formal analysis remains unaffected. 
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less than its profit (7;*) under Scenario II (in 
which the insiders are retained but no en- 

trants are hired), as given by equations (6) 
and (8) of Fehr’s comment. Thus, under the 
assumption that fired workers are able. to 
engage in harassment activities, the firm has 
no incentive to implement the replacement 
strategy. 

Second, when considering the potential 
profitability of replacement, the assumption 
concerning the division of economic rent 
becomes significant. In-this context, it is 
desirable— as Fehr suggests—to portray the 
wage as the outcome of a bargaining process 
between the firm and its insiders, whereby 
the insiders capture only a fraction a (0 < a 
<1) of the available rent. In that case, the 
firm's profit under the replacement strategy 
(a), given by equation (1) above, is always 
strictly less than «;* and m, and thus the 
insider work force will not be replaced. 

. Finally, in evaluating the replacement 
strategy it is important to observe that the 
productivity of entrants generally depends 
on the number of insiders cooperating with 
them and training them. As more insiders 
are replaced by entrants, the entrants lose 
access to colleagues who can cooperate with 
them and provide on-the-job training, and 
thus the less productive the entrants may 
become. It was unnecessary to incorporate 
this consideration into our model when we 
only analyzed the replacement of marginal 
insiders; but the possibility of replacing the 
entire insider work force makes the matter 
significant: 

To formalize this dependence of entrant 
productivity on insider availability, let L, 
and L, be the number of insiders and en- 
trants in the firm (respectively), a, and ag 
be their respective labor endowments, and 
ai, be thé degree to which the representative 
insider (i) cooperates with each entrant. Then 
the firm’s production function may be re- 
specified? as 


(2) Q- f(ar L, t ag-Lg), 


?In our (1988) article, we assumed that ag =1 when 
a‘, = min(a‘;) and when all insiders are retained. We 
now assume that ap <1 when ai, = min(alL) and all 
insiders are fired. 
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where 


ag pat, Lr), | npo 937 0,. 


and 
ii [min(a5,),0] «1. 


Then the firm's Minployment level in Sce- 
nario: III is 
(ai), ^l 


(3) à= s[(1+ /v[nin 


and its profit i is 


(3a) m= f[A]-(1+ H):A» 0. 


Yet its profit under the replacement strategy 
is ; 


(35) f =f [n[min(az),0] L2] . 


- (14 H)-L£. 


Since p[min(a7),0] <1 it is easy to show 
that m, < m; in other words, it is less prof- 
itable to replace all the insiders.than to re- 
tain them. 

In short, there are a host of good reasons 
why firms do not fire all their insiders if 
these try to prevent underbidding through 
noncooperation and harassment activities. 
Given any of these reasons, the total prof- 
itability constraint becomes redundant, and 
the firm has no incentive to replace the en- 
tire insider work force. 


I. Credibility 


If insiders find it disagreeable to engage in 
harassment, is their threat to harass potential 
entrants credible? 


In our (1988) model, insiders’ harassment 
decisions cannot be implemented until the 
firm’s employment decisions have been made, 
and the employment decisions in turn are 
made only after the insider wage has been 
set. This is the background to Fehr’s point 
that insiders have nothing to gain from ha- 
rassing entrants—their wage and employ- 
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ment has already been determined.? But if 
the harassing activity is disagreeable to them, 
then insiders do have something to lose. In 
that event, the harassment threat is not cred- 
ible: the insiders have no incentive to imple- 
ment their harassment threat, and conse- 
quently the firm will ignore this threat when 
making its employment decisions. 

This argument is merely an artifact -of 
combining the assumption on the disagree- 
ableness of harassment with the one-period 
framework of analysis. For brevity and sim- 
plicity, our (1988) model assumed that agents 
have a one-period time horizon, and we 
sidestepped the credibility problem by sup- 
posing that insiders suffer no utility loss 
from harassing entrants. Yet if the credibil- 
ity problem is to be examined in detail, this 
approach needs to be amended. 

To begin with, it is important to note that 
although harassing may be an inherently dis- 
agreeable activity, the insiders may in fact 
derive positive utility from it when they are 
provoked by entrants engaging in underbid- 
ding.* (Consider, for example, common atti- 
tudes of incumbent workers to so-called 
“scabs.”) In short, whether the insiders gain 
utility or disutility from harassing may well 
depend on whether this activity is a response 
to a threat to insiders' income or job secu- 
rity. In context of our (1988) analysis, ha- 
rassment may give the insiders utility since it 
is assumed to occur only when there is un- 
derbidding. In that event, the harassment 


?Fehr also argues that insiders have no incentive to 
harass entrants when harassing activities yield signifi- 
cant disutility and when each insider makes an insig- 
nificant contribution to the overall level of entrant 
harassment. This argument misses the mark. Our entire 
analysis is based on the assumption that either (a) the 
firm’s team is sufficiently small for each insider to make 
a significant contribution to the overall levels of cooper- 
ation and harassment or (b) the insiders in each firm 
belong to a union which is able to impose its coopera- 
tion and harassment decisions on its members. Clearly, 
in the absence of such an assumption, insiders have no 
individual incentive to engage in discriminatory cooper- 
ation and harassment practices. 

4We failed to clarify this matter in fn. 7 of our 
article, which however was not concerned with the 
credibility issue. Yet setting the record straight does not 
affect the conclusions of our analysis. . 
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threat is credible in our model, even if the 
insider wage is predetermined when the in- 
siders make their harassment decision. 

Furthermore, even if harassing activities 
would always be disagreeable to the insiders, 
the harassment threat may nevertheless be 
credible in a multiperiod setting. In that 
context, the insiders may have a current 
incentive to engage in harassment activities 
in order to establish a reputation in the 
future. This reputation can discourage the 
firm from future hiring: thus the firm's work 
force will be smaller than it would otherwise 
have -been and, by implication, the insider 
wage will be larger. In particular, the insid- 
ers will have an incentive to harass the en- 
trants whenever the disutility from doing so 
is less than the utility from the associated 
rise in the future insider wage. _ 

To make this point formally, let us assume 
that workers maximize their utility over an 
infinite time horizon. As in our (1988) paper, 
our analysis of insiders’ harassment activity 
applies either (a) to an individual insider, 
acting atomistically within a production team 
or (b) to a firm-specific union of insiders that 
is able to impose its decisions on its mem- 
bers. Let T, be the level of harassment of an 
entrant (either by an individual insider or by 
a union of insiders) in time period /, mea- 
sured by the entrant's disutility from being 
harassed in that period. For simplicity, but 
without loss of generality, let the harassment 
level be a discrete variable, so that T,= H 
(where H is a positive constant) when ha- 
rassment takes place and T, = 0 when it does 
not? Let H?,, stand for the firm's expecta- 


SIt is straightforward, though algebraically more 
complicated, to derive our qualitative results from the 
assumption that harassment is a continuous variable. 
Then, as Fehr indicates, the harassment activity level is 
set so that the marginal gain from harassment (via the 
induced rise in the insider wage) is equal to the marginal 
disutility of the harassment activity per se. Observe that 
when the production team is reasonably small (as pro- 
duction teams often are), the marginal gain from harass- 
ment may be quite large; and when the harassment 
activity has been provoked through underbidding, the 
associated marginal disutility may be quite small. In 
that event, the optimal level of harassment may well be 


large. 
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tion of T, and we assume that this expecta- 
tion is formed as follows: 


(4) ; l Hr, =T, 


(which can be shown to be rational in the 
context of our model). 

Thus, when harassment takes place in pe- 
riod t (T,= H), the firm expects. harassment 
to continue in period t +1 (Hy, = H). Then 
the insiders receive the wage W,,,,;= A-(1 
+H) (where the insider-entrant productiv- 
ity ratio is A and the entrant’s reservation 
wage in the absence of harassment is 1). On 
the other hand; if there is no harassment in 
period t (T, = 0), the firm expects no harass- 
ment in the next period either (Hf 70. 
Then the insider wage is W; ,,,— 

"In line with Fehr's assumption that ha- 
rassment is disagreeable to the harassers, let 
_ the positive constant A, be the insider's (or 
union’s) level of harassing activity, measured 
by the disutility of harassing. Clearly, the 
level of harassment ( H) affecting the entrant 
is positively related to the level the harassing 
activity (hg) by the insider (or union); for 
simplicity, let H = 8-h,. Let 6 be the rate of 
time discount. Then, assuming that the firm 
expects harassment in the first period, the 
present value of the insider's (or union’s) 
utility when there is harassment in all peri- 
‘ods is 


(5) PV,-[4-(1- H)— he] 


(1478824 ---) 
=[A-(1+ B-hg) - hg] 
-[17/(1—- 8)]. 


When there is no harassment in any period, 
the present value is 


(5b) PV,=A-(1+H)+4-(8+62+-+-) 

= A- (19 B-hg) * A-[8/(1— 6)]. 
One of these two present values represents 
an optimum for the insider (or union) since 


the present values depend linearly on the 
per-period utility, and thus the insider can 
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always make himself at least as well off by 

always harassing or never harassing than by 

harassing in some periods but not in others. 
The harassment threat is credible iff 


(6a) PV, 2 PV,, 
which implies that 


(6b) hg (8-8-1) 2 0. 

Condition (6b) holds whenever 8 > (1/8). In 
that event, insiders’ harassment threat is 
credible even when the harassing activity 
per se yields disutility. 


HI. Pareto-improving Contracts 


Can Pareto-improving time-rate wage con- 
tracts be found which induce insiders to coop- 
erate fully with entrants and to forego harass- 
ing them, thereby eliminating the involuntary 
unemployment? 


Fehr suggests two contracts which al- 
legedly perform this function. The first con- 
tract grants the insiders a wage premium (w) 
in addition to what they would receive under 
individualistic bargaining (a- f'( A- m),Ó pro- 
vided that they cooperate fully with the en- 
trants and forego harassing them. Fehr as- 
sumes that this contract can. be implemented 
even if harassment (and presumably also co- 
operation) is unobservable to the firm. This 
means that the firm must offer each insider a 
wage of W,*=[a-f'(A-m)+ w], regardless 
of the insider's actual cooperation and ha- 
rassment activities. Then it is easy to see that 
the insiders will cooperate fully with en- 


. trants and avoid harassing them only when 


they have nothing to gain from doing other- 
wise. But then the cooperation and harass- 
ment threats are not credible. Consequently, 
the insiders will not make these threats in 
the first place, and thus the contract is un- 
necessary. 

Now suppose that the insiders do have 
something to gain from implementing these 


Recall that a is a measure of insider bargaining 


power and f’(.4-m) is the marginal product of the 


firm's work force. 
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threats—as when they gain utility from ha- 
rassing or withholding cooperation in re- 
sponse to underbidding, or when they estab- 
lish a reputation that discourages the firm 
from future hiring. Then the contract will 
not be effective in promoting cooperation 
and discouraging harassment, for precisely 
the same reason that the regular time-rate 
wage (in our model) is ineffective. 

The second suggested contract looks more 
promising, at first sight. This contract grants 
each insider the wage Wj* — a-f'(A-m) 
when the firm observes him to have harassed 
entrants or withheld cooperation from them, 
and gives him an additional wage premium 
otherwise (so that his wage becomes W,* = 
[a-f'(A: m) t w,]. The underlying assump- 
tion is that the firm can, with some positive 
probability, observe each insider's cooper- 
ation and harassment activities—observe 
them, in fact, with sufficient. accuracy and 
objectivity so that every insider can agree 
with the firm about the circumstances under 
which a wage reduction for Hiobcagperauod 
or harassment is justified. 

This assumption is quite unlikely to be 
met in practice, and this may be: a reason 
why we do not observe such contracts in the 
real world. Cooperation activities are the 
product of teamwork among workers, and it 
is an inherently impossible task to identify 
an individual's contribution to a cooperative 
‘effort with sufficient objectivity and accuracy 
to use that observation as a basis for wage 
payments. It is difficult to imagine what pro- 
cedures a firm could use to convince work- 
ers, arbitrators, or courts that a particular 
insider had been uncooperative, rather than 
the other insiders and entrants of the team. 
The difficulties involved in observing harass- 
ment activities are, if anything, even greater 
since the final product of these activities (a 


"1t future hiring is discouraged, the insider wage will 
be greater than it otherwise would have been. Contrary 
to Fehr's assertion, the insider wage (W;* — [a-f'(A-m) 
+w]) does depend on the marginal product of the 
firm's work force (f’(A-m)), and thus insiders do have 
an incentive to restrict entry to this work force. 
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rise in the entrant's disutility of work) is not 
open to objective measurement. 

In the absence of accurate and objective 
Observations of cooperation and harassment 
activities by individual insiders, the firm 
could face substantial litigation costs when- 
ever it refuses to pay the wage premium, 
given that the above contract is in force. 
These costs, along with a possible loss of 
customer goodwill may generall be ex- 
pected to deter firms from offering the above 
contract. Moreover, in the hypothetical event 
that accurate and objective observation could 
be made, the above contract may still not 
keep the insiders from their rent-seeking ac- 
tivities, if (as already discussed before) the 
insiders derive utility from withholding co- 
operation or harassing underbidders or if 
they have the opportunity of building repu- 
tations in a multiperiod context. 

It is worth noting, however, that whereas 
the inputs into team production cannot be 
measured objectively in practice, the output 
may be measurable in some cases. Thus out- 
put-related wage contracts appear to be more 
promising devices for inducing insiders to 
cooperate with entrants. Yet such contracts 
also give rise to problems of implementation 
and our (1988) paper discusses some such 
difficulties. Finally, observe that, like Fehr's 
wage contracts, the output-related ones are 
incapable of reducing involuntary unem- 
ployment which arises from insiders' harass- 
ment activities. 

In conclusion, it appears to be very diffi- 
cult to design practically implementable con- 
tracts that bribe the insiders to forego their 
discriminatory cooperation and harassment 
activities, and Fehr has certainly not suc- 
ceeded in doing so. 
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Internal Migration and Urban Employment: Comment 


By ALAN Day HAIGHT* 


Recently in this journal William E. Cole 
and Richard D. Sanders (1985) criticized the 
Todaro migration model and offered a dif- 
ferent approach. A lively and interesting de- 
bate followed, but the Cole and Sanders 
(CS) model was not actually solved. Indeed, 
Michael Todaro (1986, p. 566) suggested that 
the model yielded no unique algebraic solu- 
tion, a charge to which CS (1986) did not 
respond. This note identifies the changes 
needed to provide closure of the CS model. 

Combining all of the [CS] equations on 
the. subsistence sector (Section VI), and in- 
serting a full-employment equation,} 
tains, 


F^ X [W, N, / F(N,,); Pr, Ym» Pop] +G] 








where the notation.is that of CS. (Bars are 
added over characters to indicate 
constants.”) Unfortunately, this is one equa- 
tion in two unknowns: the urban-subsistence 
wage (W,,) and urban-subsistence employ- 
ment (Nps): 


*Instructor, Department of Economics, Bates Col- 
lege, Lewiston, ME 04240, and doctoral candidate, Uni- 
versity of Wisconsin-Madison. 

the full employment equation inserted here is N,, 
+N,=N,, where “N,” represents total subsistence 
population, employed in urban-subsistence and rural- 
subsistence sectors. l 

?To verify which variables are exogenous, see CS 
(1985, notes 29, 35, 41, 42). CS give no equation endog- 
enizing G (hence Gi is used above), but CS do sketch an 
example (note 33) where G is an export base multiple of 
X. If one used a function such as G[ X(W,,, N,,)] in- 
stead of exogenous G, this would not alter ‘the closure 
issue: G would then be an indirect function of the same 
unknowns as X. Calculations are available from the 
author. 
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To close the model, I suggest adding an 
equation requiring that the production func- 
tion for urban-subsistence services be linear: 
F(N,,) =EN s where € represents the aver- 
age (and marginal) product of urban-sub- 
sistence labor. (Evidently CS had both full 
employment? and linear production in 
mind, but omitted specifying: them.) This 
linearity eliminates the troublesome N,,/ 
F(N,,) factor, leaving 


sl 


dox c5 
E]: 
c 


A 


Was 








where the LHS and RHS are the derived 
demand for urban-subsistence labor and 
residual supply of urban-subsistence labor, 
respectively. This reduced form does deter- 
mine W, , closing the model. 


?For their position on full employment and lack ‘of 
barriers to entry, see CS (1985, p. 482, p. 490). 

^CS note that labor is the only input (equation 7) 
and assume that the competitive U—S sector pays labor 
its average product (equation 8). 
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Cooperative and Noncooperative R&D in Duopoly with 
Spillovers: Comment 


By IRENE HENRIQUES* 


Claude d’Aspremont and Alexis 
Jacquemin (1988) employ, a simple yet ele- 
gant symmetric duopoly model of R&D and 
spillovers to compare several equilibrium 
concepts. These concepts include (1) the 
two-stage noncooperative solution, (2) the 
two-stage mixed game,! 9 the two-stage 
fully cooperative solution,? and (4) the social 
planner's optimum.? 

For each of the cases stated above, they 
computed the equilibrium levels of output 
(Q-—4,*4;) and R&D (x,—x,- x) and 
the required second-order conditions. They 
report (i) for large spillovers (i.e, B > 0.5) 
x**>X>X>x* and Q** >Q > Q*- Q and 
(ii) for small spillovers (i.e., 8 < 0.4) x** > X 
>x*>X and O**> O*>6>0, where x 
denotes a firm’s R&D level, Q denotes total 
industry output, ** denotes the social opti- 
mum, ” denotes the fully cooperative model, 
* the noncooperative two-stage case, and 
^the mixed game. f is the spillover parame- 
ter. 

Here we show that comparing the pure 
cooperative and the pure noncooperative so- 
lutions as defined by d’Aspremont: and 
Jacquemin is only meaningful when the 
noncooperative solution is stable, that is, 
when spillovers are not too small. We find 
that, for very small spillovers (in our 
example this occurs when £ <0.17), the 


d'Aspremont-Jacquemin observation holds - 


*Department of Economics, Queen’s University, 
Kingston, Ontario Canada K7L 3N6. The author is 
indebted to John M. Hartwick and Perry A. Sadorsky 
for their helpful suggestions. I would also like to thank 
the FCAR fund of Le Ministére de l'Education du 
Québec for financial support. 

"Here firms cooperate in R&D but remain noncoop- 
erative in output. 

Firms cooperate in both their R&D and output 
decisions. 

A two-stage framework was also employed when 
computing the social optimum. 
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because the noncooperative model is unsta- 
ble. The importance of this result rests on 
the fact that even though the output reaction 
functions cross correctly when f < 0.17, the 
R&D reaction functions do not. When 0.17. 
< B < 0.41, stability obtains but R&D levels 
are higher in the noncooperative case than 
the fully cooperative one. For large spillovers 
the d'Aspremont-Jacquemin result is con- 
firmed. Moreover, we find that the introduc- 
tion of spillovers in the case of the noncoop- 
erative model tends to “promote” stability. 
In the case of the cooperative model, how- 
ever, as the level of spillovers is increased, an 
equilibrium ceases to exist. 


I. Stability and Spillovers 


Following d’Aspremont and Jacquemin 
(1988), we use a linear inverse demand 
schedule for the “industry” 


(1) 


such that a,b » 0 and q4- 4, x a/b. Pro- 
duction costs are given by 


Q) Ggs x; x;) = (A aa Bx;)qi» 
i,j-1,2 
where 0 < A <a, A» x, + Bx; 0 x B «1 and 
B is the spillover parameter. The cost of 
installing x; are (y/2)x? (i=1,2). Symme- 
try results from assuming that the players 
have the same costs. 
Firm i’s profits are given by 
(3) I= [a -b(aq* 4:)]4 


-[4-x- Bx] a (4/2) xi. 


p-a-b(a*d). 


i* J, 


*When f =0, we have the Barbara J. Spencer and 
James Brander (1983) two-stage duopoly R&D game. 
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The model described above is employed in 
two different games. In the first, firms act 
noncooperatively in both output and R&D. 
In the second game, firm behavior is mod- 
eled as a two-stage game played by two firms 
who fully cooperate in both their output and 
R&D decisions.’ 

Stability in duopoly involves reaction 
functions crossing “correctly.”® In the model 
described by d'Aspremont and Jacquemin, 
the stability requirements must apply for 
both choice variables (namely, output and 
R&D levels). In other words, in R&D and 
output games, one must reduce the system to 
reaction functions in either output or R&D 
space. 

The plausible approach, therefore, is to 
work back to the first stage (R&D stage) and 
analyze the reaction functions in R&D space. 
Before working back to the R&D stage, 
however, let us first look at the reaction 
functions in output space (equation (4)), tak- 
ing R&D levels as given (i.e, x; > 0). 


a-bq,- A*x;tfx, — 
()  «- ee 


These output reaction functions will always 
cross "correctly" in our example because 
0<A<a, x, t Bx, S A, b>0 and R&D 
levels are given. 

In R&D space, however, we employ the 
following reduced form reaction functions: 


(5 x 


20-8Y — 
9b Y 


, 


where i, j —1,2 and i+ j. The numerator of 
(5) is the second-order condition in the R&D 
stage that must be less than 0 for an equilib- 
rium. These reduced form reaction functions 
cross "correctly" when |dx,/0x,| is less than 


5The solutions to these games can be found in 
d'Aspremont and Jacquemin (1988). 

SSee Jesus K. Seade (1980) for an analysis of stability 
in one-stage Cournot duopoly games. 
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1. Differentiating (5) with respect. to x, 
yields? 


-2(2- B)28 —1)/9b 


2(2— g)* 
3» 7 


(6) 


Plotting the reduced form reaction func- 
tions in R&D space gives us a further 
dimension with which to examine the 
d'Aspremont and Jacquemin results. Figure 
1 depicts six cases. The first three cases 
represent the small spillover situations, while 
the last three represent the large spillover 
cases.? 

From Figure 1 (cases 1 through 3), we 
observe that the introduction of spillovers 
leaves the model unstable for 8 «3/2 
— 7/2 = 0.17 and stable for 0.17 < B «1.0. 
The existence of instabilities can lead to 
corner solutions for which specialization in 
R&D production is implied.? Note, however, 
that for B 7 0.6 an equilibrium cannot be 
obtained in the fully cooperative model.'? 
Furthermore, in the case of the noncoopera- 
tive two-stage model, we observe that al- 
though the second-order conditions are satis- 
fied when f < 0.17, stability is not assured.!! 


?The requirement that |dx;/dx,|<1 is exactly that 
used by Seade (1980). In our example this translates as 
follows: : 





«1. i, j=1,2 and i+ |j, 





where g,(-) = [a ~ b{q(x1, x2) + (42x. x2))] 
4, X2) - [4 — x; — Bx iG x2) - (72x? i j= 
L2ij. 

*These reduced reaction functions are, in general, 
given by x = x10, 405, X2) 4205, x2)) and x)= 


Xa (X BOG: x2); 4309 x2). 


9See John M. Hartwick (1988) for further details. 

19 This point is important since it imposes restrictions 
on the values of B, y, A, a, and b that one can employ 
to satisfy all the equilibrium concepts discussed in 
d'Aspremont and Jacquemin. 

In the fully cooperative case, the second-order con- 
ditions, (1+ 8)2/2— 2y < 0, are satisfied for all B < 0.6. 
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CASE! B=0.10 CASE 2 B=0,17 
4 ; 


fa (xı) 


S f gl 





CASE 4 B=0.50 








FIGURE 1. REDUCED FORM REACTION FUNCTIONS 
IN R&D SPACE 
ASSUMPTION 1: Inverse demand: p a — b(:), 
a —10.0, b 1.0 
Production costs: (4- xo BXx,)4, A=7.0, B < 0.41 


R&D costs: (y/2)x2, y =1.0 i, j=1,2 i*j 


Hence it is necessary to set the proper pa-. 
. rameter restrictions according to the relevant - 


stability and existence requirements of each 
model before one can compare both models 
in either a qualitative or quantitative man- 
ner. 

From cases 4 through 6, we observe that 
under large spillovers the slopes of the 
reduced form reaction functions change. Al- 


.though these reaction functions cross “cor-. 
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rectly,” R&D levels (x, and x,) are posi- 
tively related under large spillovers rather 
than being negatively related as we observed 
under small spillovers. How. does this relate 
to the d'Aspremont and Jacquemin result in 
the case of large spillovers? Intuitively, the 
introduction of large spillovers tends. to 
counter the negative monopolistic effects ob- 
tained in a fully cooperative model such as 
reduced activity levels relative to competitive 
models. In the case of the noncooperative 
model, as the level of spillover rises, firms 
tend to "free-ride" on the other firm's 
knowledge.” 


12The reader cen observe this “free-riding” charac- 
teristic by noting that in the noncooperative model: 
firms see their levels of R&D falling, while the fully 
cooperative firms see their R&D levels rise as the 
spillover parameter incréases. 
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Cooperative and Noncooperative R&D 
in Duopoly with Spillovers: Erratum 


By CLAUDE D'ASPREMONT AND ALEXIS JACQUEMIN* 


In our article published in this Review 
(volume 78, no. 5, December 1988, pp. 
: 1133-37), we have shown that in the pres- 
ence of sufficient spillovers of the R&D ben- 
efits, duopolists, cooperating in R&D but 
not in output, spend more on R&D than 
noncooperating firms at both stages, and 
also produce more output, closest to the 
socially optimal level. A second symmetric 
result is that for small spillovers, duopolists 
cooperating neither in R&D nor in output 
spend more on R&D and produce more 
output than cooperative firms. However, this 
result has been obscured by an obvious in- 
equality inversion and other typos implying 
a modification in the conclusions. Indeed in 
fn. 13 on p. 1135 one should have that 
X» x* iff 


(14 8) " 
4by —(14- B)? 


(2-8) 
45by —(2-B)(1+8) 


or B > 0.41. 


This implies that the classification in ` 


fn. 16 on p. 1137 should be changed to 


x**>x*ž 2> Â; Q**>Q*>O>O. 

Therefore, “for some spillovers, such that 
B x 0.4, the classifications are different and 
the ‘second-best’ for R&D is obtained by 
a noncooperative behavior in both stages” 
(p. 1137, lines 5-8). 

These results have now been generalized 
to a wide class of oligopoly models (K. Suzu- 
mura, 1989); the stability conditions of the 
solutions have been established (I. Hen- 
riques, this issue); and the effects of various 


*Université Catholique de Louvain, 1348 Louvain- 
la-Neuve, Belgium. i 
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rates of research spillovers have been ex- 
plored (N. Vonortas, 1989). All these papers 
emphasize the crucial role played by the 
spillover parameter 8 and, implicitly or ex- 
plicity, suggest the usefulness of endogen- 
izing it. Three main types of factors 
corresponding to the three dimensions of the 
relevant game can influence the value of f. 

First, there is the nature of the research: 
a priori, results of precompetitive, generic 
research are less easily appropriable and 
therefore lead to more spillovers than those 
of specific, applied development activities. 

Second, the nature of the product is im- 
portant with the usual distinction between 
homogeneous and differentiated goods that 
could lead to different solution concepts such 
as Cournot and Bertrand: a priori, spillovers 
are superior in the former case, given that 
the more standardized is a product, the eas- 
ier it is to embody in it the results of R&D. 

Third, the nature of the contract and the 
degree of perfection in information also af- 
fect the rate of spillovers. On one side, these 
spillovers are higher for members of the co- 
operative agreement than for noncooperative 
firms. On the other side, within the coopera- 
tive group itself, 8 can vary according to the 
organizational arrangement, fi—1 corre- 
sponding to perfect communication and uti- 
lization of the resulting information, for ex- 
ample, through integrated laboratories. 

In terms of empirical verification and pub- 
lic-policy decisions, it is therefore crucial to 
extend the analysis of the welfare effects of 
cooperative R&D by taking into account the 
main determinants that can modify, at one 
momení of time and over time, the level of 
the corresponding externalities. 
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Independent Auditor’s Report 


Executive Committee 
The American Economic Association 


We have audited the accompanying balance sheets of The American Economic Associa- 
tion as of December 31, 1989 and 1988, and the related statements of revenues and 
expenses, changes in general fund and restricted fund balances and cash flows for the years 
then ended. These financial statements are the responsibility of the Association’s manage- 
ment. Our responsibility is to express an opinion on these financial statements based on our 
audits. 

We conducted our audits in accordance with generally accepted auditing standards. Those 
standards require that we plan and perform the audit to obtain reasonable assurance about 
whether the financial statements are free of material misstatement. An audit includes 
examining, on a test basis, evidence supporting the amounts and disclosures in the financial 
statements. An audit also includes assessing the accounting principles used and significant 
estimates made by management, as well as evaluating the overall financial statement 
presentation. We believe that our audits provide a reasonable basis for our opinion. 

In our opinion, the financial statements referred to above present fairly, in all material 
respects, the financial position of The American Economic Association as of December 31, 
1989 and 1988, and the results of its operations and its cash flows for the years then ended, 
in conformity with generally accepted accounting principles. 


Certified Public Accountants 
March 2, 1990 
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THE AMERICAN ECONOMIC ASSOCIATION BALANCE SHEETS FOR THE YEARS | 
. ENDED DECEMBER 31, 1989 ANp 1988 


Assets 


CasH 
INVESTMENTS, at market (Notes A and B) 


Accounts RECEIVABLE, no allowance for doubtful 
accounts considered necessary 


INVENTORY or Index of Economic Articles, at cost 
PREPAID EXPENSES 


OFFICE FURNITURE AND EQUIPMENT——at cost, less 
accumulated depreciation of $85,151 (1989) and 
and $77,721 (1988) : 


Liabilities and Fund Balances 


ACCOUNTS PAYABLE AND ACCRUED LIABILITIES 


' DEFERRED REVENUE (Note A) 
Life membership dues 
Other membership dues 
Subscriptions ; 
Job Openings for Economists 


ACCRUAL FOR SURVEY or Members (Note A) -. 


Funp BALANCES: 
General 
Net worth . 
Restricted (Note A) 


Total Fund Balances 


See notes to financial statements. 
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1989 1988 
$ 549991 $ 9227 
5,756,069 4,968,804 

52,189 154,146 
205,395 178,659 
31,448 22,673 
59,774- 62,236 
$6,754,866 . $6,358,790 
$ 600,843 $ 538268 - 
-33,890 36,570 
. 715403 — 649,297 
^ 512102 548,611 
24241 23.727 
1,290,636 1,258,205 
159,051. 271,918 
2,641,042 3,966,108 
63,294 318291 
4,704,336 - 4,284,399 
$6,754,866 $6,358,790 
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THE, AMERICAN ECONOMIC ASSOCIATION STATEMENTS OF REVENUES AND EXPENSES 
. FOR THE YEARS ENDED DECEMBER 31, 1989 AND 1988 





1989 1988 
REVENUES FROM DUES AND ACTIVITIES: : 
Membership dues and subscriptions $1,048,752 $ 943,758 
Nonmember subscriptions 759,282 756,611 
Job Openings for Economists subscriptions ` 37,498 34,111 
Advertising 134,428 128,493 
Sale of Index of Economic Articles 90,695 175,596 
Sale of copies, republications, and handbooks 39,780 44,136 
Sale of mailing list 56,591 . 49,268 
Annual meeting 83,426 70,983 
Sundry (Exhibit I) > 835296 > 92,925 
Operating Revenues 2,333,748 2,295,881 
PUBLICATION EXPENSES: : 
American Economic Review 743,155 708,580 
Journal of Economic Literature 950,835 911,429 
Survey of Members (Note A) 70,000 70,000 
Job Openings for Economists 62,440 58,666 
Index of Economic Articles 50,272 96,399 
Journal of Economic Perspectives 393,262 361,079 
2,270,564 2,206,153 
OPERATING AND ADMINISTRATIVE EXPENSES: 
General and administrative: 
Salaries 205,473 ' 176021 
Rent 22,001 20,388 
Other (Exhibit II) 214,814 f 228,969 
Committee 63,894 54,810 
Annual meeting ' 6,608 - (08 7,544 
512,790 487,732 
Operating Expenses 2,783,354 2,693,885 
Operating Deficit ( 449,606) ( 398,004) 
INVESTMENT INCOME RECOGNIZED (Note B) 297,054 - ' 280,512 
EXPENSES IN Excess or REVENUES: : ($ 152,552) ($ 117,492) 





See notes to financial statements. 
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THE AMERICAN ECONOMIC ASSOCIATION STATEMENTS OF CHANGES IN GENERAL Funn BALANCE 


Balance at January 1, 1988 

Add change in market value of 
investments 

Deduct expenses in excess of revenues 


Balance at December 31, 1988 

Add change in market value of 
investments l 

Deduct expenses in excess of revenues 


Balance at December 31, 1989 


See notes to financial statements. 


Total 


$3,898,027 


185,573 
(117492) 


3,966,108 


827,486 
152,552) 


$4,641,042 


Operations 


$2,305,091 


( 117,492) 


2,187,599 


( 152,52) 


Market 
Value 
Adjustments 
$1,592,936 


185,573 


1,778,509 


827,486 


$2,605,995 


THE AMERICAN ECONOMIC ASSOCIATION STATEMENTS OF CHANGES IN RESTRICTED FUND BALANCE 


Balance at 
January 1 


Receipts 


Disburse- 


ments 


Balance at 


December 31 





YEAR ENDED DECEMBER 31, 1988: 

The Alfred P. Sloan Foundation, 
Ford Foundation, and Federal 
Reserve System grants for 
increase of educational opportunities 
for minority students in economics 

The Minority Scholarship Fund for 
minority students applying for 
graduate work in economics 

The Andrew W. Mellon Foundation, 
and Alfred P. Sloan Foundation 
grants to study economic 
graduate education in the United States 

Sundry 


YEAR ENDED DECEMBER 31, 1989: 

The Alfred P. Sloan Foundation, 
Ford Foundation, and Federal 
Reserve System and Rockefeller 
Foundation grants for the increase of 
educational opportunities for minority 
students in economics 

The Andrew W. Mellon Foundation, 
Alfred P. Sloan Foundation and 
National Science Foundation grants 
to study economic graduate 
education in the United States 

The Olin Foundation grant 
to confer on intellectual 
property rights 

The Minority Scholarship Fund for 
minority students applying for 
graduate work in economics 

Sundry 


$107,002 


5,000 


2,893 
$114,895 





$188,838 


115,960 





$288,965 





$115,136 


178,886 


12,500 





$207,129 


84,040 





$291,169 


$235,786 


317,322 


3,011 


6,500 





$562,619 





$138,838 


5,000 


115,960 
8,493 


$318,291 





$ 68,188 


( 22,476) 
9,489 
5,000 
3,093 


$ 63,294 
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THE AMERICAN ECONOMIC ASSOCIATION STATEMENTS OF CAsH FLOWS 
FOR THE YEARS ENDED DECEMBER 31, 1989 anp 1988 


1989 1988 
CASH FLOWS FROM OPERATING ACTIVITIES: 
Receipt of membership dues and subscriptions $2,468,136 $2,230,521 
Disbursements to suppliers and employees ( 2,853,839) | ( 2,557,537) 
Receipts of restricted funds ' 307,622 494,565 ` 
Disbursements made from restricted funds ( 562,619)  ( 291,169) 
Net CASH USED IN OPERATING ACTIVITIES € 640700  ( 123,20) 
CASH FLOWS FROM INVESTING ACTIVITIES: UC 
Purchase of office furniture and equipment (18,856) (11,584) 
Purchases of investments (3,500,195) — ( 6,459,995) 
Proceeds from sale of investments 2,712,930 6,251,160 
Proceeds from interest and dividends on investments 1,124,540 466,085 
Net Cash PROVIDED Bv INVESTING ACTIVITIES 318,419 245,666 
NET (DECREASE) INCREASE IN CASH —— ( 322281) 122,046 
Cash at Beginning of Year 972272 850,226 
Cash at End of Year ` $ 649,991 $ 972,272 


RECONCILIATION OF EXPENSES IN EXCESS OF REVENUES 
TO Ner CAsH USED IN OPERATING ACTIVITIES: 
Expenses in excess of revenues l ($ 152,552) ($ 117,492) 
Adjustments to reconcile expenses in excess 
of revenues to net cash used in 
operating activities: 


Depreciation 21,318 15,868 
Changes in assets, liabilities, and fund 
balances: 
Decrease (increase) in accounts receivable 101,957 ( 123,071) 
(Increase) decrease in inventory ( 26,736) ` 27,299 
Increase in prepaid expenses : ( 8775) ( 3,843) 
Increase in accounts payable and 
accrued liabilities 62,575 27,419 
Increase in deferred revenue 32,431 57,711 
(Decrease) increase in accrual for 
Survey of Members ( 118,867) 69,605 
Investment income recognized ( 297,054) ( 280,512) 
(Decrease) increase in restricted fund 
balance ( 254,997) 203,396 
"Total adjustments ( 488448 ( 6,128) 


NET Casu USED IN OPERATING ACTIVITIES ($ 640,00 ($ 123,620) 


See notes to financial statements. 
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The American Economic Association Notes to Financial Statements for the 
Years Ended December 31, 1989 and 1988 


A. Summary of Significant Accounting Policies . 


Investments are accounted for on a market value basis. The' investment income recognized is modified to reflect 
only the Association's approximate historical average rate of return, which is currently 5%. Investment income 
represents 5% of the total cash and market value of investments at the beginning of the year. The change in market 
. value of investments and dividends and interest earned net of investment income recognized is recorded directly to 
the general fund. : 


Accrual for Survey of Members. Every three to five years the Association publishes a survey which lists, among 
other things, the names and addresses of its membership. This survey was most recently published in 1989 and 
distributed at no cost to the membership. In order to properly match the publishing cost of this survey with revenue 
from membership dues, the Association provided $70,000 in 1989 and in 1988 for estimated publishing costs which 
will reduce actual survey expenses. in the year of publication. 


_ Deferred revenue represents income from membership dues and subscriptions t to the various periodicals of the ` 
Association which are deferred when received. These amounts are then recognized as income following the 
distribution of the specified publications to the ‘members and subscribers of the Association. Income from life 
membership dues is recognized over the estimated average life of these members. 


The American Economic Association files its federal income tax return as an educational organization, 
substantially exempt from income tax under Section 501(c) (3) of the Internal Revenue Code. As required by 
Section 511(a) of this Code, the Association provides for federal income taxes on certain revenues which are not 
substantially related.to its tax exempt purpose. This “unrelated business income” includes income from advertising 
and the sale of mailing lists. The Association has been determined to be an organization which is not a private 
foundation. ; 


Certain restricted funds are administered on a | reimbursement basis; therefore, disbursements are allowed prior 
to receipt of grant proceeds. , 


Certain reclassifications have been made to the 1988 amounts in order to conform to the 1989 presentation. 
B. Investments and Investment Income 


Investments consist of: 





December 31, 1989 ' December 31, 1988 























l Cost ` Market Cost - Market 
Government obligations, bonds, ma RC 
. andcommercialpaper  , ~ $ 586,999 $ 617,179  ...$ 922,760 $ 954,291 
Mutual funds m 4,578,432 5,138,890 4,004,113 4,014,513 
: ae $5,165,431 $5,756,069 $4,926,873 . $4,968,804 
Investment income recognized consists of: 
` . Year Ended December 31, 
ME ' he g 1989. ' 1988 
Government obligations, bonds, and "m ‘ f 
commercial paper—interest $..92,817 $ 90,847 
Corporate stocks and mutual . i l 
funds—cash dividends 471,922 218,198 
Corporate stocks and mutual . i 
funds—net gain on sale 11,094 15,792 
Change in market value ] i 548,707 . 141,248 
Transfer to general fund, net ( 827,486) ( 185,573) 








Investment income recognized, net . $297,054 $280,512 
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C. Retirement Annuity Plan 


Employees of the Association are eligible for participation in a contributory retirement annuity plan. Payments 
by the Association and participating employees are based on the employee" s compensation. Benefit payments are 
based on the amounts accumulated from such contributions, The total pension expense was approximately $42,000 
for both 1989 and 1988. 


` D. Ratio of Net Worth to Expenses 


The ratio of net worth at December 31, 1989 to 1990 budgeted Senia is 1.49 and the ratio of net worth at 
December 31, 1988 to actual 1989 expenses is 1.42. 


ExHIBIT I—THE AMERICAN ECONOMIC ASSOCIATION STATEMENTS OF SUNDRY 
'. REVENUES FOR THE YEARS ENDED DECEMBER 31, 1989 ann 1988 

















1989 1988 

AER submission fees $49,855 $45,860 
Royalties 27,833 29,062 
CSWEP membership dues 4,721 ; 16,280 
Donations 368 529 
Permission to reprint ; 327 ' 200 
Foreign postage : . 192 290 
Miscellaneous income - 704 

$83,296 $92,925 





Exuisir II—THE AMERICAN Economic ASSOCIATION STATEMENTS OF OTHER 
GENERAL AND ADMINISTRATIVE EXPENSES FOR THE YEARS ENDED 
‘DECEMBER 31, 1989 anp 1988 











1989 . 1988 
Dues and subscriptions ` $ 594183 -— $ 66,405 
Mailing list file maintenance ` 33,446 : . 37,532 
Postage | 26500 — 18,371 
Depreciation (straight-line method) . 21,318 15,868 
Accounting and legal 20,292 20,222 
Investment counsel and custodian fees 16,801 18,445 
Office supplies : 11,547 7,816 
President and president-elect expenses 7221 6,000 
. Insurance and miscellaneous : 6,722 6,302 
Election expenses 5,936 . 10,131 
Telephone . 4,479 4,298 
Bank chárges 713 12,207 
Travel 649 5,372 
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The Origins of American Industrial Success, 1879-1940 


By Gavin Wricut* | 


The United States became the world’s preeminent manufacturing nation at the 
turn of the twentieth century. This study considers the bases for this success by 
examining the factor content of trade in manufactured goods. Surprisingly, the 
most distinctive characteristic of U.S. manufacturing exports was intensity in 
nonreproducible natural resources; furthermore, this relative intensity was increas- 
ing between 1880 and 1920. The study then asks whether resource abundance 
reflected geological endowment or greater exploitation of geological potential. It 


was mainly the latter. (SEL 042) 


Recent thinking about American eco- 
nomic performance has been marked by 
alarm over the country’s loss of its “compe- 
titive edge.” Most of this discussion is not 
rooted in an understanding of the histori- 
cal origins of the economic leadership 
now thought to be in jeopardy. Modern 
economists tend to assume that the Ameri- 
can advantage has been technological and 
dates from the remote recesses of history, 
about as far back as anyone really cares to 
go. In a volume on U.S. competitiveness, 
Harvey Brooks writes: “Both our firms and 
our government, long accustomed to being 
the technological leaders in almost every field, 
have until recently measured their perfor- 
mance against domestic rather than foreign 
competitors” (Bruce R. Scott and George 
C. Lodge, 1985, p. 331; emphasis added). 
For one country to maintain a technologi- 
cally based advantage over others for long 
historical periods is anomalous, and surely 
calls for explanation. Indeed, it is difficult to 
see how policies can respond appropriately 


*Department of Economics, Stanford University, 
Stanford, CA 94305. This research was supported by 
the Center for Economic Policy Research at Stanford, 
and the paper was written while in residence at the 
Institute for Advanced Study in Princeton, NJ. Those 
who have offered useful advice are too numerous to 
list, but the author is particularly grateful to Paul 
David for suggesting and encouraging the project, and 
to Stanley Engerman, Alex Field, Albert Hirschman, 
Nate Rosenberg, Gary Saxonhouse, and Robert Staiger 
for specific suggestions. Research assistance by David 
Green and Jeff Sundberg was much appreciated. 
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to *what we have lost" without knowledge 
of what it was that we had and how we got 
it. It would be an understatement, however, 
to say that the subject has been understud- 
ied. This paper makes a modest beginning 
by analyzing American trade in manufac- 
tured goods between 1879 and 1940. The 
competitive success of American manufac- 
turing exports in foreign markets is by no 
means a comprehensive measure of 
"success." But because the turn of the cen- 
tury marked the emergence of the United 
States to a position of world economic pre- 


-eminence, we may hope to learn something 


about the broader questions by studying the 
characteristics of the country's trade with 
the rest of the world during that key era. 
The results are surprising. They suggest 
that the single most robust characteristic of 
American manufacturing exports was inten- 
sity in nonreproducible natural resources. 
In fact, their relative resource intensity was 
increasing over the half-century prior to the 
Great Depression. This does not mean that 
there was no American technological lead- 
ership, in the broad sense of that term. 
Abundant resources were themselves in 
many ways a reflection of the advanced state 
of American technology. But-the distinc- 
tively American industrial innovations were 
in many respects specific to the pre-World 
War II U.S. resource environment and na- 
tional market, both of which were unique 
among the countries of the world. Since 
then, relative American resource abun- 
dance has greatly diminished, not primarily 
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from depletion of national reserves but be- 
cause of the integration of world markets 
for minerals and other commodities. Twen- 
tieth-century patterns of resource discovery 
and production suggest that the historic ba- 
sis for U.S. mineral abundance was much 


more a matter of early “development” than ` 


of geological “endowment.” 


I. The Ascendance of American Industry 
on a Global Scale 


Americans have enjoyed high material liv- 
ing standards since the eighteenth century if 
not earlier, and the acceleration to modern 
rates of per capita growth occurred during 
the first half of the nineteenth century. 
Broadly based American industrial leader- 
ship on a worldwide basis, however, can 
only be dated from the very end of the 
nineteenth century. According to Paul 
Bairoch (1982), the U.S. share of total world 
manufacturing output passed Great Britain’s 
between 1880 and 1900 (Chart 1). In per 
capita levels of industrial output, the United 
States was a weak fourth among the nations 
of the world in 1880, and surpassed Britain 
only after 1900 (Chart 2). Contemporary 
testimony suggests that American technol- 
ogy and manufactured goods began to play 
a qualitatively different role in the world as 
of the 1890s or shortly thereafter. The first 
wave of alarmist European books on 
“Americanization” dates from 1901 and 
1902, with titles and themes (The American 
Invaders, 1901; The Americanization of the 
World, 1901; The American Invasion, 1902) 
that would again become familiar in the 
1920s and 1960s (William Woodruff, 1975, 
p. 123). Rapid inflows of standardized, ma- 
chine-made American shoes after 1894 (said 
to be more comfortable and more stylish 
than the traditional types) caused conster- 
nation in the British boot-and-shoe industry 
and forced a drastic technological overhaul 
(R. A. Church, 1968). Equally dramatic was 
the burst of American exports of machine 
tools and other engineering goods after 
1895, not only to Britain but to the Conti- 
nent and other parts of the world (Roderick 
C. Floud, 1974, pp. 60—62; 1976, pp. 72-82). 

Though the suddenness of the American 
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CHART 1. SHARES OF WORLD INDUSTRIAL 
Output, 1830-1953 
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Source: Bairoch (1982, pp. 296, 304). 


CHART 2. INDUSTRIAL OUTPUT PER CAPITA 
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Source: Bairoch (1982, pp. 294, 302). 


“Invasion” after 1895 may be attributable to 
temporary factors, it seems clear that a 
crossover point of some sort was reached at 
that time.! 


Industry studies seem to confirm this tim- 


ing. Robert Allen has shown that prior to 
the 1890s American blast furnaces had no 
distinctive world-class status in either labor 
productivity or fuel efficiency (Allen, 1977, 
pp. 608-609). By 1900, after key break- 


1$. J. Nicholas (1980) argues that the apparent de- 
cline in the price of American “engineering goods” 
mainly tracks the prices of iron and steel praducts, and 
that the sudden “invasion” of U.S. goods reflected 
temporary delivery lags by British firms during 
1895-1900. As argued below, both of these elements 
reflected more lasting features of American industrial 
success. 


VOL. 80 NO. 4 


throughs in adapting the technology to the 
new Mesabi iron ore, the U.S. industry was 
the world leader by both of these indicators. 
Pig iron was an input in the production of 
steel, which was in turn crucial for railroads, 
construction, and a wide range of machinery 
and manufactured goods. According to 
Allen, before the 1890s American steel rails 
would not have been competitive in the 
domestic market without tariff protection 
(Allen, 1981). Advances in steel were in 
turn complementary to progress in other 
industries. U.S. rubber-tire makers, for ex- 
ample, were well behind the French during 
the bicycle craze of the 1890s, and only 
gained a productivity advantage in conjunc- 
tion with mass production of automobiles 
shortly before World War I (M. J. French, 
1987, p. 66). None of this denies that twen- 
tieth-century U.S. technology emerged from 
an evolutionary learning process over a 
much longer period, as economic historians 
have long stressed (Paul David, 1975; 
Nathan Rosenberg, 1976; David Hounshell, 
1984), But the qualitative changes in indus- 
trial America’s place in the world after 1890 
justify a closer look at this period. 

The timing of U.S. industrial perfor- 
mance corresponds closely to the more 
comprehensive finding of U.S. world leader- 
ship by Angus Maddison, based on esti- 
mates of Gross Domestic Product per man- 
hour (Maddison, 1982, p. 212; compare also 
Moses Abramovitz, 1986). But Maddison 
seems to assume that U.S. leadership in 
productivity corresponded closely to a posi- 
tion of “world leadership” in technology. 
This is surely not the only possibility. In 
terms of conventional growth-accounting, 
the U.S. edge could equally well have been 
attributable to capital or natural resources. 
Interestingly, Maddison’s figures actually 
show that the world leader in GDP per 
man-hour prior to World War I was not the 
United States but Australia. His explana- 
tion is confined to a footnote: “In defining 
productivity leadership, I have ignored the 
special case of Australia, whose impressive 
achievements before the First World War 
were due largely to natural resource advan- 
tages rather than to technical achievements 
and the stock of man-made capital” (p. 258). 
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Can we be certain that the United States 
was not also a "special case" whose perfor- 
mance depended on “natural resource ad- 
vantages"? 

Contrary to expectations of increasing re- 
source scarcity, post-Civil War American 
development featured declining relative 
costs of materials (Louis P. Cain and 
Donald G. Paterson, 1981, pp. 358-360). 
Major new metals discoveries continued un- 
til World War I, while the rate of discovery 
of new oil fields accelerated after 1900 (U.S. 
National Resources Committee, 1937, p. 
149). The timing of leadership in industrial 
production coincides remarkably with 
American world leadership in coal produc- 
tion (after 1900), and that margin also grew 
over time. The United States was also the 
world's leading producer of copper, petro- 
leum, iron ore, zinc, phosphate, molybde- 
num, lead, tungsten, and many other miner- 
als. At the same time, continuing advances 
in internal transportation reduced the real 
costs to manufacturers, creating what histor- 
ical geographers call the “minerals-domi- 
nant economy" of the late nineteenth cen- 
tury (Harvey S. Perloff and Lowden Wingo 
Jr., 1961, pp. 193-197). The improvements 
were often qualitative as well as quantita- 
tive, most strikingly perhaps in the iron ore 
from the rich Mesabi range, which began to 
arrive in the steel mills of the lower Great 
Lakes in the 1890s. Allen's estimates of 
total-factor-productivity in iron and steel as 
of 1907-1909 put the United States at a par 
with Germany (15 percent ahead of Britain), 
but the ratio of horsepower to worker was 
twice as large in America as in either of the 
other two contenders (Allen, 1979, p. 919). 
If we were to adopt the conventional view 
that resource abundance is an alternative to 
technologically based manufacturing, we 
might well be led to question the authentic- 
ity of America's leadership position before 
World War II. But as argued below, this is 
not the only choice available to us. 


II. Hypotheses from the International 
Trade Literature 


A number of hypotheses bearing on 
American industrial history emerge from the 
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literature on the bases fcr international 
trade. According to the Heckscher-Ohlin 
model, the composition of a country’s trade 
reflects the relative abundance of factors in 
that country’s endowment. Simple two-fac- 
tor versions of this theory have frequently 
been rejected, beginning with the “Leontief 
Paradox,” which revealed that in 1947 U.S. 
exports were more capital-intensive than 
were competitive imports (Wassily Leontief, 
1953). Attempts to rationalize this result, 
however, have generated more refined 
propositions. According to the “neo-factor- 
proportions" approach, American exports 
have actually been intensive in skills or hu- 
man capital. This interpretation was sug- 
gested by Leontief himself, and has been 
supported by an empirical regularity first 
identified by Irving B. Kravis (19562), that 
average wage levels in American export in- 
dustries have been persistently higher than 
wage levels in import-competing industries. 
It has become a standard convention in 
empirical trade studies to take the relative 
industry wage as a proxy for skill require- 
ments, and on this basis the skill intensity of 
American exports has been claimed as a 
pattern as far back as 1899 if not earlier 
(Helen Waehrer, 1968). Studies for more 
recent periods have supported this view with 
detailed evidence on the occupational struc- 
ture of the labor force (Donald B. Keesing, 
1968). 

An alternative “third factor" interpreta- 
tion for the paradox is that capital is com- 
plementary to natural resources, and that 
the United States had moved into a position 
of resource scarcity by 1947 (Kravis, 1956b). 
This possibility is supported by Jaroslav 
Vanek's important study of the natural re- 
source content of U.S. foreign trade, 
1870-1955 (Vanek, 1963), which showed 
that the country had moved from a net 
export to a net import position in natural 
resources over that period. This finding 
raises the possibility that U.S. comparative 
advantage may have had a different basis at 
an earlier time. 

A different (though not necessarily mutu- 
ally exclusive) intellectual strategy is taken 
by the "neo-technology" approach. The 
concept of a "technological gap" between 
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the United States and the rest of the world 
was à commonplace in discussions of trade 
and direct investment during the 1950s and 
1960s (Atlantic Institute, 1970). Though 
theory makes a sharp distinction between 
"factor proportions" and "technology" ef- 
fects, in practice the two ideas are often 
similar. Employment of skilled professional 
and scientific personnel is correlated with 
investment in research, often called “R&D 
intensity" or simply the "technology factor" 
(Raymond Vernon, 1970). Similarly, Ameri- 
can "technology" has often been linked as 
much with managerial performance as with 
science-based production methods. Since 
the vertically integrated modern business 
corporation developed earlier and diffused 
more widely in the United States than else- 
where (Alfred D. Chandler and Herman 
Daems, 1980), the conceptual correlations 
among technology, organization, and per- 
sonnel are likely to be high. 

A more difficult conceptual challenge is 
technological leadership manifest in the 
form of new products, exported from the 
United States because they were unavail- 
able elsewhere (Kravis, 1956b). Because ex- 
ports were small as a percentage of output 
for almost all American industries, the U.S. 
case would seem to be a likely example of 
the historical process described by Staffan 
Burenstam Linder (1961) whereby new 
products originally designed for the domes- 
tic market begin to enter foreign trade as 
production expands: "International trade is 
really nothing but an extension across na- 
tional frontiers of a country's own web of 
economic activity" (p. 88). Vernon's “prod- 
uct-cycle" model is perhaps the best-known 
version: New products tended to appear first 
in the United States because they were re- 
sponsive to high-income wants, and because 
they were associated with an environment 
of high labor costs. As processes became 
more mature and routine, trade would be 
displaced by production abroad, but the vol- 
ume of U.S. exports was maintained by a 
continuing flow of new innovations (Vernon, 
1966). 

There is an ever-present danger of 
anachronism in applying such concepts his- 
torically. The United States did not invent 
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the firearm, the shoe, the bicycle, the cam- 
era, or the automobile, and the American 
versions of these goods were not regarded 
in European countries as well suited to 
“high-income wants” (which were better 
served by the English or French). The size 
and character of the U.S. domestic market 
were certainly crucial, but the bulk of the 
new American exports were producers 
goods, whose “novelty” lay not so much in 
consumer taste as in technical specifications 
or quality. The approach taken here there- 
fore concentrates on the supply side, by 
analysis of the changing factor content of 
manufacturing trade over the era of Ameri- 
can ascendancy. Though we cannot claim to 
measure or establish the nature of Ameri- 
can “technological leadership” in a rigorous 
sense, we can illuminate that subject by 
finding the characteristics of those U.S. 
products that had the greatest impact on 
world markets. 

This has been the approach of earl- 
ier historical work.” Using the standard 
methodology of empirical trade studies, 
N. F. R. Crafts and Mark Thomas present 
an analysis of comparative advantage in 


British manufacturing trade between 1910 . 


and 1935, which they contrast unfavorably 
with that of the United States (Crafts and 
Thomas, 1986). They find that Britain con- 
tinued to export products intensive in capi- 
tal and unskilled labor and to import goods 
intensive in human capital (as reflected in 
the average industry wage). A similar re- 
gression for the United States in 1909 shows 


An extensive literature on the so-called "labor- 
scarcity paradox" takes a similar tack, assessing U.S. 
performance indirectly by measuring the factor-saving 
bias of U.S. technology relative to British. The sugges- 
tion by H. J. Habakkuk (1962) that American technol- 
ogy was capital-intensive and labor-saving has given 
way to a more complex picture: American methods 
were more intensive in the use of raw materials and 
fuel and were characterized by a faster pace and more 
intensive utilization of capital (David, 1975; Field, 
1983). The provocative early successes of the "Ameri- 
can system" were limited to a small subset of industries 
in the 1850s (John James and Jonathan Skinner, 1985). 
This work concentrates on the mid-nineteenth century, 
giving little attention to change over time or to the 
overall scope of U.S. industrial performance. 
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a reverse result. They conclude: “The U.S. 
appears already to be following the 'ad- 
vanced country’ pattern of exporting human 
capital intensive goods and importing un- 
skilled labor-intensive goods in 1909" (p. 
637). The next section considers whether 
this impression should be modified on the 
basis of a richer data set. 


III. New Evidence on American Trade 
in Manufactures 


A. Average Factor Intensities 


One of the reasons that American manu- 
facturing trade has been understudied is 
that the Commerce Department trade data 
are entirely separate from the censuses of 
manufactures, which have no information 
about foreign markets. It is not a simple 
task to match these two sources. Fortu- 
nately, a Stanford dissertation by Mary 
Locke Eysenbach estimated production co- 
efficients for 165 industries according to the 
system used in Leontief's 1947 interindustry 
study, and matched these to export and 
import data for 1879, 1899, and 1914 
(Eysenbach, 1976). The present research has 
replicated her procedures and extended the 
data set to 1909, 1928, and 1940? For most 
sample years there are just over 100 usable 
observations, providing a level of detail 
roughly comparable to three-digit SITC cat- 
egories. 

To explore the factor intensity of manu- 
facturing trade, I have used Eysenbach's 
production coefficients to trace relative 
changes over the entire period of observa- 
tion. Her capital and labor coefficients are 
primarily from the census of 1899, while the 
natural resource coefficients were taken 
from Vanek (1963) and hence originate in . 
the input-input table for 1947. 'Thus, this is 
primarily a study of compositional changes 
in manufacturing trade over time rather than 
the actual implicit factor flows in each year. 
As a sensitivity check, however, estimated 
coefficients for alternative years have been 


5David Green deserves most of the credit for the 
detective work that this task entailed. 
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TABLE 1 —CaPrrAL-LaBoR RATIOS FOR MANUFACTURED Goons, 1879-1940 
($000 per EMPLOYEE IN 1909 DoLLAns) 





A. 1899 Coefficients 


1879 189 1909 1914 1928 1940 
Exports 4.186 4.059 4.052 3.961 3.946 3.374 
Imports 2.608 2.886 2.785 2.850 2.907 3.221 
Exports /Imports 1.61 1.41 1.46 1.3 1.36 1.05 
B. 1909 Coefficients 
1879 1899 1909 1914 1928 1940 
Exports 5.405 4.877 4.967 4,811 4959 ` 4.193 
Imports . 2.999 3.079 3.020 3.073 3.486 4.444 
Exports /Imports 1.80 1.58 1.64 1.57 1.42 0.94 
C. 1947 Coefficients 
1879 189 1909 1914 1928 1940 
Exports 4.725 5.170 6.350 6.790 6.330 5.265 
Imports 2.910 3.440 3.420 3.690 4.325 5.850 
Exports/Imports 1.62 1.50 1.86 1.84 1.46 7 0.90 





Sources: 1899 coefficients from Mazy Locke Eysenbach, American Manufactured Exports, 1897-1914, New York: 
Arno Press, 1976, pp. 302-306; 190% coefficients from U.S. Census of Manufactures; 1947 coefficients form Wassily 
Leontief, “Factor Proportions and the Structure of American Trade," Review of Economics and Statistics, 
November 1956, 38, 403—407. 

Trade Figures: for 1879, 1899, 1914 from Eysenbach, pp. 271-275; 1909, 1928, 1940 from U.S. Commerce 
Department, Foreign Commerce and Navigation of the United States. Exact industry groupings available on request. 


TABLE 2—Measures OF SKILL INTENSITY OF MANUFACTURED Goons, 1879-1940 





A. Percentage Earning More than $12/Week in 1890 





1879 189 1909 1914 1928 1940 
Exports 52.3 48.7 48.2 45.9 46.6 42.9 
Imports 48.5 45.7 47.1 44.1 42.3 41.3 
Exports /Imports 1.08 1.07 1.02 1.04 1.10 1.04 

B. Average Wage (1909) 

1879 1899 1909 1914 1928 1940 
Exports 0.467 0.482 0.487 0.502 0.504 0.541 
Imports 0.431 0.433 0.460 0.426 0.463 0.471 
Exports/Imports 1.09 1.11 1.06 1.18 1.09 1.15 

C. Percentage Women and Child Labor (1909) 

1879 — — 1899 1909 1914 1928 1940 
Exports 10.1 10.7 9.9 11.0 112 10.4 
Imports 30.6 29.0 30.2 27.8 24.2 21.1 
Éxports/Imports 0.33 0.37 0.33 0.40 0.46 0.49 





Sources: Percent $/week from Eysenbach, pp. 307-311; average wage from 1909 Census of Manufactures (wage bill 
divided by labor force); women and child labor from 1909 Census of Manufactures (females aged 16 and over, 
under 16, and males under 16, divided by labor force). 


used wherever possible. Since all of the ^ and the results (shown in Tables 1 through 
coefficients are U.S.-based, the question of — 3) are suggestive. 

whether the factor content of imports accu- Table 1 does confirm that American man- 
rately corresponds to foreign production ^ ufacturing exports were more capital-inten- 
techniques is not addressed. Despite these sive than American imports from 1879 to 
limitations, the procedures follow the spirit 1928. But in terms of contemporary coeffi- 
of much of the literature on these subjects, cients, the country's surge to world indus- 
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TABLE 3—NONRENEWABLE NATURAL RESOURCE COEFFICIENTS IN MANUFACTURING GOODS, 
1879-1940 (1947 COEFFICIENTS) 





A. Direct Use 


1879 1899 1909 1914 1928 1940 
Exports 0.0742 0.0677 0.0918 0.0988 0.09984 0.0564 
Imports 0.0131 0.0194 0.0170 0.0133 0.0290 0.0369 
Exports / Imports 5.66 3.49 5.40 7.43 3.39 1.53 
B. Direct and Indirect Use : 
1879 — 1899 1909 1914 1928 1940 
Exports 0.1107 0.1239 0.1647 0.1800 0.1635 - 0.1240 
Imports 0.0565 0.0747 0.0766 0.0749 0.0934 0.1127 
Exports /Imports 1.96 1.66 2.15 2.40 1.75 1.10 


Sources: Coefficients from Eysenbach, pp. 297~301; trade figures, see Table 1. 


trial supremacy was not marked by a shift 
toward capital-intensive manufacturing ex- 
ports, nor by an increasing tendency to trade 
capital-intensive for labor-intensive manu- 
factures with the rest of the world. (It is 
interesting that the relative capital intensity 
of exports in terms of 1947 coefficients did 
rise until 1914, after which it declined.) 
Movement in the direction of the Leontief 
Paradox within manufacturing is detectable, 
at least after World War I. 

It should be noted that the figures in 
Table 1 omit refined sugar, an industry that 
if included would single-handedly generate 
a Leontief Paradox for manufacturing in 
every sample year. If classified as a manu- 
factured good (following Eysenbach), re- 
fined sugar would account for nearly one- 
quarter of manufacturing imports before 
1900, and sugar refining (in the United 
States, at any rate) had a capital-labor ratio 
five times as high as the average for manu- 
facturing. It is open to question whether 
sugar refining techniques outside the United 
States were really this capital-intensive. Be- 
cause the industry is exceptional and be- 
cause we are not in any case trying to ac- 
count for all international flows, it seems 
more informative to leave it out. Though 
extreme, sugar refining does illustrate one 
of the compositional reasons for the trend 
shown in the first two panels of Table 1, 
namely, the high capital intensity of many 
agricultural processing industries, which 
were declining in relative prominence 
among U.S. exports. Two of the largest con- 
tributors to the decline in relative capital 


intensity of exports were grain mill prod- 
ucts, and meat packing and wholesale poul- 
try. 
Table 2 displays two indices of skill inten- 
sity: (1) following Eysenbach, the percent- 
age of the labor force earning more than 
$12 per week in 1890, and (2) the average 
industry wage in 1909.4 By both measures, 
there is some tendency for export industries 
to pay higher wages than import-competing 
industries. But there is little sign of a trend 
in the relative skill intensity of exports and 
imports. As measured by the 1890 “high- 
wage” index, the skill content of exports 
went steadily downward. As measured by 
the 1909 average wage, however, the skill 
content of exports had an upward trend. 
There was also an upward trend, however, 
in the skill content of imports by the same 
measure (excepting 1914). One of the rea- 
sions for this puzzling pattern is suggested 
by the third panel of Table 2, which reveals 
a much more dramatic contrast between 
exports and imports in the percentage of 
the labor force who are women and chil- 
dren (under the age of 16). It is perhaps not 
surprising to see that imports are far more 
women-and-child-intensive than exports, 
since these workers are associated with 
“low-wage” and labor-intensive processes 
(but it is interesting that this direct measure 
of labor-force composition is a clearer sepa- 


^Several other skill indices were proposed by 
Eysenbach, all based on 1890 data. They give results 
similar to those presented here. 
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rator than capital-intensity or wage levels, 
which one might take to be more funda- 
mental). What is striking is the decline over 
time in this relative intensity, entirely con- 
centrated on the import side. Here we have 
another likely contributor to the trend to- 
ward the Leontief Paradox. Employment of 
women and child workers in American man- 
ufacturing was concentrated in only a hand- 
ful of industries: canning, preserving, and 
freezing on the one hand, and textiles and 
apparel on the other. The first remained a 
strong net export category, but in the sec- 
ond, the growth of imports was increasingly 
stifled by tariff barriers, particularly after 
the 1922 Fordney-McCumber tariff. 

Easily the largest factor-intensity differ- 
entials were in nonreproducible natural re- 
sources, as shown in Table 3. Recall that 
these are weighted averages for manufac- 
tured goods alone and exclude entirely ex- 
ports of agricultural goods and crude mate- 
rials. We still find not only thet U.S. exports 
had far higher natural resource content than 
imports but that this trend was growing 
both absolutely and relatively over precisely 
the historical period when the country was 
moving into a position of world industrial 
preeminence, Using the more inclusive index 
of direct and indirect use, the resource in- 
tensity of manufacturing exports grew by 64 
percent to its peak, and even after a slight 
decline, the 1928 level was still nearly 50 
percent higher than that of 1879. The fig- 
ures confirm a little-noticed analysis by 
Robert E. Lipsey (1963): “The composition 
of manufacturing exports has been changing 
ceaselessly since 1879 in a fairly consistent 
direction— away from products of animal or 


vegetable origin and toward those of mineral. 


origin" (p. 59; emphasis added). 

Table 3 also clearly shows that the re- 
source intensity of imports was growing as 
well, and that signs of a reversal in the 
relative balance are détectable even in 1928. 
By 1940, the historic U.S. specialization had 
virtually disappeared. This is the modern 
trend identified first and most clearly by 
Vanek (1963), of no small importance for 
interpreting recent American industrial his- 
tory. But because of his choice of dates and 
coverage, Vanek missed the fact that the 
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declining phase had been preceded by a 
long epoch of rising natural resource inten- 
sity, of no less importance in interpreting 
the country's place in the industrial world. 


B. Regression Analysis 


Simple factor-intensity comparison be- 
tween exports and imports is not conclusive 
in the presence of more than two factors 
(Edward Leamer, 1980). An apparent pat- 
tern of specialization may merely represent 
the effect of a third factor, acting as a 
complement or substitute for one of the 
other two. This section therefore follows the 
general format of Crafts and Thomas (1986) 
and earlier studies in the international trade 
literature by regressing the net trade bal- 
ance for each industry against measures of 
factor intensity. On no account should the 
coefficients be viewed as structural esti- 
mates within a Heckscher-Ohlin framework 
(compare Leamer and Harry P. Bowen, 
1981). They are best considered as descrip- 
tive summaries of trade patterns in a multi- 
factor setting, a way of pointing out areas of 
distinctive strength and tracking changes 
over time. Because the industry or commod- 


‘ity groupings are inevitably arbitrary, R? 


levels by themselves are not particularly 
meaningful; but t-tests on individual coef- 
ficients are a reasonable standard for con- 
fidence in that factor's contribution, and R? 
comparisons across years should reflect 
changes in the tightness-of-fit according to 
factor content. Following Crafts and 
Thomas, all reported standard errors were 
recomputed according to the procedure 
suggested by Hal White (1980) to adjust for 
heteroskedasticity in the error structure. 
The effect generally is to reduce the larger 
t-ratios, so that what is reported here is a 
conservative version of the account that 
leaps from the data using ordinary-least- 
squares. The results are robust to changes 
in precise variable definitions and to trans- 
formations of the coefficients into factor 
shares at various discount rates. Trade val- 
ues have been deflated by export and im- 
port price indices (Lipsey, 1963, pp. 
142-143; 19132100) so that coefficients 
may be compared across years. 
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TABLE 4—REGRESSIONS FOR MANUFACTURED NET Exports OF THE Unitep STATES, 1879-1940 
` Natural 
Capital / Resource Average Percent Women and 
Constant Labor Coefficient Wage Children R? 
1879 —3127 2092** — 10830 — 1853 0.079 
(0.68) (2.24) (0.74) (0.27) 
— 228 1725* — 12690 —156 0.103 
(0.06) (1.77) (0.83) (1.53) 
1899 — 4068 3729* —4324 — 802 0.075 
(0.66) (1.73) (0.11) (0.07) 
1735 3140 — 8727 — 255** 0.093 
(0.28) (1.46) (0.21) (2.02) 
1909 — 8965 2648 46950 959 0.146 
(0.92) (1.17) (1.17) (0.06) 
260 1810 44154 — 380** 0.193 
(0.04) (0.75) (0.99) (2.25) 
1914 —21041** 1600 103103* 28468** 0.261 
(2.56) (0.53) (1.71) (2.12) 
216 1038 98271* —329* 0.275 
(0.02) (0.33) (1.55) (1.93) 
1928 —21067 5040 112264** 18856 0.143 
(1.20) (0.83) Q.19) (0.52) 
—4342 4413 107406** —333 0.149 
(0.17) (0.67) Q.01) (0.87) 
1940 — 31898 —1862 126449** 85642 0.085 
(1.13) (0.42) Q.22) (1.38) 
23714 — 2750 117138** — 629* 0.077 
(1.24) (0.58) (2.11) (1.79) 


Notes: Method of estimation is ordinary least-squares, f-ratios (in parentheses) adjusted for heteroscedasticity 
following procedure of White (1980). *Denotes statistical significance at the 5 percent confidence level; **denotes 
the 1 percent confidence level. There are 64 nonzero observations in 1879, 83 in 1899, and 96 in the remaining 


years. 


The results in Table 4 are broadly consis- 
tent with those of the previous section. The 
capital-labor coefficient is significant in 1879, 
but it becomes steadily less so in subsequent 
years and is actually negative by 1940. Thus 
indications that the Leontief Paradox 
emerged historically are still present in a 
multivariate setting. The natural resource 
coefficient, on the other hand, begins nega- 
tive and becomes significantly positive after 
1909, reaching its peak (in both level and 
significance) in 1928. 

The coefficients of the two labor force 
variables are also interesting. The coeffi- 
cient of the average wage is significantly 
positive in only one year (1914). The coef- 
ficient on the percentage of women and 
child laborers, by contrast, is significantly 
negative in four of the six years and nearly 
so in the remaining two. When both vari- 
ables are included (not shown), the coeffi- 


cient on the average wage is negative or 
insignificant in every year. Furthermore, 
there is an evident inverse relationship be- 
tween natural resource intensity and the 
presence of women and children. It ap- 
pears, therefore, that the concentration of 
American net exports in “high wage” indus- 
tries early in the century was attributable to 
the absence of women and child workers in 
these “heavy” industries? 

An important amendment to this account 
emerges from Table 5, which uses a new 


This does not necessarily mean that the effect is 
purely compositional, that is, directly explained by the 
lower wages paid to women and children. Men who 
worked in these occupational-industrial categories also 
received lower wages. But these wages did not reflect 
"skill" levels so much as the ease with which women 
and children could be substituted for men in these 
industries. 
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TABLE 5— REGRESSIONS FOR MANUFACTURED Net Exports or THE U.S., 1879-1940 


























Note: See Table 4. 


variable created by multiplying the capital- 
‘labor ratio and the natural resource co- 
efficient. The results strongly imply that 
capital and natural resources were comple- 
mentary factors of production. The coeffi- 
cient of the new variable is positive through 
the entire period, growing steadily larger 
and more significant through 1928. Compar- 
ison of R? levels between Tables 4 and 5 
shows that this new interactive variable is 
more powerful in accounting for net export 
performance than the combined effect of its 
two components, entered separately. The 
strongest effects are found in 1914 and 1928; 
in the latter year, for example, the R? rises 
from 0.149 to 0.252 merely by substituting a 
single variable, the product, for the original 
two. 

This result should caution us against a 
too-hasty and too-complete rejection of 
“capital intensity" as a characteristic of 
American industry. The suggestion is, how- 
ever, that capital intensity derived not from 
economy wide abundance of capital per se, 





Percentage 
Capital and Natural Average Women and 
Constant Resources /Labor Wage Children R? 
1879 236 2741** 977 0.058 
(0.05) (2.17) (0.14) 
3815 2234 ~ 182* 0.095 
(1.31) (1.54) (1.88) 
1899 2495 5650** 4617 0.057 
(0.32) (2.81) (0.40) 
10015* 4677* —314** 0.088 
(1.98) (1.95) (2.58) 
1909 —2974 9312** 6052 0.165 
(0.31) (3.46) (0.37) 
6955* 8045** —428* 0.229 
i (1.93) (2.68) Q.67) 
1914 ~ 15799** 13279** 33918** 0.299 
Q.08) (3.50) Q.57) 
7317** 12198** — 386** 0.321 
(2.23) (3.07) (2.68) 
1928 — 10667 24084** 28310 0.241 
(0.75) (2.87) (0.88) 
9857 22954** — 399 0.252 
(1.09) (2.61) (1.40) 
1940 ~ 33084 12118** 86974 0.095 
(1.14) (2.23) (1.36) 
19478** 10590** -575 0.083 
(2.00) (1.89) (1.87) 


but from specialization in an industrial 
technology in which capital was comple- 
mentary to natural resources. Strictly speak- 
ing, these sorts of tests only describe the 
direction of trade, not the overall “success” 
of American industry. But the coincidence 
of timing between resource intensity and 
American industrial ascendance obliges us 
to consider the proposition that the abun- 
dance of industrial minerals was a deeper 
cause of American industry's distinctive 
strength. 


IV. Natural Resources and American 
industrial Success 


Since industrial success like other histori- 
cal outcomes requires an uncountable num- 
ber of mutually interdependent elements, 
do natural resources really deserve special 
attention? The scope of America's world 
leadership in natural resources is displayed 
in Chart 3, which shows U.S. production of 
14 major industrial minerals as a percentage 
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Cuart 3. U.S. MINERAL OuTpuT, 1913: 
PERCENTAGE OF WoRLD TOTAL 


Natural Gas 
Potroleum 
Copper 
Phosphate 
Coal 
Molybdenum 
Bauxite 
Zinc 
Iron Ore 
lead 
Silver ^ 
Salt 20 : i to Transvaal) 

Gold 


Tungsten 
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Source: Smith (1919), using data from U.S. 
Geological Survey (1913). , 


of world totals in 1913. The 95 percent of 
world natural gas and 65 percent of world 
petroleum were perhaps of somewhat less 
economic moment in 1913 than they would 
be at a later date. But copper, coal, zinc, 
iron ore, lead, and other minerals were at 
the core of industrial technology for that 
era, and in every single case the United 
States was the world's leading producer by a 
wide margin. In an era of high transport 
costs, the country was uniquely situated with 
respect to almost every one of these miner- 
als. Even this understates the matter. Being 
the number one producer in one or another 
mineral category is less important than the 
fact that the range of mineral resources 
abundantly available in the United States 
was far wider than that in any other coun- 
try. Surely the link between this geographi- 
cal status and the world success of Ameri- 
can industry is, more than incidental. Cain 
and Paterson (1986) find that between 1850 
and 1919, material-using technological bi- 
ases were significant in nine of twenty 
American sectors, . including those with the 
strongest export performance, such as 
petroleum, metals, and machinery. 
Resource abundance was a background 
ingredient in many other distinctively Amer- 
ican industrial developments. Continuous- 
process, mass-production methods, closely 
associated with modern forms of corporate 
organization in the analysis of Chandler 
(1977), were characterized by “high 
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throughput" of fuel and raw materials rela- 
tive to labor and production facilities (com- 
pare Michael Piore and Charles Sabel, 
1984). Oliver Williamson (1980) notes that 
cheap, reliable sources of energy and heat 
were crucial to this development. Coal was 
of strategic early importance as a direct 
source of heat and power, and at a later 
point as a source of thermal energy for 
electricity, essential to the efficiency of the 
moving assembly line and other quasi-flow 
processes. Alex Field (1987) points out that 
organizational innovations of this type may 
be considered “capital-saving” overall, even 
though firm-level capital requirements were 
high. In export markets, contemporary com- 
ments emphasized non-price competition 
and particularly the short delivery lags on 
the part of U.S. suppliers (Nicholas, 1980, 
pp. 581-587). Quick delivery is-a feature 
one would expect to see where exports have 
a “vent-for-surplus” quality, because of the 
length of a production run on a standard- 
ized item. In addition, American producer 
and consumer goods were often specifically 
designed for a resource-abundant environ- 
ment. Some of the adjustment problems of 
U.S. auto companies in recent years stem 
from their decades of specialization on large, 
fuel-using cars. There was a parallel prob- 
lem facing U.S. locomotive manufacturers 
in the 1920s, who found their foreign sales 
handicapped by their design for standard- 
gauge rails, heavy motive power, and heavy 
train loads (Markets of the United States, p. 
71). 

The emergence of cheap American steel 
at the end of the nineteenth century was 
particularly important. Whereas S. J. 
Nicholas (1980) suggested that the fall in 
relative U.S. machinery prices was mislead- 
ingly proxied by iron and steel prices, it may 
be that the world success of American engi- 
neering goods was buoyed by exactly that 
development. Table 6 shows the major role 
played by iron and steel exports over the 
half-century under discussion. If we aggre- 
gate the three headings under which iron 
and steel products were listed, we find that 
their share of U.S. manufacturing exports 
grew steadily, from 5.5 percent in 1879 to 
37.5 percent in 1929. If we add in one other 
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TABLE 6—SHARES OF UNITED STATES MANUFACTURING Exports, 1879-1929 (PERCENT) 








Iron and Steel 





Products (except 
: Machinery and Automobiles SUM Petroleum SUM 
Vehicles) Machinery and Parts (1,2,3) Products (1,2,3,5) 

1879 21 34 - 5.5 12.1 17.6 
1889 24 6.1 - 8.5 13.3 21.8 
1899 7.6 10.7 - 18.3 9.2 27.5 
1913 10.9 14.5 23 217 10.1 37.8 
1923 8.8 12.4 6.4 27.6 13.1 40.7 
1926 5.6 12.9 11.5 30.0 16.8 46.8 
1927 5.1 13.9 13.3 32.3 14.7 47.0 
1928 5.3 16.4 15.7 37.5 13.9 51.4 
1929 5.4 16.4 15.7 37.5 13.9 51.4 





Source: 1879-1923 (1963), Tables A-& and A-12; 1926-1929, U.S. Department of Commerce, Foreign Commerce and 
Navigation of the United States for the Calendar Year 1929, Vol. 1, Tables XII and XXIV. 


heading in which resource abundance was 
evidently important, petroleum products, we 
find that by late 1920s, we have accounted 
for more than half of all American manu- 
facturing exports. The union of these two 
sectors is, in essence, the automobile indus- 
try. The United States was unquestionably 
the world's technological leader in automo- 
bile production during the 1920s. At the 
same time, American producers had enor- 
mous cost advantages over competitors in 
raw materials, especially steel. Ford UK 
faced steel input prices that were higher by 
50 percent or more than those paid by the 
parent company (James Foreman-Peck, 
1982, p. 874). It was not accidental that 
Leontief chose motor vehicles as his most 
prominently displayed example of the econ- 
omy as an intricate input-output machine: 
each million dollars worth of automobiles 
in 1947 "contained" nearly half that 
much value in iron and steel, nonferrous 
metals, and other fabricated metal products 
(Leontief, 1953, p. 334). 

We may also conjecture that there were 
links between the economy of high through- 
put and the intensity of the work pace, 
which also seems to have been a distinctive 
feature of U.S. industry (Clark, 1987). 
American firms paid the world's highest real 
wages and apparently extracted greater ef- 
fort from the labor force in return. But it is 
an anachronism to associate “high wages" 
with “high skill” technologies for the era in 
which the United States surged to world 


industrial preeminence. The United States 
was a well-educated country, but most of 
the workers in the fast-paced, heavy- 
industry, mass-production manufacturing in 
which the country led the world were not 
well-educated native-born Americans. In 
1910 the foreign born and sons of foreign 
born were more than 60 percent of the 
machine operatives in the country, and more 
than two-thirds of the laborers in mining 
and manufacturing (U.S. Senate, 1911, pp. 
332, 334). There is no reason to believe that 
this labor force was particularly well edu- 
cated by world standards. Key industries 
like iron and steel and motor vehicles paid 
high wages to unskilled workers (who were 
nonetheless much cheaper than the skilled 
craft workers used with older technologies), 
presumably because it was rough, disagree- 
able, dangerous, demanding work, and be- 
cause it was vital to have an ample excess 
labor supply available (compare Daniel Raff, 
1988). In the 1930s these industries were 
central to the movement for industrial 
unionism, which subsequently provided an 
alternative mechanism for the continued as- 
sociation between high-wage industries and 
American industrial success. 


V. What Became of American 
Resource Abundance? 


The marked changes in coefficients for 
1940 seem to portend the post- World War 
II pattern, when the United States moved 
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CHART 4. U.S. Ner MINERAL IMPORTS AS A 
PERCENTAGE OF CONSUMPTION 





“10 
1870 1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 


Source: Manthy (1978, Tables MCI and MC2). 


steadily and increasingly into a position of 
net mineral imports (Chart 4). Beginning 
mainly in the 1920s, one important mineral 
after another began to enter the net import 
column: nonferrous metals, bauxite, lead, 
zinc, copper, iron ore, and petroleum among 
others. Without conducting extensive global 
cost comparisons, it is evident that a coun- 
try for whom resource prices are deter- 
mined at the margin by imports is not going 
to have a major locational advantage in 
resource costs over its industrial rivals. But 
what exactly was the process of change in 
America’s resource position? A popular 
conception is that the country has largely 
exhausted its resource endowment and has 
had to import so as to avert domestic 
shortages. Kindleberger has proposed a 
weaker version of this scenario within the 
Heckscher-Ohlin framework, in which the 
more rapid growth of labor and capital rela- 
tive to resources has turned the country 
from a net-export to a net-import position 
with respect to resources (Charles P. 
Kindleberger 1960, pp. 347-348) It is 
doubtful that this account is generally valid. 
Indeed, a closer look at the trend in world 
mineral supplies casts a different light on 
the character of the original position. 

In 1919 it could confidently be written 
that “the United States is more richly en- 
dowed with mineral wealth than any other 
country” (George Otis Smith, 1919, p. 282), 
and such a statement was consistent with 
the best geological and industrial knowledge 
of the day. But the clear pattern of discover- 
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Cuart 5. Wonrp Iron Ore Reserves, 1910 
AND 1955 
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ies since that time indicates that there was a 
systematic historical bias in these percep- 
tions, in that American resources had been 
much more thoroughly explored and ex- 
ploited than those of other parts of the 
world. Chart 5 illustrates this process, by 
comparing world iron ore “reserves”, as in- 
dicated by a 1910 survey by the Interna- 
tional Geologic Congress, with those re- 
ported in a United Nations survey in 1955. 
Granted that quality differences and extrac- 
tion and transport costs are neglected in a 
simple chart, nonetheless the pattern is so 
clear as to be beyond dispute. Europe and 
North America had by far the largest re- 
serves in 1910, but their “endowments” 
(which, to be sure, had increased and not 
decreased) had grown only slightly in the 
intervening 45 years. What had been a dom- 
inating advantage in 1910 was no more than 
a respectable presence in 1955. 

The case of petroleum is even more ex- 
treme (Chart 6). Recall that the United 
States in 1913 (and for a half-century be- 
fore) had been the world's largest petroleum 
producer and exporter, by a wide margin. 
As Chart 6 shows, as late as 1948, North 
American reserves were nearly equal to 
those of the Middle East. In 1988, though 
again reserves of all areas had increased, 
North America was a minor part of the 
world petroleum picture. It is difficult to 
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CHART 6. WORLD CRUDE Orr Reserves, 1948 
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avoid the inference that mineral supplies 
were more a matter of “development” than 
“endowment.” 

Where world geological surveys are not 
available, similar conclusions can be reached 
by other routes. In the case of bauxite, 
which takes its name from the French vil- 
lage where it was first developed, the United 
States and France alternated as first and 
second in the world until the 1950s. With 
discoveries in the West Indies in the 1950s, 
Jamaica quickly moved into first place, at 
annual production levels larger than those 
ever achieved in either France or the United 
States, despite the fact that production lev- 
els in those two countries did not decline 
but continued to grow to levels higher than 
they themselves had ever achieved. In the 
late 1960s, Australia replaced Jamaica as 
number one, again setting new production 
records without causing an absolute decline 
in any of the older countries. In both Ja- 
maica and Australia, bauxite production was 
negligible before World War II. Since the 
real price of bauxite has declined, it is not 
the case that domestic reserves have been 
“exhausted” or that distant supplies have 
simply been coaxed out along a world sup- 
ply curve. Rather, early discoveries and 
mining took place in areas proximate to the 
early centers of industrial and technological 
development and within the boundaries of 
their national jurisdiction. 
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The last phrase points toward another 
sense in which resource abundance was his- 
torically rather than geologically deter- 
mined. The United States was the world’s 
largest mineral producing nation, but it was 
also one of the world’s largest countries! 
Even without Alaska, at 3.5 million square 
miles, the United States is twice the size of 
all the countries of eastern and western 
Europe and Scandinavia combined (exclud- 
ing Russia). Yet coal and iron ore produc- 
tion in Europe was 30 to 50 percent higher 
than the U.S. total in the 1910-1913 era. 
If coal and iron were the imperatives of in- 
dustrial location ca. 1900, a hypothetical 
United States of Europe would have rivaled 
America. 

More important than sheer geographic 
size is economic distance. The United States 
was a vast free trade area for internal com- 
merce, and the opportunities created by this 
status provided the incentive for massive 
investment in transportation infrastructure, 
including the highly efficient lake transport 
system that linked Mesabi ore to Pennsylva- 
nia coal. In the case of copper, only the 
combination of national size and efficient 
internal transportation allow use to say that 
the “same” economy retained world leader- 
ship across the period of American indus- 
trial ascendancy, since the early production 
center in Michigan gave way to remote but 
larger and richer locations in the Mountain 
and Southwest regions between 1870 and 
1930. 

The argument does not stop with national 
size and efficient transportation. The pro- 
cess of mineral discovery and development 
was also a prime outlet for creative energies 
and innovations, often at high levels of tech- 
nical and organizational sophistication. The 
United States Geological Survey, formed in 
1879 by consclidation of several existing 
federal surveys, had intimate links with the 
mining industry. Reports by government ge- 
ologists in Colorado in the 1880s were cru- 
cial in encouraging mining activity and 
adapting metallurgical knowledge to local 
requirements (Rodman Wilson Paul, 1960). 
The American Institute of Mining Engi- 
neers became the first speciality group to 
break away from the American Society of 
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Civil Engineers. Scientifically trained per- 
sonnel were also important in expanding 
the range of uses for available minerals. An 
early report by Yale geologist Benjamin 
Silliman, Jr., foresaw the commercial possi- 
bilities of “cracking” petroleum into various 
compounds, opening up arrays of new uses 
for what had been considered a useless 
waste material (Robert V. Bruce, 1987, pp. 
140—142). But as Nathan Rosenberg (1985) 
points out, much of the early use of science 
by American industry did not deploy new 
knowledge at the scientific "frontier," but 
involved repetitive procedures (such. as 
grading and testing materials) for which sci- 
entific training was needed but where the 
learning was specific to the materials at 
hand. The abundance of mineral resources, 
in other words, was itself an outgrowth of 
America's technological progress. 

This view of the matter suggests the an- 
swer to the question posed above. The 
country has not become "resource poor" 
relative to others, but the unification of 
world commodity markets (through trans- 
port cost reductions and elimination of trade 
barriers) has largely cut the link between 
domestic resources and domestic industries. 
American corporations and engineers have 
been in the forefront of the globalization of 
the mineral economy. In essence, the pro- 
cess by which the United States became a 
unified “economy” in the nineteenth cen- 
tury has been extended to the world as a 
whole. 'To a degree, natural resources have 
become commodities rather than part of 
the “factor endowment” of individual coun- 
tries. Presumably this is why international 
economists now distinguish resource-based 
“Ricardo goods” from others and treat them 
separately (for example, Robert Stern and 
Keith E. Maskus, 1981). This procedure may 
be appropriate for the contemporary world, 
but it would be hard to do justice to the 
historic success of American industry within 
this conception. 


SWilfred J. Ethier and Lars E. O. Svensson (1986) 
show that in a Heckscher-Ohlin framework with mobil- 
ity of some factors a country's trade pattern in goods is 
affected only by its endowment of nontraded factors. 
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VI. Conclusion 


Why has the importance of mineral re- 
sources in American industrial history been 
underappreciated? Concern for the future 
of natural resources is an ancient theme in 
economics, but most of the attention has 
been channeled into two rather different 
issues: fear of rising costs from increased 
resource scarcity and fear of national strate- 
gic inadequacy in the event of war. Refuting 
the first fear has long been the economist’s 
favorite pastime, as it has been easy to show 
that producers substitute away from rela- 
tively scarce resources and that the real 
prices of “nonrenewable” resources have 
historically declined. The second fear has 
always seemed noneconomic in character, if 
not indeed a cooked-up rationalization for 
subsidy or protection. Having thus dealt with 
the “problem” of resource exhaustion, it 
was easy to overlook a logically distinct as- 
pect of the matter: the contribution of re- 
source location to the competitive potential 
of a country’s industries. Some economic 
historians, to be sure, have long analyzed 
national economic histories in terms of 
world geographical patterns (William N. 
Parker, 1984). But it is perhaps understand- 
able that Americans have not been inclined 
to attribute their country’s industrial suc- 
cess to what appear to be accidental or 
fortuitous geographic ‘circumstances. An- 
other reason is that American industrial 
leadership took on a rather different shape 
after World War II. Over the course of the 
twentieth century, the country was able to 
parley its resource-based industrial prosper- 
ity into a well-educated labor force, an in- 
creasingly sophisticated science-based tech- 
nology, and .world leadership in scientific 
research itself. In the wake of World War 
II, there were no serious international rivals 
in such a wide range of industries that it 
was easy to lose sight of the resource di- 
mension of industrial performance. After 
the war, there was a brief period of concern 
that the nation's resource position had been 
eroded, culminating in the Paley Commis- 
sion Report of 1952. But such doubts and 
fears were largely swept away in the Ameri- 
can-led world prosperity of the next 25 years. 
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To be clear about the argument, there is 
no iron law associating natural resource 
abundance with national indvstrial strength. 
But the distinctive American technologies 
have, as a matter of history, been relatively 
resource-using. We have now moved from 
an era in which the rest of the world adapted 
to an American technology, with varying 
degrees of difficulty, to an era in which U.S. 
firms have had to do the adjusting. The 
adjustment is not made much easier by the 
consideration developed in this paper, that 
historical resource abundance was itself 
largely an outgrowth of American industrial 
Success. 
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Unexpected Inflation, Real Wages, and Employment 
Determination in Union Contracts 


By Davi Carp* 


This paper examines the effect of nominal contracting provisions on employment 
determination in union contracts. In most contracts the nominal wage rate is 
wholly or partially predetermined. Real wage rates therefore contain unanticipated 
components that reflect unexpected price changes and the degree of indexation. 
The empirical analysis, based on a large sample of indexed and nonindexed 
contracts, suggests that unexpected real wage changes are associated with system- 
atic employment responses in the opposite direction. I conclude that nominal 
contracting provisions play a potentially important role in the cyclical properties 
and persistence of employment movements in the union sector. (JEL 130,820) 


What role do nominal wage contracts play 
in the determination of employment and 
the characteristics of the business cycle? An 
influential series of papers by Stanley Fis- 
cher (1977), Edmund S. Phelps and John B. 
Taylor (1977), and John B. Taylor (1980) 
argued that fixed wage contracts create a 
link between employment and aggregate de- 
mand. More recent models of macro fluc- 
tuations stress other channels for the träns- 
mission and persistence of aggregate shocks. 
Real business cycle models (for example, 
Finn E. Kydland and Edward C. Prescott, 
1982) assume that supply and demand in 
the labor market are equilibrated at Wal- 
rasian levels and ignore the institutional 
structure of wage determination. Recent 
models in the Keynesian tradition, on the 
other hand, have shifted attention from 
nominal wage rigidities to real wage rigidi- 
ties (for example, Olivier J. Blanchard and 
Lawrence H. Summers, 1986) or nominal 
price rigidities (for example, N. Gregory 
Mankiw, 1985; Olivier J. Blanchard and 
Nobuhiro Kiyotaki, 1987). 

This shift in interest reflects dissatisfac- 
tion with both the theoretical underpinnings 


*Department of Economics, Princton University, 
Princeton, NJ 08540. I am grateful to Robert Hall, 
Robert King, and two referees for their comments on 
earlier drafts. Thomas Lemieux and Sara Turner.pro- 
vided expert assistance in data preparation. 
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and empirical performance of nominal con- 
tracting models. One the one hand, there 
are as yet no convincing theoretical expla- 
nations for the existence of nominally fixed 
contracts. Many of the models developed 
over the past decade predict constant real 
wages or constant real earnings. On the 
other hand, the evidence in support of nom- 
inal contracting models is also weak. The 
simplest of these models asserts that aggre- 
gate demand shocks lead to real wage 
changes that induce movements along a 
downward-sloping demand schedule. Al- 
though unanticipated price increases are 
apparently correlated with real economic 
activity (see the review by Jo Anna Gray 
and David Spencer, forthcoming), the ab- 
sence of a clear negative correlation be- 
tween aggregate employment and real wages 
(Patrick T. Geary and John Kennan, 1982) 
poses a serious challenge to models of nom- 
inal wage rigidity. 

This paper presents new evidence on the 
consequences of nominal contracting provi- 
sions for employment determination in the 
unionized sector of Canadian manufactur- 
ing. The analysis, based on data for 1300 


!See the survey of implicit contracting models by 
Sherwin Rosen (1985). A concise summary of the im- 
plications of these models from a macroeconomic per- 
spective is presented by Stanley Fischer (1987, pp. 
42-50). 
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indexed and non-indexed contracts written 
between 1966 and 1982, suggests that nomi- 
nal contracting provisions play an important 
role in the link between aggregate demand 
and employment. As predicted by the sim- 
ple models of Fischer (1977) and Jo Anna 
Gray (1976), I find that real wage changes 
induced by aggregate price surprises lead to 
systematic employment responses in the op- 
posite direction. Unexpected real wage 
changes also affect subsequent wage deter- 
mination: the empirical results suggest that 
roughly one-third of such changes carry over 
to the following contract. Unanticipated 
price increases therefore generate short-run 
employment responses and persistent wage 
changes among firms in the union sector. 
Two features of the empirical analysis 
distinguish these results from eariier at- 
tempts to measure the effects of nominal 
wage rigidities. First, the analysis is based 
on individual contract data rather than ag- 
gregate or industry-level data.? Since union 
contracts differ in their negotiation dates 
and degrees of indexation, it is possible to 
calculate contract-specific measures of un- 
expected price increases and unexpected 
real wage changes, and to estimate the sep- 
arate effects of price surprises and real wage 
surprises. Variation in contract lengths and 


?Much of the earlier literature on aominal contract- 
ing models focuses on their implications for aggregate 
price and wage dynamics: see Taylor (1980) and Orley 
Ashenfelter and David Card (1982). A recent study by 
Shaghil Ahmed (1987) correlates the degree of wage 
flexibility in an industry, measured bv the elasticity of 
indexation among indexed labor contracts, with the 
slope of the industry-specific Phillips curve. Ahmed's 
measure of wage flexibility is based on a sample of only 
98 contracts in 20 industries, and fails to take into 
account any of the characteristics of the nonindexed 
contracts in an industry. Furthermore, his measure of 
"flexibility only pertains to workers in large union con- 
tracts and ignores variation across :ndustries in the 
extent of unionization or the share of [arge firms. Thus, 
I do not interpret his findings as strong evidence for or 
against the hypothesis that nominal contract rigidities 
are important. The approach taken by Mark Bils (1989) 
is perhaps most similar to that in this paper. He 
compares the variability of industry employment growth 
in months with a significant number of contract negoti- 
ations to the variability in other months. 
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the staggering of expiration dates also make 
it possible to control for aggregate-level dis- 
turbances that affect all contracts at a point 
in time. Second, the analysis pavs special 
attention to the issue of endogenous wage 
determinaticn? Even in a simple Fischer- 
Gray contracting framework this is a poten- 
tially serious problem, insofar as the bar- 
gaining parties have information on future 
employment demand that is unavailable to 
an outside data analyst. If predictable com- 
ponents of future employment demand af- 
fect wages, they create a simultaneity bias in 
ordinary least-squares estimates of the elas- 
ticity of employment with respect to real- 
ized wage rates. 

To solve this problem I use the unex- 
pected component of real wages as an in- 
strumental variable for the level of wages. 
By assumption, unexpected changes in real 
wages are correlated with wages but uncor- 
related with information known at the nego- 
tiation date of the contract. Unexpected 
wage changes therefore form a valid instru- 
mental variable for a structural analysis of 
employment demand. This procedure also 
provides a direct test of the role of nominal 
wage rigidities in generating employment 
responses to nominal shocks. The instru- 
mental variables estimate of the elasticity of 
labor demand is nonzero if and only if em- 
ployment is correlated with unexpected real 
wage changes. 

The empirical results confirm the value of 
this approach. In ordinary least-squares re- 
gressions, changes in employment are only 
weakly related to changes in contract wages. 
When unexpected real wage changes are 
used as an instrumental variable, however, 
employment is found to be systematically 
negatively related to wages. This finding 
continues to hold when unexpected price 
changes are added directly to the employ- 
ment demanc equation. It is also robust to 
the addition of unrestricted dummy vari- 


John Kennan (1988) presents an illuminating analy- 
sis of the difficulties that arise in the interpretation of 
aggregate employment and wage data when the data 
are generated by a simple model of demand and sup- 


ply. 
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ables representing each year of the sample. 
I conclude that nominal wage contracts play 
an important role in determining the cycli- 
cal properties and persistence of employ- 
ment in the union sector. 


I. Employment and Wages in a Simple 
Contract Model 


A. Interpreting the Correlation of 
Employment and Wages 


This section outlines a simple model of 
long-term contracting in which nominal 
wages are predetermined and employment 
is set unilaterally by the firm after aggregate 
prices and firm-specific demand shocks are 
observed. Even in this simple model the 
interpretation of the partial correlation of 
employment and real wages is clouded by 
the fact that the contracting parties may 
have better information on future demand 
shocks than is available to an outside data 
analyst. To develop this point more for- 
mally, suppose that wages are negotiated in 
some base period (period 0) for a contract 
of duration T. Let n(t) and w(t) denote the 
logarithms of employment and real wages in 
period ¢ of the contract, respectively, and 
assume that hours per worker are fixed. The 
notion of “nominal contracting” is captured 
by the assumption that the bargaining par- 
ties do not set w(t) directly: rather, they 
establish a series of nominal wage increases 
from the start of the contract, possibly in 
conjunction with an indexation formula.* 
Let w*(t) represent the parties’ expectation 
of w(t), conditional on their information in 
the negotiating period, and let u(t) repre- 
sent the forecast error w(t)— w*(t). The 
distribution of u(t) depends on the length 


“The nature of typical indexation formulas in North 
American labor contracts is described in my 1983 pa- 
per. The only case in which the real wage is set directly 
by the parties is the case of a contract in which nomi- 
nal wages are indexed to the consumer price level with 
a formula that increases the wage by one percent for 
each percentage point increase in prices. Such formu- 
las are rare, particularly in the manufacturing sector of 
the United States and Canada. 
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of the contract and whether or not it con- 

tains a cost-of-living escalation clause.? 
Assume that n(f) is determined by an 

employment demand schedule of the form 


(1) n(t) = az(t)+ Bw(t) + e(t), 


where z(t) is a vector of observable vari- 
ables shifting the demand for labor, B rep- 
resents the elasticity of labor demand (f < 
0), and «(t) is an unobservable component 
of employment variation. The specification 
of z(t) and the corresponding interpreta- 
tion of B are discussed in the next section. 
Note that supply considerations are explic- 
itly ignored: there are assumed to be enough 
available workers to fill the firm’s demand 
irrespective of the forecast error in real 
wages. This assumption is a plausible one in 
the context of the available data, which per- 
tain to unionized manufacturing establish- 
ments. 

This simple model is completed by a spec- 
ification of the determinants of w*(r). As- 
sume that the expected real wage rate in 
period ¢ is determined at the negotiation 
date by variables known at that time, say 
x(0), and by the parties' expectations of z(1) 
and e(t), z*(t) and e*(t), respectively: 


(2) w*(t) =az*(t) + bx(0) + ce*(t). 


The realized real wage rate in the tth Pe 
riod of the contract is therefore 


w(t) =az*(t) + bx(0) +ce*(t) + u(t). 


The presence of simultaneity bias in ordi- 
nary least squares (OLS) estimates of the 
employment demand equation (1) depends 
on two factors. If £*(t) = 0, then the parties 
have no informational advantage and there 
is no simultaneity problem. Alternatively, if 
c=0, negotiated wages are unaffected by 
expected employment demand and again: 
there is no simultaneity problem. If the par- 
ties are better able to forecast employment 


‘This point is made by Wallace E. Hendricks and 
Lawrence M. Kahn (1987). 
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demand than an outside observer, however, 
and if higher forecasted demand leads to an 
increase in negotiated wage rates, then real 
wage rates will be positively correlated with 
the error in the employment equation, lead- 
ing to a positively biased estimate of the 
wage elasticity B. 

Irrespective of the parties’ wage setting 
behavior, the elasticity may be consis- 
tently estimated by considering the correla- 
tion between unanticipated wage rates and 
employment outcomes. The forecast error 
u(t) forms a natural instrumental variable 
for w(t): by definition, it is correlated with 
wages but uncorrelated with information 
available to the parties at the time of their 
negotiations.Ó Additional instruments may 
also be available if there are determinants 
of negotiated wages that can be legitimately 
excluded from the employment demand 
equation (the variables denoted as x(0) in 
equation (2) above). 

There are two important caveats to this 
procedure. The first is the possibility that 
forecast errors in real wages are directly 
correlated with unobservable determinants 
of labor demand. Suppose for example that 
employment demand shocks are positively 
correlated with unexpected price increases.’ 
Then unexpected real wage increases are 
negatively correlated with employment de- 
mand shocks, leading to a negative bias in 
the instrumental variables estimate of the 
wage elasticity 8. A simple way to control 
for this possibility is to include unexpected 


*It is interesting to compare this procedure to the 
one suggested by Bennett T. McCallum (1976) for the 
estimation of a structural equation that contains the 
expected value of a future endogencus variable. Mc- 
Callum’s procedure replaces the expected future value 
by its actual value and uses the predicted value (from a 
linear forecasting equation) as an instrumental vari- 
able. His procedure therefore eliminates simultaneity 
bias induced by a correlation between the dependent 
variable and the unexpected component of the ex- 
planatory variable. In the present context, the si- 
multaneity bias arises from a correlation between the 
dependent variable and the expected value of the ex- 
planatory variable. Hence, the proposed instrument is 
the , unexpected component of the explanatory variable. 

"This may arise if employers have imperfect infor- 
mation on their relative demand shocks. 
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consumer price increases directly in the em- 
ployment equation and to use variation 
across contracts in the degree of indexation 
to separately identify the effects of unex- 
pected wage changes and unexpected price 
changes. A complementary approach is to 
include dummy variables representing the 
year in which employment is measured. 
These year effects absorb any aggregate de- 
mand shocks (or supply-side shocks) that 
affect all contracts in any given year. 

A second difficulty may arise if unex- 
pected changes in real wages during the 
term of a contract are immediately offset in 
subsequent negotiations. If this is the case 
then unexpected changes in real wages are 
inherently short-lived, and the presence of 
adjustment costs will substantially dampen 
the employment responses to such changes.? 
In the empirical analysis reported below I 
investigate the effect of real wage surprises 
on subsequent wage negotiations, and find 
that real wage rates in the subsequent con- 
tract move in the same direction as unex- 
pected wage changes occurring during the 
previous contract. Thus, unexpected changes 
in real wages generate persistent effects on 
the cost of contractual labor, and should be 
expected to generate significant employ- 
ment effects if the wage elasticity B is 
nonzero. 


B. Specification of the Employment 
Demand Function 


This section discusses the specification of 
the employment demand function (1) intro- 
duced above. An important limitation of the 
contract-based data set used in the empiri- 
cal analysis is the absence of firm-specific 
price or output data. Selling prices, inter- 
mediate input prices, and output indexes 
are only available at the three-digit industry 
level. Nevertheless, these industry-level data 
may be used as proxies for the underlying 
firm-specific variables. To derive an inter- 
pretation of the resulting specification, sup- 
pose that output is produced from three 


SUnexpectedly low real wage rates could induce an 
increase in overtime hours, however. 
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factors: labor, capital, and intermediate in- 
puts (raw materials and energy). Ignoring 
firm-specific constants, assume that the log- 
arithm of employment at a given firm in a 
particular industry in period t,n(t) is re- 
lated to the logarithm of firm-specific out- 
put, y(t), the logarithm of firm-specific 
wages, w(t), the logarithm of firm-specific 
nonlabor input prices, v(t), the user cost of 
capital in period t,r(t) (assumed to be con- 
stant across firms and industries), and an 
error term 7(t): 


(3) n(t)=B,w(t)+B2v(2) - 
—(B,+ B3)r(t) + oy(t) + a(t). 


This equation can be derived from an un- 
derlying Cobb-Douglas production function, 
or alternatively it can be interpreted as a 
loglinear approximation to an arbitrary em- 
ployment demand equation. The restriction 
that the elasticities of employment demand 
with respect to the three factor prices sum 
to zero is a consequence of the homogeneity 
of the cost function. This restriction implies 
that the equation is invariant to the deflator 
used to index wages and other factor prices. 
The magnitude of the coefficient o reflects 
the degree of returns to: scale: constant re- 
turns to scale implies o = 1. 

Let y(t) represent the logarithm of indus- 
try output in period t, and let W(t) and P(t) 
represent weighted averages of wages and 
intermediate input prices in the industry. 
Ignoring constants, assume that the loga- 
rithm of the firm's relative share of industry 
output is given by 


(4 y- PE) =y (w) -9()) 
+ y(() -9(0) +Ù). 


This equation can be derived by assuming 
that firms with identical Cobb-Douglas pro- 
duction functions act as price takers with 
respect to firm-specific selling prices. Alter- 


?Specifically, the Cobb-Douglas assumption implies 
that the output supply equation of the ith firm can be 
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natively, equation (4) can be interpreted as 
an approximation to the output share equa- 
tion arising from a simple differentiated 
product oligopoly model. In either case, the 


-error component (t) represents a mixture 


of firm-specific relative demand shocks and 
firm-specific productivity shocks. 

The combination of equations (3) and (4) 
leads to an expression for firm-specific em- 
ployment in terms of firm-specific wages, 
industry-level output and intermediate in- 
put prices, the aggregate cost of capital, and 
industry wages: 


(5) n(t) = (Bi * ey w C + BC) 
- (By + Ba)r(t)+ ejr) - ov P(E) 
(By t ey) Q) - 9(0) 
+ oó(t) - n(t). 


Under the assumption that increases in 
marginal cost at a particular firm lead to 
decreases in its relative share of industry 
output, the coefficients y, and y, are nega- 
tive. Thus, the elasticity of employment with 
respect to firm-specific wages, holding con- 
stant industry output, is larger in absolute 
value than the elasticity holding constant 
firm-specific output. Under the assumption 
of price-taking behavior the elasticity hold- 
ing constant industry output is the uncondi- 
tional elasticity of employment with respect 
to wages, allowing for the effect of changes 


written as 
y(t) = vw) + vov (t) + yar(t) 
(yit y2 + ya) (t) + 9C). 
where q(t) is the selling price for the output of the firm 
and 6(r) represents a total factor productivity shock. 
Define industry output as a geometric weighted aver- 
age of the outputs of the individual firms in the indus- 


try. Then aggregate output follows a similar equation, 
and equation (4) can be derived directly, with 


$(r) = 0(t)— (0) 


-(Qnt vai v3) (a0) - a0»). 
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in wages on the output supply decision of 
the firm. Under these same assumptions the 
predicted elasticity of employment with re- 
spect to industry wages is positive, reflecting 
the fact that as industry wages increase 
(holding constant the firm’s wage) the firm’s 
share of industry output will increase. 


C. Allowing for the Presence of ' 
Efficient Contracting 


The specification of equation (5) assumes 
that employment levels are determined by 
the firm taking the realized real wage rate 
as given. Except under very special circum- 
stances, however, unilateral employment 
determination by the firm fails to provide an 
efficient allocation of employment between 
contractual and extra-contractual opportu- 
nities.? For this reason, the empirical rele- 
vance of simple nominal contracting models 
has been sharply criticized (see Robert J. 
Barro, 1977, and Robert E. Hall, 1980, for 
example). The efficient determination of 
contractual employment is formally ad- 
dressed in the implicit contracting literature 
and also the more recent efficient contract- 
ing literature." The point of both litera- 
tures is that a jointly optimal contract (i.e., 
one that maximizes profit subject to a utility 
constraint for workers) determines employ- 
ment on the basis of a shadow wage that 
can differ from the contractual wage. A 
contracting model with homogeneous work- 
ers and unrestricted transfers between em- 
ployed and unemployed workers implies that 
the appropriate shadow wage is the marginal 
productivity of workers in their best alterna- 
tive job. Brown and Ashenfelter (1986) refer 
to this as the “strong form" efficient con- 
tracting hypothesis. Strong form efficiency 
implies that contractual wages (and contrac- 
tual wage rigidities) are irrelevant for em- 


See Robert E. Hall and David Lilien (1979). 

l'The implicit contracts literature is reviewed by 
Sherwin Rosen (1985) See Ian M. McDonald and 
Robert M. Solow (1981) for a theoretical treatment of 
efficient contracting and James N. Brown and Orley 
Ashenfelter (1986) for a concise summary of the empir- 
ical implications of simple efficient contracting models. 
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ployment determination and serve only to 
transfer income between employers and em- 
ployees.'? 

In light of the differing implications of 
efficient contracting models and models with 
unilateral employment determination, it is 
important to adopt an empirical framework 
that encompasses either possibility. In prin- 
ciple this can be accomplished by including 
a measure of the appropriate shadow wage 
of labor in the employment demand func- 
tion. A convenient assumption is that the 
shadow wage in an efficient contract is a 
weighted average of the observed contract 
wage and some measured alternative wage.'? 
This leads to a specification of employment 
demand that includes both the contract wage 
and the measured alternative wage. Even 
though this procedure cannot provide a 
definitive test against the efficient contract- 
ing hypothesis,^ it can provide useful evi- 
dence for or against the unilateral em- 
ployment determination model, when the 
alternative is a testable version of the effi- 
cient contracting hypothesis. 


II. Data Description and Measurement 
Framework 


The empirical analysis in this paper is 
based on a sample of 1293 contracts negoti- 
ated by 280 firm and union bargaining pairs 
in the Canadian manufacturing sector.!^ The 
available information for each contract in- 


PSee John M. Abowd (1989) for an attempt to test 
this hypothesis using stock market data on negotiating 
firms. 

This hypothesis can be motivated formally by as- 
suming that employees' preferences are represented by 
a Cobb-Douglas utility function defined over employ- 
ment and the difference between the contractual wage 
and the alternative wage: see Brown and Ashenfelter 
(1986, p. S54). 

See Thomas E. MaCurdy and John H. Pencavel 
(1986), especially p. S13. 

The data set only includes contracts with 500 or 
more workers. The sample is drawn from a public use 
tape distributed by Labour Canada. A complete de- 
scription of the sample and its derivation is presented 
in the Data Appendix. Louis N. Christofides and An- 
drew J. Oswald (1987) have also analyzed employment 
and wage data drawn from this source. 
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TABLE 1— CHARACTERISTICS OF ExPIRING CONTRACTS BY YEAR 
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Number Percent with 
of Average Escalation 
Year Contracts Duration Clause 
(1) Q) (3) 
1968 5 11.2 0.0 
1969 23 21.9 0.0 
1970 87 26.9 12.6 
1971 68 29.0 17.6 
1972 76 26.3 14.5 
1973 90 28.9 11.1 
1974 82 29.4 28.0 
1975 92 26.9 32.6 
1976 104 25.6 52.9 
1977 113 23.7 50.4 
1978 134 224 27.6 
1979 81 22.7 34.5 
1980 114 24.8 37.7 
1981 64 25.9 40.6 
1982 85 27.4 38.8 
1983 75 28.5 65.3 
Overall 1293 25.9 32.9 


Source: See Data Appendix. 


Real Employ- 
Wage ment Average 7 
Index? Index Forecast Error 
1971 = 100 1971=100 Prices Real Wages 
(4) (5) (6) (0) 
87.6 104.4 -0.1 0.1 
89.5 101.8 —0.9 0.9 
94.1 108.0 —2.0 18 
100.0 100.0 —4.6 3.8 
104.6 103.6 -3.0 2.8 
104.8 103.3 11 -14 
104.5 110.4 Ti —6.1 
106.2 105.9 7.0 —63 
115.2 108.1 19 -1.2 
118.9 105.7 -2.2 1.8 
118.5 105.6 0.1 —0.3 
118.2 112.8 1.1 —0.9 
117.8 1121 19 -1.2 
115.9 109.9 4.5 -3.3 
119.1 111.7 49 —3.8 
1222 104.6 -0.5 1.2 
- - 12 —0.9 


“Estimated wage index for level of real wages at the end of expiring contracts. 

"Estimated employment index for level of employment at the end of expiring contracts. 

“Average percentage difference between price level (or real wage) at the end of contract and expected price level 
(or real wage) as forecast at the signing date of contract. See text. 


cludes its starting (or effective) date, its 
ending (or expiration) date, and the base 
wage rate in each month of the contract.'® 
The number of employees covered by the 
agreement is only available at renegotiation 
dates. I associate this level of employment 
with the expiring agreement. Thus, each 
sample point consists of an end-of-contract 
employment observation and a series of 
wages, including the beginning-of-contract 
and end-of-contract wage rates. 

Some summary characteristics of the sam- 
ple are presented in Table 1. The expiration 
dates of the contracts span a 16-year period 
between 1968 and 1983, with relatively few 
contracts in the first 2 years. The average 
duration of the contracts is 26 months, al- 


‘©The base wage rate is typically the wage paid to 
the lowest-skill group covered by the collective bargain- 
ing agreement. An important assumption for the analy- 
sis in this paper is that variation over time in intracon- 
tract wage differentials is small enough to be safely 
ignored. 


though durations vary somewhat by year, 
with relatively shorter contracts in the mid- 
1970s. The fraction of contracts with escala- 
tion clauses shows a steadily increasing trend 
until the mid-1970s and then varies errati- 
cally, with an overall average of 33 percent. 

An indication of the trends in employ- 
ment and wages in the sample is provided 
by the indexes in columns (4) and (5) of the 
table." Real wage rates among expiring 
contracts show significant growth until 1977 
and then remain relatively constant. Aver- 
age employment shows no secular trend but 
reflects cyclical downturns in 1971, 1975, 
and 1983. 

The empirical strategy of this paper is to 
fit regressions based on equation (5) to end- 


"The wage and employment indexes represent esti- 
mated year effects from regression equations for con- 
tract-to-contract percentage changes in end-of-contract 
wages and employment. These indexes therefore con- 
troi for the composition of the set of expiring contracts 
in each year. 
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of-contract observations on employment and 
wages for each contract. Assuming that the 
employment demand function is homoge- 
neous of degree zero in factor prices, the 
analysis is invariant to the choice of defla- 
tors for wages and intermediate input prices. 
Given the nature of wage indexation clauses, 
however, it is particularly convenient to work 
with real wages deflated by the consumer 
price index. In the remainder of the paper, 
wages and industry prices are therefore ex- 
pressed as real variables, deflated by the 
consumer price index. 

The real wage rate at the end of each 
contract is measured directly. This rate dif- 
fers from its expectation as of the negotia- 
tion date of the contract by a component 
that depends on the indexation provisions 
of the contract and the deviation between 
actual and expected prices at the end of the 
contract. Following the notation above, let 
w*(T) represent the expected value of the 
logarithm of the real wage at the end of the 
contract. In a nonindexed contract, the log- 
arithm of the actual real wage rate at the 
end of the contract, w(T), is related to 
w*(T) by 


(6) w(T) =w*(T)—( p(T) - p*(T)). 


where p(T) represents the logarithm of the. 


consumer price index at the end of the 
contract, and p*(T) represents the parties’ 
expectation of p(T), formed T months ago 
at the negotiation date of the contract. 

In an indexed contract, unexpected 
changes in prices generate unexpected 
changes in real wage rates only to the extent 
that indexation is incomplete. For example, 
if an escalation clause increases nominal 
wages by e percent for each one percent 
increase in the consumer price index, then 
w(T) and w*(T) are related by 


(7) w(T)-w*(T)-(1- e) 
(p(T) - p*()). 

Although most escalation clauses in North 

American labor contracts do not specify a 


fixed elasticity of indexation, this equation is 
approximately correct when e is defined as 


SEPTEMBER 1990 


the marginal elasticity of indexation evalu- 
ated at the expected level of prices at the 
end of the contract. 

Given an estimate of the elasticity of in- 
dexation, &, and an estimate of the parties’ 
expected price level at the end of the con- 
tract, p(T), it is possible to decompose the 
real wage rate at the end of a contract into 
an estimate of its expected component, 
wT), and an estimate of its unexpected 
component: 


w(T) - (T) * (T), 
where 


&(T) 2w(T) * (1- 9(p(T) - p(T)). 


Using the definition of Y(T), the estimated 
unexpected component of real wages can be 
written as 


a(T)=u(T)+(é-e) 
x (p(T) - p*(T)) 
+(1-2@)( A(T) - p*(T)). 


This estimate differs from the true value 
u(T) by two terms: one that depends on the 
difference between the actual and measured 
elasticity of indexation (and is therefore 
identically zero in a nonindexed contract), 
and another that depends on the difference 
between measured price expectations and 
the parties' true expectations. Provided that 
the measurement errors in the indexation 
elasticity and the expected price level are 
orthogonal to unmeasured components of 
employment demand, however, these errors 
do not preclude the use of (T) as an 
instrumental variable for the level of wages 
at the end of the contract. 

In this paper I use a naive forecasting 
model to form estimates of the expected 
price level at the end of the contract, based 
on the average rate of inflation over the 12 
months prior to the negotiation date.!? This 


The forecasting equation predicts the one-year 
ahead inflation rate at the negotiation date t as 0.0144 
«0.7858 DP(t—12) where DP(t —12) is the actual 
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model was selected by comparing the non- 
contingent wage increases in the first year 
of 24-36 month nonindexed contracts to 
alternative forecasts of the 12-month infla- 
tion rate formed at the negotiation date of 
the contract. I have also experimented with 
more sophisticated forecasting equations 
and found few differences in the results. 
Since the forecasts are only used to form 
instrumental variables, the choice of an in- 
efficient forecasting model should not bias 
the empirical results. 

The other ingredient in the calculation of 
unexpected real wage changes is the elastic- 
ity of indexation e. Precise information on 
the actual indexation formulas in the sam- 
ple is not readily available. I therefore use 
the ratio of total escalated increases over 
the life of the contract to the total increase 
in consumer prices over the life of the con- 
tract as a rough estimate of e. This measure 
is reasonably accurate for contracts with no 
restrictions on the escalation formula. For 
contracts with restricted escalation formulas 
that delay the start of indexation or specify 
a maximum escalated wage increase, this 
measure introduces some noise into the cal- 
culation of (T). 

Column 6 of Table 1 reports the average 
forecasting errors in the end-of-contract 
price level. The average annual forecast er- 
ror is 1.2 percent, but it varies considerably 
by year, ranging from 7.0 percent for con- 
tracts expiring in 1974 and 1975, to —4.5 
percent for contracts expiring in 1971. As 
the formulas in equations (6) and (7) imply, 
forecasting errors in end-of-contract real 
wage rates are negatively correlated with 
the forecast errors in prices. The average 
forecast errors in real wages in column 7 of 
the table are close to mirror images of the 
associated price forecasting errors. Relative 
to the forecasting errors in prices, however, 
the forecast errors in real wages are damp- 
ened by the indexation provisions of the 





percentage change in prices over the preceding 12 
months, The two-and three-year-ahead inflation rate 
forecasts generated by this equation are 0.021 + 0.693 
DP(t — 12), and 0.026 - 0.6135 DP(t — 12), respectively. 
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escalated contracts. The average estimated 
elasticity of indexation among indexed con- 
tracts is 0.50, implying that the forecast 
errors in real wages among these contracts 
are about one-half as large as the corre- 
sponding forecast errors in prices.” 

The average forecast errors in end-of- 
contract real wages are also negatively cor- 
related with the employment index in col- 
umn (5): the correlation coefficient over 16 
annual observations is —0.54, and the re- 
gression coefficient of the employment in- 
dex on unanticipated real wage changes is 
— 0.70, with a standard error of 0.27. This 
provides some evidence that contractual 
employment outcomes are negatively re- 
lated to unexpected changes in real wages. 
By comparison, the employment index is 
positively correlated with the index of real 
wage levels in column (4). 

Contract-specific correlations between 
employment and wages are reported in 
Table 2. All the data in this table are mea- 
sured as changes from the expiration date 
of the previous contract, using the sample 
of negotiations described in Table 1. Also 
presented in the table are the correlations 
of employment and wages with two mea- 
sures of outside wages: the average real 
wage rate in the same (two-digit) industry, 
measured in the expiration month of the 
contract, and the average real wage for un- 
skilled nonproduction laborers in the same 
province, measured in the expiration year of 
the contract.”° Finally, the last two rows of 
Table 2 present the. correlations of employ- 
ment and wages with contract-specific mea- 
sures of unexpected price changes and un- 
expected real wage changes. 


The forecast error in end-of-contract real wages is 
—(1— e)p, where p is the forecast error in end-of-con- 
tract prices, and e is the elasticity of indexation. The 
average forecast error in real wages is therefore —(1— 
&)p + covariance(e, p), where e is the average elasticity 
of indexation and p is the average forecast error in 
prices. 

The provincial wage is measured from data col- 
lected annually by Labour Canada in its area wage 
survey. Data in this survey is collected by city. I have 
used the wage rate for the largest city in each province 
as a measure of the province-specific wage. See the 
Data Appendix. 
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TABLE 2—MEANS AND CORRELATION OF EMPLOYMENT AND WAGE CHANGES 
. BETWEEN CONSECUTIVE CONTRACT* 


Mean 


1. Employment 
(End of Contract) 

2. Real Contract Wage 
(End of Contract) 

3. Industry Wage 
(Expiration Month) 

4, Provincial Wage 
(Expiration Year) 

5. Unanticipated Change in 
‘Real Wages Over Contract? 

6. Unanticipated Caange in 
Consumer Prices Over 
Contract? 


0.052 
0.045: 
0.044 
— 0.004 


0.006 


Real 
Contract 
Employment Wage 
Standard (End of (End of 
Deviation Contract) Contract) 
— 0.017 0.201 1.00 —0.07 
0.075 — 0.07 1.00 
0.056 —0.04 0.59 
0.060 — 0.07 0.51 
0.060 —0.12 0.45 
0.069 0.13 —0.44 


*Sample size is 1293. All variables are measured as changes in logarithms between 


expiration dates of consecutive contracts. 


Percentage difference between real wage at end of contract and expected real 


wage forecast at signing date of contract. 


"Percentage difference between Consumer Price Index at end of contract and 
expected price index forecast at signing date of contract. 


These simple correlations reveal three 
features of the contract-level data. First, 
changes in employment are only weakly 
negatively correlated with changes in end- 
of-contract real wage rates. Second, the cor- 
relations between employment and outside 
wages are of similar magnitude to the corre- 
lations between employment and contract 
wages. Third, changes in employment are 
more strongly negatively correlated with 
changes in the unexpected component of 
real wages. Thus, the OLS estimate of the 
elasticity of employment with respect to 
contract wages is much smaller in absolute 
value than the corresponding instrumental 
variables estimate formed usiag unexpected 
changes in real wages as an instrumental 
variable. The OLS estimate is —0.19, with a 
standard error of 0.08, while the instrumen- 
tal variables estimate is — 0.70, with a stan- 
dard error of 0.18. As will be seen below, 
this pattern continues to hold when other 
covariates are added to the employment 
determination equation. 


III. The Effect of Previous Wage Rates on 
Subsequent Wage Determination 


As a preliminary step in the analysis of 
employment demand, this section presents a 
brief summarv of estimated wage equations 
for the sample of collective bargaining con- 
tracts described above. The purpose of this 
analysis is to identify any “spillover” effect 
from real wage rates at the end of one 
contract to wage rates in the next contract. 
A finding of significant spillovers implies 
that unexpected changes in real wages have 
persistent effects on the cost of contractual 
labor. A finding of insignificant spillovers, 
on the other hand, implies that these unex- 
pected changes are relatively short-lived. 
The degree of persistence in unexpected 
wage changes is important for assessing the 
magnitude of the effect that these changes 
will exert on employment determination. 

The analysis is based on two alternative 
measures of negotiated wages: the real wage 
at the start of the contract and the expected 
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average real wage over the term of the 
entire contract. In the presence of adjust- 
ment costs the wage at the start of the next 
contract is particularly relevant for employ- 
ment setting behavior in the last few months 
of an existing agreement. The expected av- 
erage real wage over the next contract gives 
a longer-term measure of the costs of con- 
tractual employment. 

A convenient statistical framework for 
analyzing the determinants of wages is a 
simple components-of-variance model of the 
form ` 


(8 wj= 


where wi; represents the measure of wages 
(either the real wage at the start of the 
contract or the expected average real wage 
over the life of the contract) for the jth 
contract of the ith firm, 6, represents a 
permanent firm-specific component of wage 
variation, x; represents a vector of deter- 
minants of wages (measured at the negotia- 
tion date), w(T);.., represents the real wage 
at the end of the previous contract, and Zi; 
represents a contract-specific component of 
variance. The parameters b and A can be 
estimated by taking contract-to-contract 
first-differences: 

(9) Aw; ; = Wij — Wij- 


= bAx; + AAw(T)ij-1+ Aj. 


9, + bx; + AW(T)ij-1t čij 


Ordinary least-squares estimates of this 
first-differenced wage equation may be in- 
appropriate, however, if there is any corre- 
lation between the real wage at the end of 
the (j — 1)st contract and the error compo- 
nent é; — jii in the first-differenced wage 
equation?! This problem is readily over- 
come by using instrumental variables for the 
Jagged change in ending real wage rates. 
Suitable instruments include the first-dif- 
ference in the unexpected component of 
ending real wages and any exogenous com- 


“This problem is similar to one of estimating the 
effect of a lagged dependent variable in a panel data 
model: see Douglas Holtz-Aitken, Whitney Newey, and 
Harvey S. Rosen (1988). 
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ponents of Ax; ,. First-differencing also 
introduces a moving average error compo- 
nent into consecutive wage observations 
from the same bargaining pair. The esti- 
mated standard errors and test statistics 
throughout this paper therefore allow for a 
first-order moving average error component 
among the observations from each bargain- 
ing pair, as well as for arbitrary conditional 
heteroskedasticity. l 
Estimation results for the first-differ- 
enced wage equation (9) are reported in 
Table 3. Columns 1-4 of the table report 
estimates using the real wage at the start of 
the contract as the measure of wage out- 
comes, while columns 5-8 report estimates 
using the first-difference of the expected 
average real wage rate over the life of the 
contract as the dependent variable.” The 
components of x;; include the regional un- 
employment rate and the real wage rate in 
aggregate manufacturing (measured in the 
effective month of the contract), a province- 


-© specific real wage rate for unskilled workers 


(measured in the effective year of the con- 
tract), and a set of unrestricted year effects 
for the effective date of the contract. The 
year effects capture a number of omitted 
factors, including a period of wage-price 
controls between 1975 and 1978. Their ad- 
dition provides a significant improvement in 
the fit of the wage equations, although they 
hardly affect the estimated coefficient of 
previous wages. I have also estimated wage 
equations that include industry-specific out- 
put and price variables. These are only 
weakly related to negotiated wages, how- 
ever, and their inclusion has virtually 
no effect on the reported coefficients in 
Table 3. 

Colunms 1 and 5 of Table 3 report OLS 
estimates of equation (9) for the two alter- 


2The expected average real wage in each month of 
the contract is estimated by formulas analogous to 
equations (6) and (7), using estimates of the expected 
price level in that month and estimates of the elasticity 
of indexation as described above. The expected aver- 
age real wage is an unweighted average of expected 
monthly rates sampled at six-month intervals through- 
out the contract period, starting in the first month of 
the contract. 
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TABLE 3—EsriMATED WAGE DETERMINATION EQUATIONS 
Expected Average 
Real Wage at Start of Contract Real Wage During Contract 
OLS. IV? OLS Ive 
© o o o © © 0 0 
1. Year Effects Yes Yes Yes Yes Yes Yes Yes Yes 
2. Regional Unemployment Rate —0.50 -0.45 -0.46  —0.46 -038 -0.44  Á -045 -0.47 
(0.12) (0.12) (0.12) (0.12) (0.12 (0.13) (0.13) (0.13) 
3. Real Wage in 0.04 0.11 0.11 0.10 0.40 0.30 0.31 0.26 
Manufacturing (010 — (01) (01D (012 (0.11) (0.12) (0.12) (0.12) 
5. Real Wage in Region 0.02 0.04 0.04 0.03 0.04 0.02 0.02 0.01 
(0.05) (0.04) (0.04) (0.05) (0.05) (0.05 (005 (0.05) 
5. Real Wage at End of 0.48 0.36 0.35 - 0.25 0.41 0.35 - 
Previous Contract (0.05) (0.05) (0.07) (0.05) (0.060) — (0.07) 
6. Expected Real Wage at - - - 0.46 - - - 0.36 
End of Previous Contract (0.08) (0.09) 
7. Unexpected Real Wage at - - - 0.41 - - - 0.43 
End of Previous Contract (0.06) (0.07) 
8. Change in Prices During - - — 0.01 - - - —0.05 - 
Previous Contract (0.03) (0.03) 
9. Standard Error 0.039 0.039 0.039 0.038 0.038 0.039 0.038 0.038 
10, Overidentification - 0.261 0.273 0.037 0.016 0.006 


0.489 - 
Test^ ] 


Note: Standard errors in parentheses. Sample size is 1293. All regressions include a (first-differenced) linear trend. 
The mean and standard deviation of the dependent variable in columns (1)-(4) are 0.050 and 0.066. The mean and 
standard deviation of the dependent variable in columns (5)-(8) are 0.043 and 0.061. Standard errors are corrected 
for first-order moving average error component and heteroskedasticity. 

*In columns (2), (3), (6), and (7), instrumental variables for real wage at the end of the previous contract include 
18-year effects, the real wage in manufacturing at the start of the previous contract and the unanticipated change in 
real wages over the previous contrezct. In columns (4) and (8) instrumental variables for expected real wage at the 
end of the previous contract include 18-year effects, the real wage in manufacturing at the start of the previous 
contract, and the change in consumer prices during the previous contract. 

Probability value of test for orthogonality of residuals and instruments. The statistic is distributed as chi-squared 
with 19 degrees of freedom in columns (2), (3), (6), and (7), and with 18 degrees of freedom in columns (4) and (8). 


native dependent variables, while columns 2 
and 6 report instrumental variables (IV) es- 
timates. These specifications suggest that 
negotiated wages are significantly positively 
related to the level of wages at the end of 
the preceding contract. The OLS estimates 
of the spillover coefficient A (in row 6) differ 
somewhat between the two alternative mea- 
sures of the dependent variable, although 
the IV estimates are closer together. The 
last row of the table reports overidentifica- 
tion test statistics for the instrumental vari- 
ables estimators. There is no evidence 
against the exclusion restrictions implicit in 
the IV procedure for the specification in 
column 2. The test statistic for the specifi- 
cation in column 6, on the other hand, 
presents mild evidence against these restric- 
tions. 


In columns 3 and 7 the change in prices 
over the preceding contract is introduced 
directly into the wage determination equa- 
tion. This addition permits a test of the 
hypothesis that aggregate price movements 
affect future wage determination only to the 
extent that they affect the level of real wages 
at the end of the preceding contract. The 
estimated coefficients in row 8 of the table 
provide no evidence against this hypothesis. 
Finally, the specifications in columns 4 and 
8 relax the assumption that the expected 
and unexpected components of the end- 
of-contract wage w(T);., have the same 
effect on subsequent wages.” Perhaps sur- 


3These equations are estimated using the change in : 
prices over the previous contract, the manufacturing 
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prisingly, there is no evidence against the 
restricted specification: the t-statistics for 
the hypothesis of equal coefficients for the 
expected and unexpected components are 
1.32 in column 4 and 1.22 in column 8. 
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These results suggest that unexpected . 


changes in wages have persistent effects on 
the costs of contractual labor. An unantici- 
pated 10 percent decrease in real wages 


leads to an approximately 3 percent lower . 


real wage throughout the following con- 
tract. Thus even in the presence of substan- 
tial adjustment costs, employment should be 
expected to 
changes in real wages, provided that the 
unilateral employment determination model 
is correct. 


IV. The Determinants of Contractual 
Employment 


This section turns to estimates of the 
employment demand function (5). As in the 
previous section, the framework for the 
analysis is a components-of-variance model 
for the logarithm of end-of-contract em- 
ployment in the jth contract of the ith firm 
(nj): 

(10) n=, + azt Bw(T) i+ ej. 

In this equation, y; represents a permanent 
firm-specific effect, z;, represents a vector 
of determinants of employment, measured 
at the end of the contract, w; (T) represents 
the real wage rate at the end of the con- 
tract, and e;; is a contract-specific distur- 
bance. Assuming that industry output and 
prices are used as proxies for firm-specific 
output and price data, the wage elasticity B 
in equation (10) is related to the underlying 
parameters of the employment demand 
schedule (3) and the relative output equa- 
tion (4) by B —.—(B, t cy,). Note that £ is 


wage at the effective date of the previous contract, and 
year effects for the effective date of the previous con- 
tract as instrumental variables for the expected and 
unexpected components of real wages at the end of the 
previous contract. 


respond to unanticipated '. 
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assumed to be constant across industries. 
Although this is unlikely to be true, the 
relatively small number of contracts in each 
industry makes it difficult to estimate pa- 
rameters other than the average demand 
elasticity across industries. Heteroskedastic- 
ity introduced by variation in f is taken into 
account in the calculation of the standard 
errors. 

Again, a convenient method for eliminat- 
ing the pair-specific effects is to take first- 
differences between consecutive contracts, 
yielding 
(11) An,,=aAz,,+ BAw(T) i; + ^e. 

In many previous studies, employment out- 
comes have been found to follow a partial 
adjustment equation of the form n,,=(1— 
p)n$ tun; where ng represents the 


i optimal level of employment i in the absence 


of adjustment costs, as given by an equation 
such as (5). Partial adjustment is readily 
accommodated within the framework of 
equation (11) by the addition of a lagged 
dependent variable. In the present context, 
however, consecutive employment outcomes 
are 20—36 months apart. Thus, the extent of 
partial adjustment is likely to be much 
smaller than that observed in quarterly or 
annual data. This issue is addressed more 
thoroughly below. 

Estimation results for the first-differenced 
employment equation are presented in Ta- 
bles 4 and 5. Following the discussion in 
Section I, Part B, the determinants of em- 
ployment include the three-digit industry 
input price index (deflated by the consumer 
price index), industry-level real output, and 
the end-of-contract real wage rate. Specifi- 
cations that add outside wage rates and a 
lagged dependent variable are presented in 
Table 5. The odd-numbered columns of 
Table 4 present estimated equations that 
include a linear time trend, while the even- 
numbered columns report estimates that in- 
clude a set of unrestricted dummy variables 
for the different expiration years in the sam- 
ple. I have not made any attempt to mea- 
sure the user cost of capital. On the as- 
sumption that capital costs are constant 
across manufacturing industries, variation in 
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TABLE 4—EsrTiMATED EMPLOYMENT DETERMINATION EQUATIONS 
OLS IV? Iv? 
(1) (2) (3) (4) (5) (6) (7) (8) 
1. Year Effects No Yes No Yes No Yes No Yes 
2. Real Industry Input Price 0.22 0.16 0.20 0.16 0.19 0.16 0.19 0.15 
(0.06) (0.08) (0.06) (0.08) (0.06) (0.08) (0.06) (0.08) 
3. Real Industry Output 0.20 0.29 0.22 0.28 0.23 0.28 0.23 0.28 
(0.07) (0.09) (0.07) (0.09) (0.07) (0.09) (0.07) (0.09) 
4. Real Industry Output 0.17 0.10 0.15 0.11 0.14 0.11 0.14 0.10 
(Previous Year) (0.06) (0.07) (0.07) (0.07) (0.06) (0.07) (0.07) (0.07) 
5. Real Wage at End of 0.15 0.02 0.28 | —045  -039  -051 —0.2 -0.40 
Contract (0.08) (0.10) (0.17) (0.35) (0.12) (0.29) (0.17) (0.42) 
6. Unexpected Inflation - - - - - - — 0.03 0.10 
During Contract (0.13) (0.20) 
7. Standard Error 0.196 0.194 0.196 0.195 0.196 0.196 0.196: — 0.195 
8. Test for Exclusion of E 0.003 - 0.006 - 0.004 - 0.004 
Year Effects ( p-Value) 
9. Overidentification - - - - 0.76 0.97 0.74 0.96 
Test® 


Note: Standard errors in parentheses. Sample size is 1293. All regressions include a (first-differenced) linear trend. 
The mean and standard deviation of the dependent variable are —0.017 and 0.201. Standard errors are corrected 
for first-order moving average error component and heteroskedasticity. 

“Instrumental variable for real wage at end of contract is the unanticipated change in real wages during the 


contract. 


"Instrumental variables for real wage at end of the contract include 18 year effects, the real wage in 
manufacturing at the start of the contract, and the unanticipated change in real wages during the contract. 
“Probability value of test for orthogonality of residuals and instruments. The test statistic is distributed as 


chi-squared with 19 degrees of freedom in all cases. 


the user cost of capital is absorbed by the 
trends and/or time effects in the empirical 
specification. The unrestricted year effects 
also capture any aggregate-level shocks 
(such as aggregate demand shocks or pro- 
ductivity shocks) that are shared by all con- 
tracts in a given year. 

In an effort to capture partial adjustment 
effects, and also to control for the fact that 
industry output is measured annually, the 
employment equations in Tables 4 and 5 
include industry output in both the expira- 
tion year of the agreement and the previous 
year. I have experimented with specifica- 
tions that also include wage rates and input 
prices in the year prior to the expiration 
date, but the effects of these variables are 
always poorly determined and small in mag- 
nitude. 

The first two columns of Table 4 present 
OLS estimates of the employment equation 
with and without dummy variables for the 
expiration date of the contract. Employ- 


ment is positively related to intermediate 
input prices and current and last year’s level 
of output. The elasticity of employment with 
respect to output (i.e., the sum of the coef- 
ficients of current and last years’ output) is 
substantially less than unity, implying in- 
creasing returns to scale in the framework 
of equation (5). The addition of the year 
effects results in a relatively small improve- 
ment in the fit of the employment equa- 
tions: the probablity value of an exclusion 
tests for the year effects is reported in row 8 
of the table. When the year effects are 
included, however, the estimated wage elas- 
ticity of employment demand falls to essen- 
tially zero. 

The estimated wage elasticity is substan- 
tially larger (in absolute value) when the 
end-of-contract wage rate is instrumented 
by the unanticipated change in real wages 
over the term of the contract. The results of 
this exercise are reported in columns 3 and 
4 of Table 4. Without year effects, the esti- 
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TABLE 5—EsTIMATED EMPLOYMENT DETERMINATION EQUATIONS 


OLS 





o) 
1. Year Effects Yes 
2. Real Industry Input Price 0.16 
(0.08) 
3. Real Industry Output 0.29 
. ; (0.09) 
4. Real Industry Output (Previous Year) 0.10 
; (0.07) 
5. Real Wage at End of Contract — 0.03 
(0.10) 
6. Real Wage in Industry 0.06 
(0.22) 
7. Real Wage in Region | - 
8. Lagged Dependent Variable - 
(Instrumented) 
9. Standard Error 0.194 


10. Overidentification - 
Test* 


IV? Iv? 
Q) 6) (4) 65) (6) 7) 
Yes Yes Yes Yes No Yes 
0.16 0.14 0.16 0.14 0.13 0.10 
(0.08) (0.08) (0.08) (0.08) (0.07 (0.09) 
0.29 0.27 0.28 0.27 0.20 0.25 
(0.09) (0.09) (0.09) (0.09) (0.07) (0.09) 
0.10 0.11 0.11 0.11 0.15 0.13 
(0.07) (0.07) (0.07) (0.07) (0.07) (0.08) 
—0.02 -0.56  -051 —0.56  —0.2 -0.58 
(0.10) (0.32) (0.31) (0.33) (0.22) (0.32) 
- 0.23 - 0.23 0.26 0.38 
(0.26) (0.26) (0.22) (0.25) 
— 0.03 - 0.04 0.06 - - 
(0.15) (0.16) (0.21) 
- - - - —0.13  —0.08 
(0.14) (0.15) 
0.194 0.196 0.196 0.196 0.193 0.194 
- 0.972 0.967 0.972 0.451 0.666 


Note: See note to Table 4. Standard errors in parentheses. 
*Instrumental variables for the real wage at the end of the contract include 18-year effects, the real wage in 
manufacturing at the start of the contract, and the unanticipated change in real wages during the contract. 
Estimated on subsample of 1107 observations. Mean and standard deviation of the dependent variable are 
— 0.015 and 0.0200, respectively. Instruments include the instrument set above plus the lagged value of industry 


output. 


Probability value of test for orthogonality of residuals and instruments. The test statistic is distributed as 
chi-squared with 19 degrees of freedom in columns (3)-(5), and 16 degrees of freedom in columns (6)-(7). 


mated elasticity rises from — 0.15 to — 0.28, 
although the estimated standard error rises 
proportionately. With year effects, the 
change in the point estimate is even more 
remarkable: from —0.02 to —0.45. Due to 
the imprecision of the IV estimators, how- 
ever, tests of the difference between the 
OLS and IV estimates are insignificant in 
either case. 

The specifications in columns 5 and 6 
attempt to reduce this imprecision by ex- 
panding the list of instrumental variables 
for the end-of-contract real wage rate to 
include the level of real wages in manufac- 
turing at the start of the contract and year 
effects for the signing date of the contract. 
The additional instrumental variables lead 
to a slight increase in the magnitude of the 
estimated wage elasticities and provide some 
increase in the precision of the estimates. 
Overidentification test statistics for the in- 
ternal consistency of the instruments are 
reported in row 9 of the table. In all cases 


these are below conventional significance 
levels. I have also estimated employment ' 
equations that use only the additional in- 
struments (i.e., excluding the unexpected 
change in real wages) to identify the effect 
of wages on employment. As the overidenti- 
fication statistics suggest, these estimates are 
very similar to those in Table 4. 

Even with the additional instrumental 
variables the estimated elasticity of employ- 
ment demand in column 6 is only signifi- 
cantly different from zero at the 10 percent 
level. Nevertheless, a test of the difference 
between the estimated demand elasticities 
in columns 1 and 5 is significant at the 1 
percent level, and a test of the difference 
-between the estimated elasticities in 
columns 2 and 6 is significant at the 10 
percent level. These results suggest that 
OLS estimates of the elasticity of employ- 
ment demand are positively biased. 

The final two columns of Table 4 present 
employment equations that include the un- 
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expected change in consumer prices during 
the term of the contract as an additional 
explanatory variable. These specifications 
provide a simple check on whether unex- 
‘pected price increases affect employment 
through the contractual wage, or whether 
there is a direct correlation between unex- 
pected inflation and employment demand.” 
Neither specification provides any evidence 
of a direct role for unexpected price 
changes. Nevertheless, the standard errors 
of the wage and price terms in column 8 are 
sufficiently large that one cannot rule out a 
direct effect of inflationary surprises on em- 
ployment demand.” Taken together with 
the other estimates in the table, however, I 
interpret the results in columns 7 and 8 as 
supporting the conclusion that price sur- 
prises affect employment determination 
solely through their effect on realized wages. 

The effect of outside wage rates on con- 
tractual employment is addressed in Table 
5. The theoretical analysis in Section I iden- 
tifies two alternative routes for this effect. 
On one hand, increases in average wages in 
the industry may have a positive effect on 
employment, reflecting the competitive ad- 
vantage implied by higher costs elsewhere 
in the industry. On the other hand, in- 
creases in wage rates representing the alter- 
native value of workers’ time may have a 
negative effect if employment is influenced 
by efficient contracting considerations. In an 
effort to distinguish between these hypothe- 
ses, I have included the industry average 
wage in columns 1 and 3 of the table, and a 
province-specific wage for unskilled laborers 
in columns 2 and 4 of the table. Both wage 
measures are included in column 5. 

The OLS estimates in columns 1 and 2 of 
Table 5 show no evidence of a role for 
either outside wage measure. When the 


247 is worth pointing out, however, that aggregate 
demand shocks (or any other variables that affect all 
contracts at a point in time) are absorbed by the year 
effects included in columns 4 and 6. 

At the suggestion of a referee, I estimated an 
employment equation that includes unexpected price 
increases (and year effects) and excludes wages. In this 
specification the estimated elasticity of employment 
with respect to unanticipated price increases is 0.23, 
with a standard error of 0.14. 
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contract wage is instrumented, however, the 
point estimate of the effect of the industry- 
specific wage rises substantially, while the 
estimated effect of the regional wage mea- 
sure remains close to zero. A similar pat- 
tern emerges in column 5 when both out- 
side wage measures are included. Given the 
imprecision of the estimated elasticities it is 
difficult to draw- strong conclusions from 
these results. Nevertheless, the estimates 
lend much stronger support to the view that 
outside wages belong in the employment 
equation as a proxy for the level of competi- 
tors’ costs than to the view that outside 
wages belong in the employment equation 
as a proxy for the shadow value of employ- 
ees’ time.” If the former view is taken 
literally, the point estimates in column 3 
suggest that the output-constant elasticity of 
employment demand with respect to wages 
is — 0.33, white the elasticity of output sup- 
ply with respect to an increase in wages is 
— 0.70.7 This estimate of the output-con- 
stant demand elasticity is in the midpoint of 
the range of estimates usually reported in 
the static employment demand literature 
(see Daniel S. Hamermesh, 1986, pp. 
451-54). 

The question of whether the estimated 
employment equations are robust to the in- 
clusion of lagged employment is explored in 
the last two columns of Table 5. Since the 
employment models are estimated in first- 
differences, and the covariance of consecu- 
tive changes in employment is biased down- 
ward by any measurement error, the lagged 
value of industry output is added to the list 
of instrumental variables, and lagged em- 
ployment and real wages are treated as 
jointly endogenous. The results show no evi- 


26 My 1986 paper and Stephen J. Nickell and Sushil 
Wadhwani (1987) report employment specifications that 
show a positive effect of outside wages, while Brown 
and Ashenfelter (1986) report positive effects in more 
than one-half of their specifications. 

Recall from equation (5) that the elasticity of 
employment with respect to wages is —(f,-F ey), 
while the elasticity of employment with respect to 
industry average wages is e y,. An estimate of o from 
column (3) of Table 5 is 0.39 (the sum of the coeffi- 
cients of current and last year's output). Using the 
other estimated coefficients from this equation leads to 
the estimates of B, and y, reported in the text. 
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dence of a role for lagged employment. As 
mentioned earlier, this probably reflects the 
20-36 month interval between consecutive 
observations in the data set. Over two or 
three years the effects of partial adjustment 
are likely to be much smaller than over an 
interval of a quarter or year. — 

The estimates in Tables 4 and 5 suggest 
two main conclusions. First, employment 
outcomes are negatively related to contrac- 
tual wage rates. Although the simple corre- 
lation between end-of-contract wage rates 
and employment is small and statistically 
insignificant, this is apparently a conse- 
quence of simultaneity bias. When unantici- 
pated real wage changes and/or other ex- 
ogenous variables are used as instrumental 
variables for the end-of-contract wage, the 
estimated wage elasticity is consistently neg- 
ative and stable in magnitude across alter- 
native specifications. Second, there is no 
evidence that employment is related to out- 
side wages in a manner consistent with sim- 
ple efficient contracting models. Even 
though employment is uncorrelated with re- 
gion-specific wage measures, it is weakly 
positively correlated with industry average 
wages. This positive correlation is consistent 
with the hypothesis that higher average in- 
dustry wages lead to improvements in the 
firm’s competitive position and increases in 
employment. 


V. Conclusions 


This paper presents new evidence on the 
role of nominal wage contracts in the union 
sector. An important feature of these con- 
tracts, emphasized by the simple macro 
models of Fischer (1977) and Taylor (1980), 
is the predetermined nature of nominal 
wages. Real wage rates at the end of a 
contract therefore contain unanticipated 
components that reflect unexpected changes 
in consumer prices and the degree of index- 
ation in the contract. The empirical analy- 
sis, based on a large sample of indexed and 


Bin principle, the coefficient of the lagged depen- 
- dent variable will differ, depending on the duration of 
the previous contract. In view of the imprecision of the 
estimated partial adjustment coefficients in Table 5, 
however, I have not attempted to address this issue. 
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nonindexed contracts, indicates that these 
unexpected real wage changes are associ- 
ated with systematic employment responses 
in the opposite direction. This suggests that 
nominal contracts play a role in the link 
between aggregate demand shocks and real 
economic activity, at least in the part of the 
economy covered by explicit nominal con- 
tracts. , 

Three other findings emerge from the 
empirical analysis. First, the contract-level 
correlation between employment and wages 
apparently reflects both demand and wage- 
setting behavior. Similar simultaneity prob- 
lems may arise in other studies of 
firm-specific employment and wage data. 
Second, unanticipated changes in prices are 
found to generate changes in real wages 
that spill over from existing labor contracts 
to subsequent agreements. Inflation sur- 
prises therefore have persistent effects on 
real wages in the union sector, in addition 
to their short-run effects on employment. 
Finally, the empirical results suggest that 
employment outcomes in union contracts 
are determined on a conventional down- 
ward-sloping demand schedule, taking the 
prevailing contract wage as given. There is 
no indication that employment is related to 
outside wages in a manner consistent with a 
simple model of efficient contracting. 


DATA APPENDIX 
I. Contract Sample 


The contract sample is derived from the December 
1985 version of Labour Canada's Wage Tape. This 
tape contains information on collective bargaining 
agreements covering more than 500 employees in 
Canada. Starting from the 2868 manufacturing con- 
tracts on the tape, I merged togetber contract 
chronologies between the same firm and union cover- 
ing different establishments, and eliminated contracts 
from bargaining pairs with fewer than four contracts. 
These procedures yield a sample of 2258 contracts 
negotiated by 299 firm and union pairs. Further infor- 
mation on the merging process and the characteristics 
of the resulting sample are presented in the Data 
Appendix to my 1988 paper and in Tables 1 and 2-of 
that paper. 
^ The employment data for this sample were then 
checked in two stages. First, the number of workers 
covered in each contract was compared to the number 
covered in the preceding and subsequent agreements. 
Second, in cases where the number of workers changed 
dramatically between contracts, the contract sum- 
maries in the appropriate issue of the Collective Bar- 
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gaining Review were consulted. In 238 contracts, the 
emplcyment counts recorded on the wage tape were 
found to be in disagreement with the counts reported 
in the Collective Bargaining Review. In these cases, 
counts from the published contract summaries were 
used. In cases for which the set of establishments 
covered by the contract changed over time, contracts 
with inconsistent coverage were deleted from the sam- 
ple. Of the 2258 contracts in the subsample of merged 
contracts, valid coverage data are available for 1813 
contracts (80.3 percent). Checking of the employment 
data was performed by Thomas Lemieux. I am ex- 
tremely grateful for his assistance with these data. 

In this paper, employment at the end of a contract 
Is measured by the number of workers covered by the 
subsequent agreement. Furthermore, the estimation 
procedures require information on employment and 
wage outcomes in the previous agreement and on vari- 
ous industry and aggregate data that are only available 
between 1966 and 1983. The saniple of contracts used 
in this paper therefore consists of the subset of con- 
tracts in the initial 2258 contract merged subsample 
that satisfy the following criteria: 

(a) Information on at least one previous contract 
is available in the sample. 

(b) Information on at least one subsequent con- 
tract is available in the sample. 

(c) The expiration dates of the current and previ- 
ous contract are after January 1966 and before Decem- 
ber 1983. 

(d) Valid employment data are available for both 
the current and preceding contract (i.e., valid counts of 
workers covered are available for both the current and 
subsequent contracts). 


Il. Aggregate and Industry-Level Data 


The following aggregate and industry-level data were 
merged to the contract sample. 

(a) Consumer price index, all items, 1981 = 100. 
January 1961 to November 1985: Cansim 1484000, 
from the 1985 Cansim University Base Tape. Decem- 
ber 1985 to June 1986: from the Bank of Canada 
Review, November 1986. 

(b) Average hourly earnings in manufacturing. 
January 1961 to March 1983: Cansim D1518, from the 
1983 Cansim University Base Tape. April 1983 to De- 
cember 1983: Cansim L5607, from the Bank of Canada 
Review, various issues. Data from April 1983 and later 
are multiplied by 1.04035 to correct for the revision in 
the establishment survey. 

(c Average hourly earnings of nonproduction 
production laborers, by province. Annual data on 
hourly earnings for selected occupations are available 
for major cities. I matched data for the following cities 
to their respective provinces: Halifax, St. John, Mon- 
treal, Toronto, Winnipeg, Regina, Edmonton, Vancou- 
ver. The wage rates used are listed as rates for “male 
general laborers" between 1966 and 1977, for "general 
laborers in service occupations" between 1978 and 
1981, end for "nonproduction laborers” between 1982 
and 1985. Data for 1966-72 are trom Wage Rates, 
Salaries, and Hours of Labour, 1966-1972 editions. 
Data for 1973—1986 are from Canada Year Book, vari- 
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ous editions. For contracts that cover two or more 
provinces, I used a weighted average of Montreal, 
Toronto, and Vancouver rates with weights of 0.35, 
0.55, and 0.10, respectively. 

(d) Unemployment rates, seasonally adjusted. For 
contracts in Quebec, Ontario, and British Columbia, I 
used the province-specific unemployment rates for ali 
workers. For contracts in other provinces, I used the 
national average unemployment rate. The series used 
were as follows: Quebec-Cansim D768478; Ontario- 
Cansim D768648; British Columbia-Cansim D769233; 
all others-Cansim D767611. Data for January 1966 
through November 1983 were obtained from the 1983 
Cansim Univezsity Base. Data for December 1983 were 
taken from the Bank of Canada Review, November 
1986. : 

(e) Industry selling prices, input prices, and out- 
put. Three-digit industry level annual data for 1961-71 
were taken from Statistics Canada, Real Domestic 
Product by Industry 1961-71. These data are classified 
by 1960 standard industrial codes (SICs). Data on a 
1971 SIC basis for 1971-83 were taken from the 1978 
and 1984 issues of Statistics Canada, Grass Domestic 
Product by Industry. The 1960 and 1971 SIC codes were 
then matched, and the price and output indexes spliced 
using the 1971 observations from the two sources. Of 
65 three-digit industries represented in the contract 
sample, there were a total of 31 for which three-digit- 
level data were not available on a consistent basis. For 
these industries, two-digit-level data were used. The 
publications report the value of gross output and im- 
plicit price indexes for gross output and intermediate 
inputs. These data were used to construct the value of 
real gross output (the measure of “output” used in this 
paper). Implicit price indexes for gross output and 
intermediate inputs were deflated by the annual aver- 
age consumer price index to obtain real selling prices 
and input prices used in the paper. 

(f) Industry average hourly earnings. Monthly 
two-digit industry-level average hourly earnings data 
for the period January 1961 to March 1983 were taken 
from the 1983 Cansim University Base. Earnings data 
are unavailable for two industries: knitting mills and 
miscellaneous manufacturing. For the former, I used 
earnings in clothing industries. For the latter, I used 
average earnings in all manufacturing. Wage rates for 
April through December 1983 were constructed by 
index-linking wage rates from the new establishment 
survey to the rates in the old survey using their values 
in March 1983. Earnings data from the new survey for 
March-December 1983 were taken from the 1985 Can- 
sim University Bese. 
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Long Swings in the Dollar: 
-Are They i in the Data and Do Markels Know It? 


By CHARLES ENGEL AND JAMES D. HAMILTON* 


The value of the dollar appears to move in one direction for long periods of time. 
We develop a new statistical model of exchange rate dynamics as a sequence of 
stochastic, segmented time trends. We reject the null hypothesis that exchange 
rates follow a random walk in favor of our model of long swings. Our model also 
generates betier forecasts than a random walk. The specification is a natural 
framework for assessing the importance of the “peso problem" for the dollar. We 
nonetheless reject uncovered interest parity. (JEL 431) 


Why did the dollar rise so dramatically in 
the early 1980s only to fall precipitously 
afterward? Explanations have focused on 
such factors as the effects of U.S. monetary 
and fiscal policy on real interest rates 
(Jeffrey Frankel, 1988, and Martin 
Feldstein, 1986), lower capital taxes (Olivier 
Blanchard and Lawrence Summers, 1984), 
or a "safe haven" effect (Michael Dooley 
and Peter Isard, 1985). 

Important features of the dollar's move- 
ments are difficult to reconcile with these 
explanations under the dominant models of 
exchange rate determination. Figure 1 plots 
the number of U.S. dollars required to 
obtain a German mark, French franc, 
or British pound over the period 1973: I- 
1988:1.! One is tempted to share Feldstein's 
(1988, p. 21) summary of these data: “the 
dollar has experienced three big swings." 
The first of these is marked by a sustained 
rise of foreign currencies against the dollar; 
between the beginning of 1977 and the end 
of 1979, the mark gained 33 percent against 
the dollar, the franc gained 21 percent, and 


*Department of Economics, University of Virginia, 
Charlottesville, VA 22901. This material is based upon 
work supported by the National Science Foundation 
under grant no. SES-8720731. We are grateful to Karen 
Lewis, Anthony Rodrigues, and anonymous referees 
for helpful suggestions, and to the Federal Reserve 
Bank of Richmond for assistance with obtaining the 
data for this project. 

'The data are normalized so that 1973: IH = L 0 for 
all three currencies. 
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the pound 26 percent. This was followed by 
a five-year surge in the dollar, at the end of 
which these three European currencies fell 
60-90 percent (logarithmically) against the 
dollar. Early in 1985, foreign currencies once 
more began to rise, gaining 50—70 percent 
against the dollar by the end of 1987. 

The apparent long swings in the exchange 
rate pose important challenges for existing 
theory: In Rudiger Dornbusch's (1976) 
model, a monetary or fiscal policy change 
that drives up real interest rates results in a 
one-time upward jump in the value of the 
dollar. The dollar is then supposed to de- 
preciate steadily, so as to equate expected 
returns across countries. Yet as Dornbusch 
himself noted in 1983, 


The [overshooting] model for the real 
interest rate does well in explaining 
that a rise in U.S. interest rates should 
lead to an appreciation of the real 
exchange rate. But it fails when it pre- 
dicts that the real exchange rate should 
also be depreciating. That has not in 
fact been happening, and a theory is 
needed that will explain why the dollar 
—real or nominal—is both high and 
stuck. [p. 83] 


Indeed, the picture seems to have been 
even worse than Dornbusch painted it—the 
dollar was high and rising for three 
years prior and two years subsequent to 
Dornbusch's remarks. Accounting for the 
gradual, sustained fall in the dollar begin- 
ning in 1985 in a way that is consistent with 
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the explanation given for its rise is equally 
problematic.? 

. A further aspect of the apparent long 
swings that causes difficulty for theories of 
the exchange rate is that during the period 
of the strengthening dollar, forward ex- 
change rates (in dollars per unit of foreign 
currency) were consistently above the spot 
exchange rate. If the forwarc rate reflects 
expectations of future spot rates, then the 
market appeared to believe over a long pe- 
riod of time that a depreciation of the dol- 
lar was imminent. Yet the dollar continued 
its climb upward until the end of 1984. It 
could be argued that this forward rate be- 
havior represents an example of William 


2Changes in nominal price differentials between 
countries were small over this period compared to 
changes in nominal exchange rates. Thus the real and 
nominal exchange rates exhibit essentially the same 
patterns (see Michael Mussa, 1986, on this point). This 
fact poses additional challenges for theory. In our 
empirical analysis we focus on the dynamics of nominal 
exchange rates rather than real exchange rates, When 
we extend the process to a bivariate system including 
the nominal interest differential, this permits us to 
obtain a clean parameterization for testing uncovered 
interest parity without having to commit ourselves to a 
model of price level expectation. 


Krasker's (1980) “peso problem." If in- 
vestors perceived a small probability of a 
large depreciation, then we might see a 
forward rate in excess of the current spot 
rate for a sustained period of time. 

For these reasons, it seems useful to for- 
malize the concept of long swings in the 
exchange rates. What does one mean by 
long swings, and what magnitudes are plau- 
sibly associated with the population param- 
eters? Are long swings a systematic part of 
the process that generated the data in Fig- 
ure 1, or a pattern imposed by the eye on 
the directionless drift of a random walk? If 
long swings are an accurate description of 
population dynamics, what sorts of expecta- 
tions on the part of foreign exchange specu- 
lators are consistent with this process? Ad- 
dressing these questions can provide us with 
a systematic basis for discussing the issues 
raised above. 

The model we investigate is a special case 
of that introduced in James Hamilton 
(19892)? The basic idea is to decompose a 


Graciela Kaminsky (1988) has also fit Hamilton’s 
model to exchange rate data. She uses monthly data on 
the pound, whereas we investigate quarterly data on 
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nonstationary time-series into a sequence of 
stochastic, segmented time trends. Specifi- 
cally, we model any given quarter’s change 
in the exchange rate as deriving from one of 
two regimes, which could correspond to 
episodes of a rising or falling exchange rate, 
respectively. The regime at any given date is 
presumed to be the outcome of a Markov 
chain whose realizations are unobserved by 
the econometrician. The task facing the 
econometrician is to characterize the two 
regimes and the law that governs the transi- 
tion between them. These parameter esti- 
mates can then be used to infer which 
regime the process was in at any historical 
date and provide forecasts for future values 
of the series. 

Our maximum likelihood estimates corre- 
spond closely to the visual impressions of 
Figure 1. In regime 1 the mark is rising 4 
percent per quarter against the dollar, the 
franc 3.3 percent, and the pound 2.6 per- 
cent. Regime 2 is associated with quarterly 
declines in the foreign currencies of —1.2 
percent, —2.7 percent, and —3.8 percent, 
respectively. A given regime is likely to per- 
sist for several years, and the econometri- 
cally imputed historical change points are 
close to those the eye is tempted to draw 
directly from Figure 1. 

We perform both Wald tests and like- 
lihood ratio tests that compare the null 
hypothesis that exchange rates follow a 
martingale with the segmented trends alter- 
native. In every test but one (the likelihood 
ratio tests for German data), we reject the 
martingale hypothesis. The segmented 
trends model reduces the within sample 
mean forecast error by 9-14 percent at 
horizons from two quarters to a year for all 





the pound, mark, and franc. She does not explore the 
hypothesis of uncovered interest parity and does not 
devote as much detail to analysis of the random walk 
hypothesis. She extends Hamilton's model to a case 
where information besides realized exchange rates 
is available on the state of the process. Stephan 
Schulmeister (1987) has also documented the "long 
swings" phenomenon on the basis of nonparametric 
statistical analysis. 
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three currencies, relative to a random walk 
specification. Comparable improvements 
characterize the post-sample forecasts at 
horizons from one to two quarters. We con- 
clude that long swings in the exchange rate 
may well be a real feature of the data-gen- 
erating process. 

In exploring the second question posed 
by our paper—whether markets perceive 
these swings—we investigate the hypothesis 
of uncovered interest parity. This hypothe- 
sis holds that the nominal interest differen- 
tial between two countries forecasts future 
exchange rate changes. This is essentially 
equivalent to the claim that the three-month 
forward exchange rate is a rational forecast 
of the future spot exchange rate. We find no 
evidence to support this hypothesis in the 
data. We conclude that either (a) investors 
did not know the population parameters of 
the long swings model that generated the 
historical data, as our rational-expectations 
calculations assume, or (b) uncovered inter- 
est parity does not hold. Big differences in 
the volatility of exchange rates between the 
two regimes make it possible that (b) is due 
to risk aversion on the part of foreign ex- 
change speculators. 

The plan of the paper is as follows. Sec- 
tion I sets out the basic model we use to 
formalize the long swings hypothesis. Sec- 
tion II characterizes our estimation proce- 
dure. Empirical results are presented in 
Section III, while Section. IV analyzes the 
hypothesis of uncovered interest parity. 
Conclusions are offered in Section V. 


I. A Model of Stochastic Segmented Trends 


Our model postulates the existence of an 
unobserved variable (denoted s,) that takes 
on the value one or two. This variable char- 
acterizes the "state" or "regime" that the 
process was in at date f. When s, 1, the 
observed change in the exchange rate y, is 
presumed to have been drawn from a 
N(u,,0; ) distribution, whereas when s, — 2, 
y, is distributed N(1,, 02); thus when s, — 1, 
the trend in the exchange rate is x ,, whereas 
when s, = 2, the trend is p>. 

We further postulate a Markov chain for 
the evolution of the unobserved state vari- 
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able: 
(1) p(s, 71s, i-71)- Pu 


p(s, = 2|5,., =1) =1-py 


\ 


p(s,=1|s,_,= 2) = 1— py 
p(s, 72|5 192) = Dy. 


The process for s, is presumed to depend 
on past realizations of y and s only through 
$, 4. 
Note the variety of behavior that the 
model allows; in particular, we do not im- 
pose that exchange rates are described by 
long swings. For.example, there can be 
asymmetry in the persistence of the two 
regimes—upward moves could be short but 
sharp (u, large and positive, p,, small), 
whereas downward moves could be gradual 
and drawn out (u, negative and small in 
absolute value, p,, large). Alternatively, the 
exchange rate change this period could be 
completely independent of the state that 
prevailed last period, as in a random walk, 
if p,,—1-— p». A third possibility is the 
long swings hypothesis, which we represent 
as the claim that u; and u, are opposite in 
sign and that values for both p; and p; 
are large. 

Our model resembles a standard proba- 
bility distribution that is called a “mixture 
of normal distributions." This distribution is 
a superposition of two (or more) simple 
normal distributions. A histogram of data 
drawn from such a distribution would repre- 
sent the sum of two overlapping bell-shaped 
curves. The parameters of the distribution 
would be the mean and variance of each of 
the simple normal distributions, and a 
weight given to the first distribution to rep- 
resent the fraction of realizations that were 
likely to have been drawn from it. One 
could use these parameters to calculate the 
probability that any given observation came 
from the first distribution. The difference 
between our model and this mixture of nor- 
mals is that the draws of y, in our model 
are not independent. When we infer the 
odds that a particular y, comes from the 
first distribution, that probability depends 
on the realizations of y at other times. 
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II. Maximum Likelihood Estimation of 
Parameters : 


The probability law for the data {y,} is 
summarized by six population parameters, 


§= (41, 42,01,92; Pits Pay. 


These parameters are sufficient to describe 
(a) the distribution of y, given s,, (b) the 
distribution of s, given s,_, in equations 
(1), and (c) the unconditional distribution of 
the state of the first observation: 


(2) p(s,;=1;8) =p 
= (1— po) ; 
(1- py) * (1- pop)’ 





of course p(s,—2;0)—1-— p. The joint 
probability distribution of the observed data : 
for a sample of size T (y,,...,y7) along 
with the unobserved states (s,,..., Sy) is then 


(3) poss Yosdpossgil) 
= p(yrlsr;0) :p(s71s5.,;0) 


"P(Yr-1lSr-1380) p(s7 1157 2:8) ...- 
p(5315,;8): PC y115158) p(5,:0). 
The sample likelihood function could be 


thought of as the summation of (3) over all 
possible values of (s,,..., 57): 


(4) Pp(y..»p:9) 
2 


= } Sar 


s;=1 


2 
DB(Yy s YT S1- S730). 


sp=l 


In practice we use Hamilton’s (1989a) sim- 
pler algorithm for evaluation of (4) that 
does not require 2" summations. 

Given knowledge of the population pa- 
rameters 8, it is straightforward to charac- 
terize the probability that the process was in 
some particular regime s, at date £ on the 
basis of information available at the time 


pCsIly, Y0). 


We refer to this as the “filter” inference 
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about the probable regime at date f£. Alter- 
natively, one can use the full sample of 
ex post available information (y,,..., Yr) to 
draw an inference about the historical state 
the process was in at some date t: 


3 35:0), 


which we refer to as the “smoothed” infer- 
ence about the regime at date £. 

Note that unlike the model of a mixture 
of normal distributions in which the y, are 
independent, these probabilities depend at 
each time on y's that occur at other times. 
For example, if s,_,=1 and p, is high, 
then y, is more likely to have been gener- 
ated from distribution 1; on the other hand, 
if s, ., — 2 and p» is high, then y, is more 
likely to have been drawn from distribu- 
tion 2. 

Hamilton (forthcoming) showed that 
first-order conditions for maximization of 
(4) with respect to 8 characterize the MLE 
6 as satisfying 


psy... 


E yeb(s = yy yr;8) 
A =1 
(5) â=- F R 
Y p(s, = lys... y7;9) 
t=] 
j=1,2 
(6) o = 
T : 3 : 
L (y, - &) “p(s, = Hs ¥738) 
t=1 č ` 
iss = j| Y. 7:9) 
j=1,2 


(7) 


b 
E ^ 
| X p(s. stir snl) 


y 
N 


T 
+{ E ols- i Hyi». 417,9) 6 
t=2 


- ($= 1p ¥0s8)} 
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(8) By ES 


(x Y p(s, =2, S,— 172i. 


t=2 


^») 
z A 
+| Ep6- = ly py. 571;89)- 6 
t=2 
eps cbr rid) 


Consider first the intuition behind equa- 
tion (5). Suppose that the econometrician 
knows with certainty which observations 
came from regime 1 and which from regime 
2. Then p(s, =1|y,,.. 5,0) is either one 
or zero for all observations, and the esti- 
mated mean for regime 1, f,, is simply the 
average value of y, for those observations 
known to come from regime 1 (that is, the 
average of y, for those dates ¢ for which 
p(s, 7 l|y,,.. Yr; 8) = 1). If the designation 
of regimes is not known with certainty, then 
an observation £ whose smoothed probabil- 
ity of coming from regime 1 is 0.3 (p(s, = 
1|y1,-+-,¥738) = 0.3) is given a weight of 0.3 
in constructing the estimate of u, and of 0.7 
in constructing the estimate of pw. 

Similarly, the variance imputed to regime 
1 (6? in equation 6) is a weighted sum of 
squared deviations of the observations 
around the imputed population mean (£,), 
with weights again proportional to the prob- 
ability that any date f's datum was indeed 
generated from regime 1. 

Finally, the estimate of the Markov tran- 
sition probability (7) is again best under- 
stood by first considering the case where 
designation of regimes is known with cer- 
tainty (p(s, = 1|y,,..., y7;0) 2 1 or 0). Then 
the estimated Markov transition probability 
(Ê) is essentially the number of times the 
transition was made from 1 to 1 as a frac- 
tion of the number of times the process had 
been in state 1 the previous period. In addi- 
tion, there is a slight adjustment in the 
denominator for initial conditions. If the 
process seems to have been in state 1 at 
date 1 (p(s, — 1|y,,..., 7; 0) large), and yet 
an ergodic draw from the Markov process 
(1) is unlikely to be in state 1 (5 in equation 
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2 is small), then the adjustment favors 
choosing a larger value for f£. 

We found solutions to equations (5)-(8) 
by using the EM algorithm developed in 
Hamilton (forthcoming). 

A well-known problem (for example, B. S. 
Everitt and D. J. Hand, 1981) with estimat- 
ing parameters for i.i.d. mixtures of normal 
distributions is that a singularity in the like- 
lihood function arises when, for example, 
the mean of regime 1 is imputed to equal 
the value of the realization of the first ob- 
servation in the sample (u,— y;) and the 
variance of regime 1 is permitted to vanish 
(o, — 0). At such a singularity, the likeli- 
hood function (4) blows up to infinity. This 
paper follows Hamilton (1988b) in incorpo- 
rating a Bayesian prior for the parameters 
of the two regimes, replacing (5) and (6) 
with 


: 
E yc p(s, = lys... 9738) 
OE > 
v+ Y p(s, = jlyy-. s Yr30) 


1 





T 
a (1/2): E p(s,=Jly1,---,¥738) 


t=1 


: a 
x|B- 7/2): X -Y 


t=1 
‘p(s, = jiyi Yriô)+ (1/2) v (AY . 


This Bayesian approach reproduces the 
MLE as a special case of the diffuse prior 
v — a — B — 0. In general, (9) shrinks ĝ; to- 
ward zero for j—1,2, as if one had, in 
addition to the observed data (y,,...,¥7), v 
additional observations from each regime 
that took on the value zero. Equation (10) 
adjusts ó? toward (8/a), as though one 
had 2« observations relevant toward this 
adjustment. Equations (7) and (8) are left as 
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is. The prior thus shifts the MLE estimates 
in the direction of concluding that there is | 
no difference between the two regimes. 

A numerically equivalent way to think 
about this prior is that one is seeking to 
maximize not the likelihood function (4) but 
rather the generalized objective function 


(11) z(0) = log p(y,,...,¥738) 
- [(v-43)/(20?)] 
—[(v-43)/(202)| - «log of 
-alogo? - B/ai — B/oz.. 


Unlike the likelihood function (4), the sin- 
gularities described above are not a feature 
of the objective function (11) for a, B, and 
v » 0. Monte Carlo simulations reported in 
Hamilton (1988b) suggest that very modest 
priors can consistently improve mean 
squared errors. 


III. Empirical Results 
A. Maximum Likelihood Estimates 


The raw data for this project are an arith- 
metic average of the bid and asked prices 
for the exchange rate (in dollars per unit of 
foreign currency) for the last day of the 
quarter, beginning with the third quarter of 
1973 and ending with the first quarter of 
1988. The data are expressed in units of 
percentage change, denoted y^, yF®, and 
yUK 4 
We estimated the parameter vector @ for 
each currency in isolation from the others. 
Table 1 reports maximum likelihood esti- 
mates; the Appendix provides further de- 
tails. 


“All series were taken from the data banks compiled 
by Data Resources, Inc., as of June 1988. The raw data 
have the DRI series names WGCOOA, WGCOOB, 
FRCOOA, FRCOOB, UKCOOA, and UKCOOB. 
Natural logarithms were taken. The data were then 
first-differenced zrd multiplied by 100 to express in 
units of percentzge change. The resulting quarterly 
series (y"6, yF^, and yUK) run from 1973:IV to 
1988: I. 
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TABLE 1—EsriMATES Fit TO INDIVIDUAL CounTRY DATA, 
Yı = €,— e, t= 73:IV To 88:1, 
e, = 100 Times THE Loc or THE EXCHANGE RATE 


(in DOLLARS PER UNIT OF FOREIGN CURRENCY) 




















Parameter Germany France UK 
m 3.987 3.256 2.627 
(1.230) (0.967) (0.872) 
m —1.183 -2712 -3.752 
(1.480) (1.367) (1.139) 
Pu 0.848 0.822 0.927 
(0.122) (0.105) (0.057) 
Pa 0.928 0.908 0.913 
(0.066) (0.063) (0.073) 
of 17.652 9.991 16.918 
(9.351) (5.001) (4.660) 
o2 42.166 36.921 20.247 
(11.242) (10.252) (5.841) 
b r 0.322 0.342 0.542 
PAS, = Uy)... yr; 8) 0.004 0.000 0.373 
Note: Standard errors are in parentheses. 
1.2 
P 1.0 
R 
H .8 
s .8 
s zu 
* .2 
0.0 
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FIGURE 2 


The maximum likelihood estimates asso- 
ciate state 1 with a 4 percent quarterly rise 
in the German mark, a 3.3 percent rise in 
the franc, and a 2.6 percent rise in the 
pound. In state 2 the currencies fall by 
—1.2 percent, —2.7 percent, and —3.8 per- 
cent, respectively, with considerably more 
variability in the exchange rate apparent in 
state 2 than in state 1. 


The bottom panel of Figure 2 plots the 
exchange rate for the German mark (in 
$/mark). The top panel plots the smoothed 
probability that the process was in regime 2 
at each date in the sample; that is, p(s, — 
2]yl6,..., y» 0;0") is plotted as a func- 
tion of t. This inference uses the full sample 
of observations for Germany Cy6,..., y¥) 
and the maximum likelihood estimates of 
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FIGURE 4 


parameters 6"€ to draw an inference about 
the state of the process at each date t. The 
dates at which the econometrician would 
conclude that the process had switched 
between regimes (based on p(s, = 
2]yT6,..., y 29;0"9) 2 0.5) are shown as 


vertical bars. Similar diagrams for France 
and the U.K. appear as Figures 3 and 4. 

The estimates in Table 1 show that move- 
ments in the exchange rate are character- 
ized by long swings. The point estimates of 
p, range from 0.822 to 0.927, while the 
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estimates of p}, go from 0.908 to 0.928. 
These probabilities indicate that if the sys- 
tem is in either state 1 or state 2, it is likely 
to stay in that state. Inspection of Figures 
2-4 shows that by our estimates the switches 
between states are infrequent. All of the 
currencies were in a state of appreciation of 
the dollar (that is, they were in state 2) from 
1980-1984, and were in a state of deprecia- 
tion of the dollar (state 1) from the end of 
1984 to 1987. Thus, our model of long swings 
tends to match closely what one might be 
led to believe from casual inspection of Fig- 
ure 1.5 

States 1 and 2 are differentiated not only 
by their means but also by the variances of 
the conditional distributions. The exchange 
rate seems to be much more variable when 
the dollar is appreciating. For the mark and 
franc, our estimates show that the dollar 
entered the appreciation stage in the mid- 
dle of 1987. This assessment is based on the 
unusual volatility in the exchange rate dur- 
` ing that year. 

It is straightforward conceptually to gen- 
eralize this approach to vector processes y, 
(see Hamilton, 1988b). Here we posit that 
y,ls, ~ N(ps,, Qs,). Equations (7), (8), and (9) 
continue to hold with y, and u; interpreted 
as the corresponding vectors. Equation (10) 
is replaced by 





(12) Q, 
p 1 
a + (1/2): Xs = jb... om ô) 
X[A +(1/2)- Y Gy, — & Xy, — ÀY 


t=] 
‘p(s, = jly,..-.,¥73 8) 
+(1/2)-v- fifi] 


where (a, A) is a multivariate generalization 
of (a, B) based on the Wishart distribution. 


*Kaminsky (1988) fit Hamilton’s model to monthly 
data on the pound. She assumed constant variances 
and arrived at parameter estimates for the means and 
transition probabilities, as well as inference about his- 
torical switch points, that are comparable to those we 
find for quarterly data. 
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Unfortunately; we had little success in using 
these equations to fit all three currencies to 
a process driven by a single scalar state 
variable s, The estimates did not corre- 
spond well with the individual inferences of 
any of the three currencies. The behavior of 
individual exchange rates is determined, of 
course, not only by events in the United 
States but also by events in each of the 
corresponding countries. It appears that 
treating these three exchange rates as a 
group is inappropriate because country- 
specific developments played an important 
role in the evolution of exchange rates in 
the 1970s. For this reason, we proceed in 
our analysis of each of the three countries 
in insolation from the others. 


B. Testing the Null Hypothesis That 
Exchange Rates Follow a Random Walk 


An alternative to the segmented trends 
model is the simple random walk. Michael 
Mussa (1979), Richard Meese and Kenneth 
Singleton (1982), Meese and Kenneth 
Rogoff (1983a, 1983b), and Francis Diebold 
and James Nason (forthcoming) have all 
produced evidence that the log of the ex- 
change rate follows a random walk. David 
Hsieh (1989), however, found evidence con- 
sistent with both the earlier random walk 
conclusions and the predictions of our 
model, asserting that while there was little 
or no linear serial dependence in the log of 
the change in daily exchange rates, there 
seems to be general nonlinear serial depen- 
dence. 

There are some knotty methodological 
problems in testing the null hypothesis that 
exchange rates follow a random walk against 
the segmented trends alternative. If one 
views the null hypothesis as the claim that 
H= k and o, 05, then under the null 
hypothesis, the parameters p,, and p are 
unidentified. Moreover, at the constrained 
MLE 


T 
é$-62- Y, (v - 3) 7T |, 


t=1 
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TABLE 2—TEsrs OF THE NULL HvporHesis THAT EXCHANGE RATES FOLLOW 
A MARTINGALE AGAINST THE ALTERNATIVE OF SEGMENTED TRENDS 








Hj: pu 71-22 Hj: by = Me 
Likelihood Likelihood 
Country Wald Test Ratio Test Wald Test Ratio Test 
Germany 24.70 2.49 8.64 3.00 
(0.00) (0.11) (0.00) (0.08) 
France 29.45 5.70 12.65 6.10 
(0.00) (0.02) (0.00) (0.01) 
U.K. 61.26 6.15 25.80 8.85 
(0.00) (0.01) (0.00) (0.00) 








Note: All statistics are asymptotically y7(1). Asymptotic p-values are in parentheses. 


the derivative of the likelihood function with 
respect to u, or o is identically zero. These 
difficulties combine features of the prob- 
lems discussed by Mark Watson and Robert 
Engle (1985) and Lung-Fei Lee and Andrew 
Chesher (1986). The information matrix is 
singular under the null, and the standard 
regularity conditions for establishing asymp- 
totically valid tests of Hy do not hold in this 

case. 

In this paper we sidestep these issues by 
focusing on the following slightly more gen- 
eral null hypothesis: 


Hg: DPuc7l-Pa 
Bi É ha 


9, $ 05. 


Note that under Hj, the distribution of s, is 
independent of s,. ,; [from (1), the probabil- 
ity that s,=1 is p,, regardless of whether 
S,-1=1 or 2]. Changes in the exchange rate 
under Hj thus comprise an i.i.d. sequence 
with individual densities given by the follow- 
ing mixture of two normals: 





2 
(82 Pu T -(y - #1) 
PA 2T0, P 20$ 
(1- p) (723) 


We can then hope to test Hj against the 
alternative that p;,*1-— pj, , using stan- 


dard distribution theory, since under H4 the 
parameters Hi, 42, 0), 05, Pip and ps; are 
all identified. 

We report two tests of Hj, the fist being 
a Wald test. Let vàr( p,,) denote the asymp- 
totic variance of p; (as estimated from the 
inverse of the negative of the matrix of 
second derivatives of (11)) and cóv( Âi, P22) 
the asymptotic covariance. Then under Hj, 


n 2D 
[Bu B (1 E 53)] 
[vàr( 91; ) * vàr( B22) + 2cóv( Ain 02)] 
zx. 


(13) 


Column 1 of Table 2 reports this Wald 
test statistic for the three currencies. The 5 
percent critical value for a x?(1) variate is 
3.84, implying overwhelming rejection of Hg 
for all three currencies. 

We also tested Hj by using a likelihood 
ratio test. This statistic compares the value 
of the objective function achieved by the 
estimates in Table 1 with the largest value 
achievable when estimated subject to the 
constraint p, — (1— py). The latter estima- 
tion is a straightforward application of esti- 
mating parameters for an i.i.d. mixture of 
two normals; we used the EM algorithm 
described in Everitt and Hand (1981, pp. 


^A. Ronald Gallant (1987, p. 219) argues that the 
likelihood ratio test is apt to be more robust than Wald 
tests in a nonlinear model such as this one. 
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36-37) for this purpose, with Bayesian cor- 
rection as in (9) and (10). Twice the differ- 
ence in the objective function (11) between 
the constrained and unconstrained esti- 
mates is reported in column 2 of Table 2, 
and is presumed asymptotically to have a 

x’(1) distribution. The magnitude of the 
difference between the Wald test statistics 
and the likelihood ratio test statistics is dis- 
concerting.’ - Still, at least in the case of 
France and the U.K., the rejection of Hj 
continues to be fairly decisive. 

It is also interesting to test the hypothesis 
Hg: u, = pa. Under this hypothesis, the ex- 
change rate follows a stochastic process as 
described in Section J, with the mean rate 
of depreciation the same in both states. If 
H= Hz, but o, #0», the states have the 
same mean rate of depreciation but differ- 
ent variances. A Wald statistic for testing 
Hj is given by 

(à; — Àj) 2 x^) 
vàr( A.) +vâr(â,)— 20v Âa) 
The statistics are reported in column 3 of 
Table 2. Column 4 reports an analogous 
likelihood ratio test. The means of the two 
states are different. 

We thus conclude that movements in the 
dollar are described by long swings. The 
dollar enters stages in which it appreciates 
or depreciates and it remains in such stages 
for years. The expected length of state i is 
1/(1— pj;). State 1 is expected to persist for 
seven quarters for Germany, six for France, 
and fourteen for the U.K. On average state 
2 lasts fourteen quarters for the mark, eleven 
for the franc, and twelve for the pound. 


C. Forecasting 


As further evidence on the random walk 
hypothesis, we calculated the in-sample. and 


"This seems due in part to asymmetry in the likeli- 
hood surface for increases and decreases in p; For 
values of p;; above p,;, the Wald approximation dlightly 
understates the true curvature of the likelihood func- 
tion, whereas for pj; < Bjjs the likelihood function 
quickly becomes much flatter than the Hessian evalu- 
ated at fj; predicts. 
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post-sample forecast errors for the seg- 
mented trends model in comparison to those 
of the random walk specification. 

Consider first in-sample forecasts. If one 
takes the MLE 6 (about which the full 
sample y,,...,y7 was used to draw infer- 
ence) as known at date t, then the forecast 
one would make on the basis of observation 
of y through date ¢ and on the basis of 
knowledge of 6 is given by® 


(14) E| Yasil Yi Y58] = fy 
+{6+(-1+ B5 * By)’ 


| p(s, = 1yr- 58) - 5] 


with ô given in (2). Letting 9,,,,, denote the 
expression in (14), we calculated k-period 
ahead forecasts of the level of the log of the 
exchange rate 


8 Lk =e,+ $a T $a pod M 
and calculated the average squared value of 
the forecast error, 


T-k 


2 (ê t kl - eu). /(T- k), 


t=1 


for forecast horizons (k) of one through 
four quarters. 

The top panel of Table 3 compares these 
forecast errors with those of a random walk 
specification, whose forecasts a piven by 
Gem m €, tk: y, where y= Elay T. 
Note that the variance of the latter forecast 
error should, if the random walk specifica- 
tion is correct, rise linearly with the forecast 
horizon k. The actual MSE’s for the ran- 
dom walk specification in Table 3 perform 
more poorly than this, owing to positive 
autocorrelation in y, at lags one through 


*See equation (3.2) in Hamilton (1989a). 
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TABLE 3—-IN-SAMPLE AND PosT-SAMPLE MEAN SQUARED Forecast ERROR AT 
HORIZONS FROM ONE To Four QUARTERS OF SEGMENTED 
TRENDS MODEL AND RANDOM WALK WITH DRIFT 








A. In-Sample Mean Squared Forecast Errors 











Country Forecast Horizon (Quarters) 
(Model) 1 2 3 4 
Germany 
(Random Walk) 38.01 83.79 130.84 199.18 
(Segmented Trend) 36.48 76.39 113.55 174.93 
(Percent Improvement) 4Percent — 9Percent 13 Percent 12 Percent 
France 
(Random Walk) 34.37 82.96 143.99 220.88 
(Segmented Trend) 31.27 72.13 124.31 194.76 
(Percent Improvement) 9 Percent 13 Percent 14Percent 12 Percent 
United Kingdom 
(Random Walk) 29.10 76.06 124.45 187.26 
(Segmented Trend) 25.11 65.95 107.82 170.36 
(Percent Improvement) 14 Percent 13 Percent 13 Percent 9 Percent 
B. Post-Sample Mean Squared Forecast Errors 
Country Forecast Horizon (Quarters) 
(model) 1 2 3 4 
Germany 
(Random Walk) 54.58 141.33 251.62 406.49 
(Segmented Trend) 50.44 133.37 245.77 409.85 
(Percent Improvement) 8 Percent 6Percent 2Percent  —1 Percent 
France 
(Random Walk) 52.47 145.59 266.34 426.76 
(Segmented Trend) 46.80 134.32 255.33 427.01 
(Percent Improvement) 11 Percent 8Percent — 4Percent —0 Percent 
United Kingdom 
(Random Walk) . 4235 117.54 186.68 270.43 
(Segmented Trend) 35.11 98.40 161.28 252.61 
(Percent Improvement) 17 Percent 16 Percent 14Percent 7 Percent 


Notes: A. In-Sample Forecast Errors. In each case, the population parameters were 


estimated by using data from t=1973:IV to 1988:I and mean squared errors are 
those associated with forecasts for dates t — 1973:1V + k to 1938:1 where k is the 
forecast horizon. — . 

B. Post-Sample Forecast Errors. In each case, the population parameters were 
estimated by using data from t = 1973:IV to 1983:IV and mean squared errors are 


those associated with forecasts for dates £ = 1984:I to 1988:1. 


three? and to the fact that y, the mean of 
observations 1 through T, is not quite the 
same as the mean of observations k though 
T. The improvement in forecasting at hori- 


?Recall John Cochrane's (1988, p. 906) result that 


Var(e, 4 — e,) = k|Var(e,,,— ej) 


k-1 


*2 ((k- j)/k):Cov(y. si) l- 


fel 


zons of two to four quarters offered by the 
segmented trends specification is 9-14 per- 
cent for all three currencies. 

It is worth noting that our model is doing 
more than just mimicking an AR(1) specifi- 
cation for exchange rate changes. An AR(1) 
model has an in-sample one-quarter ahead 
R? of 8 percent for the U.K., 4 percent for 
France, and less than 1 percent for Ger- 
many and offers virtually no improvement in 
forecasting at longer horizons. 

To evaluate the post-sample forecasting 
performance of the model, we reestimated 
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the parameters with data only up to the end 
of 1983. We chose the end of 1983 because 
the major turning point in the dollar that 
occurred in 1985 had not yet happened. 
Hence, the entire period of the dollar de- 
preciation of 1985-1987 was not used for 
estimating parameters. Furthermore, with 
our out-of-sample forecasts our model must 


meet the challenge of picking out the turn- 


ing point. 

The parameter estimates for the trun- 
cated sample are similar to those of the full 
sample; using only data through 1983, there 
is evidence in favor of the long swings hy- 
pothesis. 

The bottom panel of Table 3 compares 
the post-sample mean squared error of the 
forecasts of our model with that of a ran- 
dom walk (with the drift term estimated 
from data through the end of 1983). The 
forecasts for the segmented trends model 
were calculated as in equation (14), but 
using the parameter estimates from the re- 
stricted sample. We found that our model 
generally outperformed the random walk, 
particularly at short forecasting horizons. 

Table 3 follows Mussa (1979) and Meese 
and Singleton (1982) in including a drift 
term in the random walk; by contrast, Meese 
and Rogoff (1983a, 1983b) and Diebold and 
Nason (1989) set the drift term a priori to 
zero. The zero-drift random walk specifica- 
tion has a forecasting performance over 
1984:1—1988:1 that significantly beats both 
our specification and the random walk with 
drift. Some would interpret this as evidence 
in favor of the random walk hypothesis. We 
would instead argue that the superiority 
during 1984—1988 of the random walk with- 
out drift over the random walk with drift 
offers conclusive evidence that exchange 
rates do not follow a random walk, with or 
without drift! If the data really followed a 
driftless random walk, then differences in 
post-sample forecasts between the two ran- 
dom-walk specifications should be entirely 
due to error in estimating the drift term. 
Conventional statistical tests lead to clear 
rejection of the null hypothesis that the 
exchange rate data.come from a random 
walk with the same drift term before and 
after 1984 (see also Table 4 below). The 
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. driftless random walk is just a special case 


of this rejected hypothesis. Imposing a par- 
ticular numerical value for the drift (in this 
case, zero) is of course going to improve the 
fit over selected subsamples, but cannot sal- 
vage the model as a specification that de- 
scribes the complete sample. An.apparent 
break in parameter values over particular 
subsamples may be an important feature 
that accounts for the results of Meese and 
Rogoff and Diebold and Nason, and it is 
precisely the feature of the data that our 
long swings Tepiesemiation is attempting to 
model. 

We conclude that the accumulated evi- 
dence from the Wald tests, likelihood ratio 
tests, and forecasting performance favors 
the segmented-trends specification over:the 
random walk. 


D. Specification Testing 


This section presents specification tests 
that fall into four broad groups. The first 
group explores the forecastability of the 
one-quarter-ahead in-sample forecast er- 
rors. Second, we consider tests based on the 
work of Whitney Newey (1985), George 
Tauchen (1985), and Halbert White (1987) 
that examine the null hypothesis that the 
Score statistics are serially uncorrelated. 
Third, we perform Lagrange multiplier tests 
for various sorts of dynamic misspecifica- 
tion. Finally, we split the sample at the end 
of 1979 and again at the end of 1983 and 
perform likelihood ratio tests for changes in 
the stochastic process governing exchange 
rates. 


Forecasting Tests. Our forecasting tests di- 
vide the one-quarter-ahead forecasts of our 


model (expression 14) by their conditional 
standard deviation: + 


+ [73+ 62]- (0 Peri) 


1/2 
-fâr Bat Âz (1- Bisiy)} "E s 
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where 


Braye = (1- By) +(-1+4 Bi + By) 


p(s, = yy... 9,58). 


The resulting standardized one-period- 
ahead forecasts errors (4, ,,) should be un- 
forecastable with any time ¢ variables. We 
calculated many regressions of these errors 
on their own lagged values, on lagged values 
of the log changes in exchange rates, and on 
.various combinations of the squares and 
cross-products of these variables. In no case 
did a joint test of a zero intercept and zero 
slope coefficients reject the null hypothesis. 
For example, a regression of 2, on a con- 
stant, à, , and 2 ,, for t— 74:11-88:1 
yields F(3,53) statistics whose p-values are 
(0.90), (0.84), and (0.70) for the three cur- 
rencies. The smallest p-value in the two 
dozen regressions we looked at was (0.42) 
for the regression of 4, on a,_; and d?_, for 
j=1,2,3,4 for the U.K. 

Hsieh (1989) used the test of William 
Brock, W. Davis Dechert, and José 
Scheinkman (1987) to search for general 
nonlinear dependence in a time-series. He 
found evidence of significant dependence in 
daily data for several currencies. We re- 
peated these tests on our standardized 
residuals.!° We varied N (the dimension of 
* N-histories") between 2 and 6 (in steps of 
1) and ¢ (the distance measure, in standard 
deviations of the data) between 0.5 and 1.5, 
in steps of 0.25. We found no evidence of 
serial dependence in the standardized resid- 
uals. However, in contrast to Hsieh's analy- 
sis of daily data, there is little evidence from 
this test of nonlinear dependence in the raw 
quarterly exchange rate changes. 


Score tests. White (1987) noted that if a 
maximum likelihood model is correctly 
specified, the score statistics (the derivative 
of the conditional log likelihood of the ¢-th 
observation) should be serially uncorre- 
lated. Hamilton (1989b) showed how White's 


We calculated these statistics by using computer 
code kindly distributed to us by W. Davis Dechert. 
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results may be used to construct tests for 
possible alternatives to the Markov switch- 
ing model. Table 4 applies these tests to the 
random walk specification, and Table 5 to 
our long swings model. 

By considering the score with respect to 
the mean, a White test for autocorrelation 
can be constructed (which essentially tests 
for the correlation of the score at time f£. 
with respect to u; and the score at time 
t —1 with respect to u, for i, j = 1,2.) Table 
4 provides evidence of autocorrelation in 
the raw data for the U.K., while Table 5 
finds no evidence of autocorrelation left over 
after fitting the long swings model. 

An ARCH test can be implemented by 
examining the serial correlation properties 
of the scores with respect to o2, i—1,2. 
Table 5 shows that the test for ARCH for 
the U.K. is significant at the 5 percent level. 
However, Hamilton (1989b) concluded from 
Monte Carlo simulations that “For a sam- 
ple as small as 50 observations, one might 
be better off using the 1 percent critical 
value from the asymptotic distributions 
(rather than the 5 percent value) as a rough 
guide for a 5 percent small-sample test based 
on the Newey-Tauchen-White specification 
tests or Lagrange multiplier tests for mis- 
specification of the variance." By this stan- 
dard, the null hypothesis of no ARCH 
should not be rejected. 

The Markov assumption that p(s, — i) de- 
pends only on the state at time t — 1 can be 
tested against the alternatives that it de- 
pends on the state at earlier times or that it 
depends on the realizations of the data y,. ,. 
This test checks whether the score with 
respect to the transition probabilities can be 
predicted by the corresponding lagged score 
or the score with respect to the mean. Table 
5 shows that the Markov specification can- 
not be rejected for any currency. 


Lagrange Multiplier Tests. Tables 4 and 5 
also present Lagrange multiplier tests of the 
random walk and long swings specifications. 
We tested against the alternatives that there 
is omitted autocorrelation only in state 1, 
autocorrelation only in state 2, and autocor- 
relation across regimes. These produced the 
same conclusions as the White tests for 
autocorrelation. 
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TABLE 4—TESTS oF NULL HypoTuesis THAT PERCENT CHANGES IN EXCHANGE RATES 
ARE Li.d. GAUSSIAN 




















Test Germany France U.K. 
White Test for 0.28 2.15 [6.02] 
Autocorrelation (y?(1)) (0.60) (0.14) (0.01) 
White Test for 0.32 0.02 0.39 
ARCH (x?(1) (0.57) (0.89) (0.53) 
LM Test for 0.28 2.14 [6.02] 
Autocorrelation ( y ?(1)) (0.60) (0.14) (0.01) 
LM Test for 0.68 0.00 0.32 
ARCH (x7(1)) (0.41) ' (1.00) (0.57) 
LM Test for Shift in Mean [4.14] [6.70] [3.92] 
79:1V-82:1V (x?) (0.04) (0.01) (0.05) 
LM Test for Shift in Mean [4.53] [7.69] [10.19] 
85:11-83: I (x7(D) (0.03) (0.01) (0.00) 








Notes: All statistics are asymptotically x?(1) [5 percent critical value = 3.84; 1 percent 
critical value = 6.63]. Brackets [ ] denote significant at 5 percent level. Asymptotic 


p-values are in parentheses. 


TABLE S5—Tests or Nutt Hypotuesis THAT EXCHANGE RATES FOLLOW 
THE LonG Swincs MODEL 


Test 


White Test for 
Autocorrelation (y ?(4)) 

White Test for 
ARCH (x?(4) 

White Test of Markov 
Specification (x ?(4)) 

LM Test for Autocorrelation 
in Regime 1 (y7(1)) 

LM Test for Autocorrelation 
in Regime 2 G0» 

LM Test for Autocorrelation 
Across Regimes (y ^(1)) 

LM Test for 
ARCH (x?(1) 

LM Test for Shift in Mean 
79:1V-82:IV (x20) 

LM Test for Shift in Mean 
85:11-88:1 (x7) 


Germany France U.K. 
2.56 4.23 4.61 
(0.63) | (0.38) (0.33) 

3.52 7.26 [10.55] 
(0.47) (0.12) (0.03) 
2.73 4.47 1.59 
(0.60) (0.35) (0.81) 
0.00 3.26 3.02 
(1.00) (0.07) (0.08) 
1.44 0.24 0.30 
(0.23) (0.62) (0.58) 
0.59 1.06 0.65 
(0.44) (0.30) (0.42) 
1.27 0.10 [4.47] 
(0,26) (0.75) (0.03) 
1.66 1.92 1.50 
(0.20) (0.17) (0.22) 
[11.00] [12.24] 228 
(0.00) (0.00) (0.13) 


Notes: The first three statistics are asymptotically y ?(4) [5 percent critical value = 9.49; 
1 percent critical value = 13.28]. All other statistics are asymptotically x?(1) [5 percent 
critical value = 3.84; 1 percent critical value = 6.63]. Brackets [ ] denote significant at 5 
percent level. Asymptotic p-values are in parentheses. 


Tables 4 and 5 also report the results of 
LM tests for ARCH. For the alternative to 
the long swings model, the variance at time 
t, h,, is modeled as 


heeg pec—— 2 


(see Hamilton, 1989b). Under the null hy- 
pothesis of no ARCH, £— 0, and ys, = oj. 
Again, we found some evidence of ARCH 
for the U.K., but we would probably not 
consider it significant at the 5 percent level 
given our number of observations. 

We can also use the Lagrange multiplier 
principle to test whether the mean of the 
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TABLE 6—LIKELIHOOD RATIO TESTS FOR WHETHER 
ALL PARAMETERS OF THE LONG Swincs MODEL 
CHANGE AT SPECIFIED DATES 





Date of 

Sample Break Germany France U.K. 

1979 :IV 8.343 12.313 4.846 
(0.21) (0.06) (0.56) 

1983: IV 4,303 8.050 3.775 
(0.64) (0.23 (0.71) 


Notes: All variables are asymptotically x?(6). Asymp- 
totic p-values are in parentheses. 


process shifted over any subsample. When 
we applied this test to the random walk 
specification, we found evidence for all three 
currencies of a change in the drift associ- 
ated with the changé in U.S. Federal Re- 
serve operating procedures during October 
1979-October 1982 (see Table 4). By con- 
trast, allowing for a separate mean for this 
subperiod does not make a statistically sig- 
nificant contribution to the long swings 
model (Table 5). Thus one feature of the 
data that is inconsistent with a random walk 
that the long swings model captures is the 
persistent tendency for the dollar to appre- 
ciate during the three-year period in which 
the Fed targeted nonborrowed reserves. 

We also tested for a permanent break in 
the mean of the series for all possible change 
points in the sample. It is interesting that 
for all three currencies and for both the 
random walk and long swings specifications, 
the largest value of this statistic. comes 
within one quarter of 1985:1I. Table 4 re- 
veals evidence of a break in the process 
after 1985 that is not captured by the ran- 
dom walk. Table 5 suggests that the long 
swings model is able to capture this break in 
the case of the U.K. but not in the case of 
Germany and France. 


Likelihood Ratio Tests. We also tested for 
shifts in the stochastic process by perform- 
ing likelihood ratio tests for joint changes in 
all the parameters at the end of 1979 and at 
the end of.1983. Table 6 shows that we 
cannot reject the null of no shift at the 5 
percent level for any currency at either date. 
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IV. Testing the Hypothesis of Uncovered 
Interest Parity 


We now turn to the second question posed 
by our paper—Is this apparent forecasta- 
bility of exchange rates reflected in inter- 
country interest differentials? Uncovered 
interest parity posits that a three-month Eu- 
rodollar account should yield the same re- 
turn expected by converting the dollars to 
marks, holding these marks in a Euromark . 
account for three months, and converting 
back into dollars at the then-prevailing ex- 
change rate: 


(15) i5 —iW6 + E (e¥S — erc) ta. 


Here i, is the return on a Eurocurrency 
account for the specified currency, e, is the 
log of the exchange rate (in dollars per unit 
of foreign currency), and u, is a disturbance 
term that reflects measurement and speci- 
fication error. 

Let the log of the forward exchange rate 
be f”° dollars per mark. A pure arbitrage 
opportunity exists unless 


(16) jUS wx {WE 4 fC  QWwa. 


Thus, the hypothesis of uncovered interest 
parity (15) is essentially equivalent to the 
hypothesis that the forward rate is an unbi- 
ased predictor of the future spot rate 


PWG WG 
Jı = Eeri + uy. 


We report our results in terms of testing 
uncovered interest parity rather than of 
testing the risk neutrality of the forward 
currency market, though the two tests are , 
conceptually the same. 

Suppose that investors know the popula- 
tion parameter @ of the segmented trends 
model and further observe the value of s,, 
which governed the mean of the exchange 
rate change between t —1 and t. When s, = 
1, then the change in the exchange rate 
between ¢ and f--1 will be drawn from a 
N(u, of) distribution with probability pu ` 
and from a N(y,,o7) distribution with 
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TABLE 7——INTEREST DIFFERENTIALS AS PREDICTED BY (a) THE UNivaRIATE MLEs For 
EAcH CouNTRY's ExcHANGE RaTE (TABLE 1 PARAMETERS), 
AND (b) THE BIVARIATE MLEs For EAcH CouNTRY's EXCHANGE RATE 
TOGETHER WITH THAT COUNTRY’S INTEREST DIFFERENTIAL 
(TABLE 8 PARAMETERS) ` 


1. Predicted Value for iU5 — i* 
When s, =1 Based on 
(a) Univariate Estimate of 


Puki tC- pipe 
(b) Bivariate Estimate of 
nuQ 


2. Predicted Value for i¥5 — i* 
When s, = 2 Based on 
(a) Univariate Estimate of 
Patt, +(1— po 
(b) Bivariate Estimate of 
(2) 


3. Predicted Value for Change in i5 — i* 


When State Changes from 2 to 1 Bad on 


(a) Univariate Estimate of 
(~1+ pu t Pou u2) 
(b) Bivariate Estimate of 
p (2)- u2) 








Germany France U.K. ; 
3.203* 2.196* 2.159* 
(1.150) (0.934) (0.904) 
0.542* —0.249* | —0.282* 
(0.061) (0.118) (0.090) 
—0.811 — 2.160 —3.199 
(1.335) (1.214) (1.001) 
1.171 — 1.228 — 1.425 
(0.086) (0.238) (0.174) 
4.014* 4.356* 5.367* 
(1.597) (1.447) (1.182) 
—0.629* 0.979* 1.143* 
(0.102) (0.249) (0.193) 


Notes: Standard errors are in parenthéses. The standard error for a nonlinear function 
h(8) of the (p X1) parameter vector 8 was calculated as the square root of 
[A,(6)] Vàr(LA,(8)] where h) denotes the (p X1) vector of derivatives of the 
function A{-) with respect to the elements of 8, evaluated at the MLE 8, and Var(@) 
denotes the (p X p) estimated variance-covariance matrix of 6. 

*Indicates that the 95 percent confidence intervals for (a) and (b) fail to overlap. 

i) denotes the jth element of the vector p, in Table 8; thus j,(2) is the mean 
interest differential when the process is in state 1, and 1.22) is the mean interest 


differential when the process is in state 2. 


probability (1— p,,). Thus when s, — 1, in- 
vestors would forecast a change in the ex- 
change rate between ¢ and t +1 of 


(17a) E (efg - ef?) 


= puk t (07 pu) 


whereas when s, = 2, their forecast would 
be 


(17b) El eng — era) 
= Pas t (1— Pr) ey. 
Substituting (17) into (15) gives 


(18a) i5 — iP? = pu, * (1— py) ea tu, 


when s, —1 


= Pala t (1— p33)it, * u, 
when s, = 2. 


(18b) 


Rows (1a) and Qa) of Table 7 present the 
predicted value for the interest differentials 
based on the univariate maximum likeli- 
hood estimates for each country's exchange 
rate. 

The predictions for state 2 (row (2a)) are 
particularly interesting. State 2 is the state 
in which the dollar appreciates. During the 
period of the dollar appreciation of 
1980—1984, the forward rate generally ex- 
ceeded the current spot rate,!! implying un- 
der uncovered interest parity that markets 
expected a depreciation of the dollar. One 
way to reconcile this finding with rationality 
of expectations is to argue that the econo- 
metrician faces a “peso problem” (see 
Robert Cumby and Maurice Obstfeld, 1984; 


!l$ee for example George Evans (1986), Jeffrey 
Frankel and Kenneth Froot (1987, 1988), Eduardo 
Borensztein (1987), and Robert Cumby (1988). 
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Robert Hodrick and Sanjay Srivastava, 1984; 
and Hodrick, 1987). When there is a small 
probability of a large depreciation, the for- 
ward rate may consistently predict a depre- 
ciation while none occurs. 

This possibility is in principle allowed by 
` equation (17b). Even when the process is in 
state 2, in which the dollar is more likely 
than not to appreciate (u, <0, Pa > 0.5), 
the expected change in the exchange rate 
could be positive if the product of (1— p32) 
and u, is sufficiently large. However, the 
probability of a depreciation could not have 
been large, because the appreciation stage 
lasted so long. The probability of a depreci- 
ation given that we are in state 2 is 1— py; 
our estimate is 0.072 for the mark. The 
value of u, for Germany, 3.987, is large but 
not large enough to justify a positive in- 
terest differential. Our calculations suggest 
that a substantia] negative differential of 
0.928( — 1.183) + 0.072(3.987) = —0.811 was 
warranted despite the potential “peso” ef- 
fect. From mid-1980 to mid-1984 the Ger- 
man mark was in state 2 and yet the U.S.- 
German interest differential was invariably 
positive. There is thus prima facie evidence 
against the joint hypothesis of uncovered 
interest parity and rational expectations. We 
have allowed for a “peso problem," but the 
evidence indicates that the probability of 
leaving state 2 once you aré in it is so small 
that a positive interest differential is unwar- 
ranted. Notwithstanding, a 95 percent 
confidence interval for the predicted inter- 
est differential does include positive values 
—the interval ranges from — 3.427 to 1.805. 

Essentially the same conclusion holds for 
the U.K. The predicted interest differential 
in state 2 is —3.20—indeed, the upper end 
of the 95 percent confidence interval for the 
predicted interest differential (—5.161 to 
—1.237) is negative—while the U.S.-U.K. 
interest differential was almost always posi- 
tive from the end of 1980 to 1984. This is 
again a period when our estimates imply 
that the exchange rate was surely in state 2. 


"This is reminiscent of the argument made by 
Frankel (1985) that a rational stochastic bubble could 
not explain the behavior of the dollar. 
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With France, we cannot make such a bold 
statement. It is still true that when we are in 
state 2 our univariate estimates of the ex- 
change rate indicate the interest differential 
should be negative. However, the U.S.- 
French three-month interest differential was 
frequently negative during 1980—1984. So 
there is not a simple, clear-cut case against 
interest parity in the case of the dollar / franc 
relationship. 

Of course, one could still try to salvage 
the peso story by postulating a possible de- 
preciation of the dollar that is more dra- 
matic than that associated with regime 1. 
According to this view, there is perhaps a 
third possible regime of violent deprecia- 
tion, which was never observed in the sam- 
ple. The problem with this view is that, 
under rational expectations, this massive 
depreciation has to be regarded as an ex- 
tremely unlikely event—it did not happen 
once in 58 observations. Suppose we there- 
fore take the probability of moving into this 
regime, pa, as less than 0.02. For such a 
remote event to be able to change the calcu- 
lations in row (2a) of Table 7 from negative 
to positive, the quarterly depreciation of the 
dollar in state 3 would have to be 40 per- 
cent (logarithmically) against the mark, 108 
percent against the franc, and 160 percent 
agains the pound! 

We now explore the hypothesis of uncov- 
ered interest parity by examining the joint 
behavior of exchange rates and interest 
rates. Expressions (18) predict that the in- 
terest differential at date t should have one 
of two means, selected by the same state 
variable s, that governed tbe realization of 
the exchange rate change observed at t. 
Consider then the two-dimensional vector 


y = (e? - e*3), 015 - i9)]-. 


The model holds that this vector comes 
from one of two distributions: 


ys, 71) ~ Muse 2a a 


ys 2) ~ | 





H2 Q 
P333 t (1— Pao) My did 


where we put no restrictions on the vari- 
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TaBLE 8—Estimates Fir to y, =[(e, — e,_,), GU5 — i”), t = 73: TV-88:1, 


e, = 100 Times THE Loc OF THE EXCHANGE RATE 
(IN DOLLARS PER UNIT OF FOREIGN CURRENCY), 


i, = INTEREST Rate (iN 100-Basis Points AT QUARTERLY RATE) 





Parameter Germany 


ua 2.407 


"m —1.164 


Pu 0.972 


0, 36.553 0.269 
(9.639) (0.358) 
0.269 — 0.095 
(0.358) (0.026) 


a, 36.222 —0.920 
(9.816) (0.528) 
—0.920 0.195 
(0.528) (0.052) 


Note: Standard errors are in parentheses. 


ance-covariance matrices Q, and Q,, whose 
properties are governed by the behavior of 
the specification error u, in (15). 

The unrestricted version of this model is 
thus a simple vector generalization of the 
process in Section I: 


(19) ys, ~ N(p,,0, ). 
Our objective now is to maximize’? 


(20) log p(y,,.-.,¥738) - (v72) 
[MOm] 7 (72): [u507 !p.;] 
— alog|O,|— alog|Q,|—0.50%° — 0.107 


—0.5o5* — 0.10, 


13 ; .[|05 0 
The prior A | 0 Ol 


weight the variance of the exchange rate innovations to 
five and the variance of interest rate innovations to 
one. A different scale variable is appropriate since 
exchange rates are considerably more variable than 
interest rate differentials. 


| was used in (12) to 


France U.K 

1.319 0.216 

(1.164) (0.845) 
— 0.249 — 0.282 

(0.118) (0.090) 
—3.042 —2.407 

(1.163) (1.057) 
— 1.228 — 1.425 

(0.238) (0.174) 

0.916 0.983 

(0.050) (0.019) 

0.889 0.969 

(0.071) (0.044) 

34.423 —2.157 31.306 — 1.950 
(8.819) (0.733) (6.702) (0.577) 
—2457 0.365 —1.950 0.342 
(0.733) (0.096) (0.577) (0.073) 
25.068 — 2.036 15.282 —0.335 
(7.586) (0.974) (5.733) (0.680) 
—2.036 0.603 —0.335 0.406 


(0.974) (0.202) 


(0.680) (0.153) 
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where w§°, for example, denotes the (1,1) 
element of 0; !. 

We then fit the unrestricted bivariate 
model (19) to the exchange rate data along 
with interest rates. The series used for the 
latter were the average of bid and asked 
prices on three-month Eurocurrency rates 
(quarterly rates, in 100-basis points) as of 
the close of the London market on the last 
day of the quarter.’4 

The parameter estimates associated with 
the highest value for (20) are reported in 
Table 8 along with asymptotic standard er- 
rors. Figures 5 through 7 plot the data and 
imputed change points, with the two means 
for the interest differential shown as hori- 
zontal dashed lines. 


“These are from the data banks of DRI (called 
WGD03A, WGD03B, FRD03A, FRD03B, UKD03A, 
and UKD03B). Data were converted from annual to 
quarterly rates as 100- ([1 t (i /100)]925 — 1), with i the 
average of the bid and asked returns. 
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FIGURE 6 


It is difficult to find much support for the 
hypothesis of uncovered interest parity in 
these results. Germany is the only country 
for which the segments identified by the 
bivariate system (the top panel of Figures 
5-7) at all resemble those identified by the 


univariate process for exchange rates (Fig- 
ures 2-4), and here the interest diferential 
moves in the opposite direction from that 
predicted by the theory—the period when 
the mark was falling was a period when U.S. 
interest rates were unusually high relative 
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to Germany. The interest differential is 62.9 
basis points higher in state 2 than in state 1 
according to the estimates in Table 8 (1.171 
— 0.542 = 0.629), rather than 400 basis 
points lower as predicted from the univari- 
ate estimates in row (3a) of Table 7. 

There are some interesting statistics that 
help to reveal the inconsistencies between 
the univariate model of exchange rates and 
a bivariate model that imposes uncovered 
interest parity. From the standard errors in 
row (1a) of Table 7 we can construct 95 
percent confidence intervals from the uni- 
variate estimates for the predicted level of 
the interest differential in state 1. That is, 
we can construct confidence intervals for 
Pupa t (0.— pu)us; which is the predicted 
value for iV$ — jWG if we are in state 1. 
These confidence intervals never overlap 
with the 95 percent confidence interval for 
the interest differential from the bivariate 
estimation for state 1 (row (1b), in Table 7). 
This is a conservative test in that the 
marginal significance level is strictly less than 
0.05. This is because if the true parameter 
vector were in the gap between the two 
confidence intervals, two events (either of 
which. alone has probability less than 0.05 of 
having occurred) would have to have both 


occurred. Even if the events were perfectly 
correlated, the probability of both occurring 
together could be no greater than 0.05, and 
in general it must be less than 0.05. | 

Row (3a) of Table 7 gives the change in 
the interest differential in moving from state 
2 to state 1 predicted by the univariate 
estimates. Row (3b) compares these with 
the estimates of the actual change in the 
interest differential, 4,2) — (2), as in- 
ferred from the bivariate system, where the 
subscript refers to the state and the “(2)” 
indicates the second element of the vector 
p. In no case do the confidence intervals 
overlap. This offers evidence against not 
only the hypothesis of interest parity, but 
also of a constant risk premium. 

The above calculations assumed that, un- 
like the econometrician, agents knew the 
state of. the process s, governing the 
most recent observation on exchange rates 
(e, — e,..,) with certainty at date t. Charles 
Engel (1985) and Karen Lewis (1989), for 
example, discussed models of the exchange 
rate in which individuals do not know the 
current monetary policy regime and learn 
about it gradually through Bayesian infer- 
ence. Our results change little if we postu- 
late that agents are learning about the state 
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s, in the same way as the econometrician. 
The real-time forecast of the exchange rate 
change in this case would not be (17) but 
rather (14): ' 


E| yesili Ye- V18] = fi, 
+(6+(-1+ but br) 
[ese tbe 58) - 8] 0 ~ fa}, 


which collapses to (17) in the special case 
when the econometrician has no uncertainty 
about the state (p(s, = 1]y,,..., y,;6)=0 or 
1). Equation (18) then becomes 


(Qu) 25-108 =p, 
+{6+(-14 by + by) 
‘[p(s,=1y1,-.-5958)— A]} 
(s Âa} + uy. 


Hamilton (1988a) showed how equation (21) 
could be estimated jointly with the process 
for exchange rates. Here we settle for a 
more modest descriptive statistic, obtained 
from the regression of the interest differen- 
tial on the output of the filter from the 
univariate estimator 


i-i? p, 
+ Bil p(s, = 1p yp e nth 58) | tu, 


This OLS regression has an R? of 0.01 for 
all three currencies, which we take as con- 
vincing evidence that uncovered interest 
parity can not explain much of the move- 
ments in interest differentials. 

Thus neither the assumption that markets 


know the regime with certainty nor the as- - 


sumption that they are learning about it 
through the rule p(s, = 1|y;,..., y,; 8) offers 
a very appealing account of time variation 
in cross-country interest differentials. 


V. Conclusion 


Movements in the dollar appear to be 
characterized by long swings. We have pre- 
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sented a formal statistical model of what it 
means for the dollar to follow a pattern of 
long swings, and we find that the model fits 
the data well. We conclude that the phe- 
nomenon of long swings deserves more at- 
tention from exchange rate theoreticians. 

Can we offer an explanation of these ex- 
change rate and interest rate movements? 
Dornbusch (1986, 1987), Bernard Dumas 
(1987), Stephan Schulmeister (1987), and 
Betty Daniel (1989) have suggested models 
that allow persistence in movements in the 
exchange rate. Robert Flood and Peter 
Garber (1983) have discussed models in 
which anticipated future events affect the 
current exchange rate, generating nonlinear 
behavior of the exchange rate akin to that 
described here. Hsieh (1988) has described 
a model in which monetary policy stochasti- 
cally shifts between two regimes. Kaminsky 
(1988) has generated a simple model that 
also leads to nominal exchange rate move- 
ments of the type we describe in Section I, 
though empirically identifying the particular 
fundamentals that have shifted in the way 
postulated by her model poses a challenge 
for future research. Jeffrey Frankel and 
Kenneth Froot (1988) have described a 
model in which the behavior of irrational 
“chartists” interacts with rational agents to 
produce potentially long movements in one 
direction in the dollar and a failure of un- 
covered interest parity. These models seem 
able to account for some, but not all, of the 
empirical regularities uncovered here. 

A model that allowed only rational in- 
vestors would need to explain the pattern of 
the dollar and be able to generate risk pre- 
mia that varied enough over time to explain 
the pattern of interest differentials. This is. 
an imposing task (although see Cumby, 
1988). 

Earlier researchers found little evidence 
of linear serial dependence in exchange rate 
changes, supporting the conclusion that the 
exchange rate follows a random walk. We 
reproduce this result but nevertheless find 
compelling evidence of nonlinear serial de- 
pendence in the data characteristic of long 
swings. Our evidence indicates that move- 
ments in the dollar in one direction persist 
over long pericds of time. Furthermore, in- 
terest differentials do not seem to take into 
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account how long these movements are. Our 
estimation method provides a natural way 
of parameterizing the “peso problem,” yet 
we still reject the uncovered interest parity 
hypothesis. In the absence of a plausible 
story about foreign exchange risk premia, 
we conclude that there are long swings in 
the dollar and that markets do not know it. 


APPENDIX 


Treating each currency separately, we began from 
an initial starting value for 80 = (49, (O^. p9, 
pS, a(P, cS). We then iterated on equations (2), (9), 
(10), (7), and (8) until the largest element of 6^? — 
6 was less than 1X 107? in absolute value. For the 
Bayesian parameter v that appears in (9), we specified 
v = 0.1, which roughly corresponds to proceeding as if 
we had observed one-tenth of an observation drawn 
from each regime that took on the value zero. We 
further specified a=0.1 and B=0.5, as if we had 
two-tenths of an observation from each regime whose 
sum of squared deviations from the population mean 
of that regime was f /a = 5. 

For each currency we employed several hundred 
different starting values 8%. These starting values all 
led to a single unique solution to the normal equations 
in the case of France. However, two local maxima were 
found for Britain and four for Germany. The maximum 
likelihood estimates reported in Table 1 are those that 
achieved the highest value of the objective function 
(11). When a diffuse Bayesian prior is used (v = a = 
B = 0) and iteration is begun from the starting values in 
Table 1, the parameter estimates are changed very 
little. It further appears that, apart from the singulari- 
ties, these correspond to the largest bounded local 
maxima of the raw likelihood function (4). Thus, use of 
the prior has essentially no consequences for any of the 
tests of conclusions reported in this paper. By contrast, 
other local maxima of (4) or (11) exhibit large changes 
in parameter estimates for slight changes in the priors. 
Hamilton (1988b) has argued that such a finding should 
be construed as an additional factor supporting selec- 
tion of the global maxima reported in Table 1 for th 
U.K. and Germany. : 

Our bivariate analysis found four local maxima for 
France and Germany and six for the U.K. 
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Testing the Rationality of Price Forecasts: 
New Evidence from Panel Data 


By MıcnaeL P. KEANE AND Davip E. RUNKLE* 


This paper tests the rationality of individual price forecasts in a panel of 
professional forecasters. Here, unlike in most previous studies, rationality is not 
rejected. The results here differ because (1) using individual forecasts avoids 
aggregation bias, (2) comparison of forecasts to initial data avoids bias due to 
data revision, (3) the professional forecasters have economic incentives to state 
their expectations accurately, (4) a new covariance matrix estimator consistent 
when forecast errors are correlated across individuals is used. (JEL 132) 


How people form their expectations of 
future economic events has been an impor- 
tant issue in macroeconomics for many 
years. Businessmen’s expectations play a 
central role in the business-cvcle theories 
of both Arthur Pigou (1927) and John 
Maynard Keynes (1936). Since the way in 
which expectations are formed has impor- 
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Winter Meetings of the Econometric Society for help- 
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rial assistance. The views expressed herein are those of 
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Reserve Bank of Minneapolis or the Federal Reserve 
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The authors' previously published paper, Keane and 
Runkle (1989), also deals with the subject of price 
forecast rationality. The main object of that paper is to 
explain for a general audience the basic issues involved 
in testing rationality. It presents several of the rational- 
ity test results presented here. However, that paper 
does not describe the statistical methods we develop to 
deal with aggregate shocks and moving-average errors, 
or discuss the issues of aggregation bias or the prob- 
lems involved in measuring expectations. Nor does it 
contain the empirical results on the importance of 
aggregate shocks and moving-average errors, the re- 
sults on the importance of data revisions, or the results 
on private information. 
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tant implications for economic behavior, 
many economists have used survey data to 
test hypotheses about expectation forma- 
tion. In particular, several researchers have 
tested the hypothesis that expectations are 
rational in John Muth’s sense (Muth, 1961). 
In this paper, we provide a further test of 
the rational expectations hypothesis. 
Specifically, we test the rationality of price- 
level forecasts. We use price-forecast data 
from the ASA-NBER survey of professional 
forecasters, initiated by Victor Zarnowitz 
(1969, 1974, 1984, 1985). Zarnowitz, like 
most other researchers in this literature, 
finds that price forecasts are not rational.! 
However, using different statistical meth- 
ods, we find strong evidence that price fore- 
casts are rational. 

We believe that our further testing of 
price-forecast rationality is warranted be- 
cause of severe problems in almost all exist- 
ing tests, which suffer from one or more of 
the following four flaws. First, some use 
average survey response data rather than 
individual data. This can bias tests in two 
ways. It can lead to false rejection of ratio- 
nal expectations because average forecasts 
that are conditional on different information 
sets are not rational forecasts conditional 
on any particular information set. Further, 


1Zarnowitz (1985) found that the forecasts of other 
variables, besides prices, were rational. We explain his 
finding of price-forecast irrationality. Lovell (1986) re- 
views the literature on tests of rational expectations. 
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it can lead to false acceptance of rational 
expectations by masking systematic individ- 
ual bias that may be randomly distributed in 
the population. Second, many tests fail to 
deal properly with the pervasive problem of 
systematic data revision. Tests of forecast 
rationality depend upon correct assump- 
tions about what the forecasters tried to 
predict and what they knew when they made 
their predictions. Much work has implicitly 
tested whether forecasters rationally fore- 
cast revised data conditional on other re- 
vised data, none of which was available until 
long after the forecasts were made. Third, 
most studies of forecast rationality use pre- 
dictions from individuals who are not pro- 
fessional forecasters; these people have few 
economic incentives to report their expecta- 
tions precisely. Finally, many studies that 
use micro data fail to account properly for 
the covariance structure of the forecast er- 
rors. This failure can take two forms. First, 
some studies assume that forecast errors 
must be white noise. In fact, lags in the 
availability of relevant data can produce 
serially correlated errors even when agents 
are rational. Second, most studies fail to 
account for the fact that shocks to the ag- 
gregate economy produce forecast errors 
that are correlated across individuals. In 
either case, improperly assuming indepen- 
dent, identically distributed errors can pro- 
duce severely biased results. 

In this paper we avoid these problems. 
We use the individual data on quarterly 
GNP deflator forecasts from the ASA- 
NBER survey of forecasters to test the ra- 
tional expectations hypothesis. We treat the 
data as a panel to avoid aggregation biases, 
and we reconstruct ‘the actual information 
sets available to agents when they made 
their forecasts. We compare these forecasts 
only to the GNP deflator data announced 
45 days after the end of the quarter, before 
systematic data revisions. The survey data 
include only forecasts from professional 
forecasters, who have an economic incentive 
to be accurate. Because these professionals 
report to the survey the same forecasts that 
they sell on the market, their survey re- 
sponses provide a reasonably accurate mea- 
sure of their expectations. Thus, these data 
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are less subject to the criticism made by 
opponents of survey forecast rationality tests 
that the respondents had nothing to lose if 
they made bad forecasts.? Finally, we de- 
velop a new covariance matrix that is consis- 
tent both in the presence of aggregate 
shocks and in the presence of serial correla- 
tion resulting from delays in data availabil- 
ity. When we use all these procedures, we 
find that the rational expectations hypothe- 
sis cannot be rejected, and we demonstrate 
how the hypothesis is rejected when some 
of these procedures are not followed. We 
also report some findings that extend be- 
yond the existing literature. We test the 
hypothesis that differences in individual 
forecasts are due solely to asymmetric infor- 
mation and cannot be explained by any pub- 
licly available information. To our knowl- 
edge, this strong implication of the rational 
expectations hypothesis has not been tested 
previously. We find that it cannot be re- 
jected. Finally, we find that, because of 
lagged data availability, forecast errors are 
MA(1) despite the rationality of the fore- 
casters. This error structure implies that the 
class of rational expectations models in 
which output varies only because of mone- 
tary surprises can produce persistent move- 
ment in output. 

The outline of the paper is as follows. 
Section I addresses the argument, made by 
some economists, that the rational expecta- 


?Note that some authors (for example, Cukierman, 


` 1986) have called this inaccuracy "measurement error." 


But what these authors are discussing is different from 
the usual interpretations of measurement error. An 
example will make quite clear what they call “measure- 
ment error." Suppose someone calls Keane and asks 
for his forecast of the three-month T-bill rate for the 
next quarter. He is busy writing a paper for a confer- 
ence—the activity for which he receives monetary re- 
ward—that is due in three days. Quickly, Keane tells 
the caller 8 percent. While reading the Wall Street 
Journal later in the day, Keane sees that the forward 
rate on three-month T-bills is 9 percent. He does not 
run out to buy bonds in the expectation that rates will 
fall to 8 percent because, when he thinks about it, 9 
percent seems reasonable. Thus, 8 percent is an erro- 
neous measure of his true expectation because he does 
not act in the market as if that were his expectation. 
But that is not measurement error as it is usually 
understood, so we prefer to call the problem "lack of 
an economic incentive to accurately state expectations." 
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tions hypothesis cannot be tested using sur- 
vey data. Section II reviews the literature 
and describes the problems with previous 
research that have motivated our work. Sec- 
tion III discusses our econometric methods, 
including a new covariance matrix estimator 
for panel data that is consistent in the pres- 
ence of aggregate shocks. Section IV de- 
scribes the ASA-NBER survey data and dis- 
cusses the complex data timing and revision 
issues. Section V presents our empirical re- 
sults. Section VI concludes. 


I. On the Use of Survey Data to Test 
Theories of Expectation Formation 


Some economists have questioned 
whether it is valid to use survey data on 
agents' expectations to test among various 
mechanisms of expectation formation. For 
example, Edward Prescott (1977, p. 30) has 
argued that "surveys cannot be used to test 
the rational expectations hypothesis. One 
can only test if some [economic] theory, 
whether it incorporates rational expecta- 
tions or, for ... [that] matter, irrational ex- 
pectations, is or is not consistent with obser- 
vations." Others, such as Zarnowitz (1984, 
p. 15), have argued that “it is not good 
‘positive economics’ to dismiss it [evidence 
from survey data] on the ground that only 
theories, not their assumptions, can be 
tested." ; 

Economists do agree that the ability of 
economic models to explain behavior de- 
pends on the assumed expectation-forma- 
tion mechanism. For example, the intertem- 
poral substitution model proposed by 
Robert Lucas and Leonard Rapping (1969) 
provides a far better explanation of employ- 
ment fluctuations if expectations are “adap- 
tive” rather than rational. Since any test of 
a model involves a joint test of the behav- 
ioral equations and the assumed expecta- 
tion-formation mechanism, researchers face 
an identification problem. First, if the model 
is rejected, we do not know whether the 
behavioral equations or the expectations 
mechanism is being rejected. Second, if two 
different models (different in both the be- 
havioral equations and expectation-forma- 
tion mechanism) explain data equally well, 
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we cannot identify the proper behavioral 
model without knowing the expectation- 
formation mechanism.? 

Many economists have assumed the valid- 
ity of the rational expectations hypothesis 
and regarded joint tests of behavioral equa- 
tions and this particular expectation-forma- 
tion mechanism as identified tests of behav- 
ioral equations. This willingness to assume 
the validity of Muth’s rational expectations 
hypothesis perhaps results from a wide- 
spread view that forming expectations ratio- 
nally is simply the logical consequence of 
optimizing behavior. Yet there are many 
reasons why the rational expectations hy- 
pothesis does not follow directly from the 
assumption of optimizing behavior. For ex- 
ample, agents’ expectations in stochastic 
equilibrium may be rational in Muth’s sense, 
while the expectations of Bayesian learners 
in a nonstationary environment are not (see 
John Caskey, 1985). Or, perhaps, informa- 
tion necessary to form expectations on the 
basis of the true economic model may be 
impossible cr too costly to obtain (see 
Kenneth Arrow, 1978). In this paper we 
adopt Zarnowitz’s view that the hypothesis 
of rational expectations should be tested 
and not simply assumed valid. For further 
discussion of the philosophical issues, we 
refer the reader to Michael Lovell (1986) or 
Zarnowitz (1984). 


II. A Critique of the Literature 


Given the large number of empirical tests 
of the rational expectations hypothesis in 
the literature, one might wonder whether 
yet another test is necessary. We contend 
that almost all existing tests are either in- 
correct or inadequate for four reasons. First, 
most tests use sample mean forecasts rather 
than individual forecasts. Second, respon- 
dents in most surveys have little incentive to 
make accurate forecasts. Third, most tests 
compare survey forecasts to revised rather 
than initial data. Fourth, many tests are 
based on incorrect assumptions about the 


>This is the well-known problem of observational 
equivalence (see Sargent, 1976). 
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covariance structure of forecast errors. We 
discuss these problems with the existing lit- 
erature in this section. 

Many tests of the rational expectations 
hypothesis that use survey data examine the 
rationality of the sample mean or “con- 
sensus? forecast constructed from surveys 
of individual forecasters. These include most 
tests using the Livingston data, as well as 
the work by Jonathan Leonard (1982) on 
the Endicott survey of employers' wage ex- 
pectations, that by Frank de Leeuw and 
Michael McKelvey (1981) on the price ex- 
pectations of business firms, that by Jeffrey 
Frankel and Kenneth Froot (1987) on vari- 
ous surveys of exchange rate expectations, 
and that by Benjamin Friedman (1980) on 
the Goldsmith-Nagen survey of interest rate 
expectations. 

There are two problems with using con- 


sensus forecasts to test rationality. First, 


doing so causes serious specification bias. If 
forecasters are rational, their forecasts will 
differ only because of differences in their 
information sets? The mean of many indi- 
vidual rational forecasts, each conditional 
on a private information set, is not itself a 
rational forecast conditional on any particu- 
lar information set (see Stephen Figlewski 
and Paul Wachtel, 1983). This seemingly 
minor issue can produce severe bias. We 
discuss this problem further in Section III, 
Part B. 

A second problem with using consensus 
forecasts is that this approach can mask 
individual deviations from  rationality. 
Albert Hirsch and Michael Lovell (1969), 
looking at data from individual firms in the 
Commerce Department Manufacturers’ In- 
ventory and Sales Expectations survey, 
found (p. 71) that some firms are consis- 
tently optimistic about future sales while 


^See Brown and Maital (1981), Carlson (1977), 
Gramlich (1983), Mullineaux (1978), Pearce (1979), 
Pesando (1975), and Rich (1987). 

This is a very important point. If each forecaster 
had exactly the same information, all forecasts would 
have to be the same, if they were rational. Forecasters 
can make different rational forecasts if they each have 
some different private information. We discuss this 
issue more fully in Section III, Part C. 
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others are consistently pessimistic. Averag- 
ing expectations, however, can cancel these 
biases across firms so that industry mean 
expectations show no bias. Muth (1985) 
looked at anticipated production for indi- 
vidual Pittsburgh steel firms and found the 
same phenomenon. 

For both of these reasons we argue that 
researchers must use individual data in or- 
der to test hypotheses about how people 
form expectations. These data can be used 
to test rationality either on an individual- 
by-individual basis or by running pooled 
time-series cross-section regressions. Both 
individual and pooled regressions are repre- 
sented in the work of Hirsch and Lovell 
(1969), Muth (1985), Figlewski and Wachtel 
(1981), Thomas Urich and Paul Wachtel 
(1984), de Leeuw and McKelvey (1981), and 
Zarnowitz (1984, 1985). 

A second, equally severe problem with 
most tests of forecast rationality is that sur- 
vey data on expectations do not necessarily 
reflect the true expectations of the forecast- 
ers. But in order to test the rational expec- 
tations hypothesis, one needs data that can 
be reasonably assumed to reflect the fore- 
casters’ expectations. As de Leeuw and 
McKelvey (1984) found, this is certainly not 
the case with surveys, such as the BEA data 
on sales price expectations of individual 
firms, that ask individuals or firms to give 
“rough estimates" of expected future quan- 
tities. As mentioned earlier, Hirsch and 
Lovell (1969) and Muth (1985) found evi- 
dence against rationality on the firm level. 
However, the problem found by de Leeuw 
and McKelvey may well plague the Hirsch- 
Lovell and Muth studies as well. As we 
discussed in the introduction, it is reason- 
able to assume that this problem is limited 
when dealing with the forecasts of profes- 
sional forecasters who receive a monetary 
reward for producing accurate forecasts and 
who report to the survey the same forecasts 
they sell on the market. This is not true of 
the Livingston price forecast data, because 
the economists polled by Livingston were 
not all professional forecasters. Hence we 
cannot be sure that the rejection of rational- 
ity found by Figlewski and Wachtel (1981), 
who ran a pooled regression using the 
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Livingston data, is not the result of improp- 
erly measured expectations. 

Since 1968 Victor Zarnowitz has worked 
with the NBER and the American Statisti- 
cal Association on the ASA-NBER Survey 
of Forecasts by Economic Statisticians. This 
survey is well suited for testing because it is 
limited to professional producers of quar- 
terly forecasts of GNP and its major compo- 
nents and other economic indicators for 
each of several quarters ahead. Since re- 
spondents report forecasts that they pro- 
duce professionally, the problem of inaccu- 
rate reporting of expectations is probably 
small. Interestingly, the ASA-NBER also 
conducted a few annual surveys, in the early 
years of the period covered, of a much 
larger group of economists of whom three- 
quarters are “occasional” rather than “pro- 
fessional” forecasters. In this survey, with 
method similar to that of Livingston, 
Zarnowitz (1969) finds that “a number of 
the occasional forecasters submitted ex- 
treme and rather unreasonable predictions.” 
This finding adds to our concern about inac- 
curacies due to lack of proper economic 
incentives in the Livingston data. 

Using the ASA-NBER survey, Zarnowitz 
(1984, 1985) has performed tests of the ra- 
tionality of the respondents’ forecasts of 
inflation and other macroeconomic vari- 
ables. Using both survey means and pooled 
data, he rejects the rational expectations 
hypothesis for inflation forecasts. We con- 
sider these results questionable, however, 
because Zarnowitz uses revised rather than 
initial price data in his tests. Using revised 
data does not necessarily invalidate tests of 
unbiasedness, but their use is suspect in this 
case because of the nature of the data revi- 
sions.” We show in Section V, Part B that 
these data revisions introduce a systematic 
bias that may invalidate Zarnowitz's tests of 
unbiasedness. 


$For a description of the survev, see Zarnowitz 
(1969, 1974). 

TIn Section IV we discuss the circumstances under 
which it is appropriate to use revised data to test for 
unbiasedness. 
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A final problem with much of the existing 
literature is incorrect assumptions about the 
covariance of forecast errors. across fore- 
casters. When they pool individual data, 
Figlewski and Wachtel (1981) assume that 
forecast errors are independent across fore- 
casters? Since aggregate shocks affect the 
price level, this is certainly not true. Falsely 
assuming independent errors creates a se- 
vere downward bias in estimated standard 
errors, tending to cause false rejection of 
the rational expectations hypothesis. In Sec- : 
tion III, Part D we discuss how to account 
correctly for the effects of aggregate shocks 
on inference in tests of forecast rationality. 
In Section V, Part A we show the empirical ' 
importance of these effects. 


III. Econometric Issues 
A. Definitions 


Expectations are rational in Muth's sense 
(Muth, 1961) if they are equal to mathemat- 
ical expectations conditional on the set of 
all information relevant for forecasting. For 
an individual forecaster, we can express this 
relationship as 


(1) Pree ECP ay Hy, 


where P,,, is the realized value of the time 
series P at time. t + k, ,P;,,, is a k-step- 
ahead prediction of P made at time t by 
forecaster i, I; , is the information available 
at time ¢ to forecaster i, and E is the 
mathematical expectation operator. This is 
equivalent to the statement: E(e; , | J; ,) = 0, 
where £j, — Pj4, —,P;,,,. This statement 
can be broken down into the separate hy- 
potheses that forecasts are unbiased and 
efficient. 

For an individual forecaster, a test of 
rationality can be performed by running the 


SZarnowitz argues, correctly, that it is invalid to 
pool individual data while assuming independent fore- 
cast errors within each cross section if there are aggre- 
gate shocks. 
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regression 


(2) Prag = Qo t+ Oy Pi rae a5 X; + Las 


where X,, is any variable in forecaster i's 
information set at time t. Unbiasedness re- 
quires that, in a regression without Xi, 
variables, the coefficients in equation (2) 
may be restricted to o = 0 and a, =1. Ef- 
ficiency requires that any variable known at 
time £t or before be E to e; ,; that 
is, a, = 0 for any X;, € I;, 


B. The Aggregation Problem and the 
Advantages of Panel Data 


As we discussed in Section II, using sur- 
vey means to test the rational expectations 
hypothesis leads to serious specification er- 
ror. This bias can be illustrated by compar- 
ing the results of three different estimation 
methods. If we were to run three separate 
tests of unbiasedness using equation (2), 
first doing a separate regression for each 
individual and taking the mean of the esti- 
mated coefficients, second using the pooled 
data, and third using survey means, we could 
call the a; estimates ĉu, @,,, and Ĝim, 
respectively, and we would have 


Pug P, t+k 
T 2 
toil eek 


-— n 
E dy E E 


T N 
i Ibis t+k 
i wp i,t+k 





(4) âp= 


= E unb Bus) 
Eal n n) ? 





where i indexes individuals from 1 to N. 


?1f forecasters have differential costs of over- and 
underprediction, it could be rational for them to pro- 
duce biased forecasts. If we were to find that forecasts 
are biased, it could still be claimed that forecasters 
were rational if it could be shown that they had such 
differential costs. 
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We see that plim; à;; = plim; âp but that 


ay 


(6) =- 


Xim 


p 





~ Var(,P, a — vut E TR 


where ,P. wae EON j,4,í, and the 
grand mean of „Pirs ls assumed to be 0. 
Thus, as long as individual forecasts differ, 
Ĝim Will be biased upward. The importance 
of this bias can be seen in the work of Urich 
and Wachtel (1984) who, using 20 individu- 
als and 95 time periods to test the rational- 
ity of money supply forecasts, obtained the 
following estimates: 


Estimator Qo ay 
Individual Means —0.13 0.78 
Pooled Data —0.12 0.77 
Sample Means — 0.29 1.06 


The upward bias in the sample means esti- 
mator 4,, is seen here to be quite severe 
(about 40 percent). We avoid this aggrega- 
tion problem by using only individual data 
in a pooled time-series cross-section regres- 
sion. 

By using panel data to test the rational 
expectations hypothesis, we avoid the aggre- 
gation problem. We also achieve two other 
important advantages over time-series data. 
First, we can detect systematic bias of indi- 
vidual forecasters by testing for the pres- 
ence of individual effects. Second, using 
panel data increases the degrees of freedom 
available to test hypotheses, making our tests 
of rationality more powerful. 

It is not immediately clear that using panel 
data makes tests of rationality more power- 
ful. Since the dependent variable is the same 
for all individuals, forecast errors will be 
highly correlated across individuals and the 
effective number of degrees of freedom will 
not equal the number of observations minus 
the number of parameters. OLS standard 
errors will therefore be severely biased. 
Other authors who have used panel data 
(for example, Figlewski and Wachtel, 1981; 
Urich and Wachtel, 1984; and de Leeuw 
and McKelvey, 1984) have not adjusted the 
OLS standard errors. We discuss our 
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method of consistently estimating the co- 
variance matrix in Section III, Part D. 


C. The Forecasters’ Information Sets 


We now address two issues concerning 
the covariance structure of the errors in (2). 
First, what is the information structure im- 
plied by (2)? Second, what are the effects of 
aggregate shocks on inference in (2)? Cer- 
tainly any public information known at time 
t or before should be orthogonal to s; ,. But 
an additional restriction requires that any 
private information known by person i at 
time t or before also be orthogonal to el p. 
One’s own prior forecasts and prior forecast 
errors are examples of such private informa- 
tion. Further, if forecasts are publicly an- 
nounced, then other people’s prior forecasts 
and prior forecast errors should be orthogo- 
nal to £;, as well. Finally, the rational 
expectations hypothesis implies that fore- 
casts should only differ across individuals if 
individuals have different private informa- 
tion (i.e., if the conditioning set J; , differs 
among forecasters). This implies that no 
readily available data should be able to ex- 
plain the differences between individuals’ 
forecasts. This proposition is implied by ra- 
tional expectations but has not been tested 
previously. 

Unfortunately, merely stating these con- 
straints does not tell us what forecasters 
should know for a particular survey. Cer- 
tainly, forecasters should know their own 
forecasts in this and previous periods, so 
Piree-ys and ,_)P;,4,-, should both be 
orthogonal to e; ,. The most difficult infor- 
mational issue is whether P, itself is known 
when the forecast ,P,,,, is made. In many 
surveys, P, is not released until after fore- 
casters have predicted , P; ,,,. In those sur- 
veys, neither P, nor the eged one-step- 
ahead forecast error e141 ote, 
should be orthogonal to e; ,. RR the 
forecast errors will be MÁ(X), rather than 
MA(K — 1), as they would be if the forecast- 
ers knew P, when they made their 
forecasts.) This serial correlation does not 


Vn a different context, Watson (1983) made this 
Observation. Note also that we cannot use a GLS 
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refute rationality. However, rationality can- 
not hold if higher-order serial correlation 
exists, because such serial correlation would 
imply that individual forecasters do not learn 
from their own past errors. 

We also note that if we observe , P; ,, both 
it and forecasters’ perceptions of their 
own lagged one-step-ahead forecast errors 
GP it —;-1P;,:) should be orthogonal to 
ei „1! Of course, information that forecast- 
érs did not know when they predicted , Pak 
should not be orthogonal to e, ,. That would 
include, for example, such quantities as P,, 
t+1Pit+ks t+1Pit+k+i P, 7 1D; and P,— 
;Pi. Additionally, if the past forecast of 
another forecaster (,_,P;,,) is publicly an- 
nounced, then a forecaster’s perception of 
the other forecaster’s lagged error (,P, , 
;-1P; ,) should not improve price forecasts. 


D. Aggregate Shocks and Tests 
of Rationality 


At first glance, (2) seems trivial to esti- 
mate. It would seem that we could: use OLS 
and estimate a covariance matrix that is 
consistent in the presence of serial correla- 
tion. However, to use these estimators is to 
assume that forecast errors are uncorre- 
lated across forecasters. But such correla- 
tion is likely because of aggregate shocks to 
the economy. If the average forecast error is 
not zero for each period, then OLS will 
yield inconsistent parameter estimates. 
Chamberlain (1984) first noted the potential 
effects of aggregate shocks on rational ex- 
pectations models estimated with panel 
data. Chamberlain suggests that one result 
of aggregate shocks in a rational expecta- 
tions model is that the sample version of the 
orthogonality condition E(ej,|1;,) con- 
verges to zero as the number of time peri- 





transformation oa (2) in this case because the regres- 
sors are not strictly exogenous. 

zm is the guess that person / made at the end of 
time ¢ about the value of P,. Since data on P, may not 
be released until after that guess is made, rationality 
does not imply that ,P,,- P, Note also that data 
revisions made after the forecast are not in the fore- 
caster's information set when he makes the prediction, 
and therefore functions of those revisions cannot be 
used as independent variables in efficiency tests. 
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ods increases, but not as the number of 
individuals increases, if the number of time 
periods is held fixed. He also notes that 
most panel data models rely on a large 
number of individuals to achieve consis- 
tency. Thus, in a panel with a large N and a 
small T, coefficients in rational expectations 
models may be inconsistent. This problem 
can be addressed in two ways. One can use 
either a long panel so that aggregate shocks 
do not affect consistency or time dummies 
to eliminate the effects of aggregate shocks. 
In this study, we use a long panel to achieve 
consistency." 

OLS also yields inconsistent standard er- 
rors in the presence of aggregate shocks. In 
this paper, we present an important innova- 
tion: In order to achieve efficiency, we use a 
covariance matrix estimator that remains 
consistent in the presence of aggregate 
shocks. 

We now describe the structure of the 
covariance matrix estimator. Recall that the 
GMM estimator in a linear model is 


E X'ZIZAZ\ZX\ 
) Bow "uror xr 

XZ (ZOAZ\ ZY 

x| —|—] —}. 

NT | NT | NT 
In this case, Y is P,,,, X is a constant and 
Pits and Z is Z, (The Z; /'s are instru- 
ments chosen from information available to 
forecaster i at time t.) Here the Q matrix 
will have off-diagonal elements because of 
aggregate shocks and MA errors (for OLS, 
Q = I). To consistently estimate the covari- 
ance matrix we need to consistently esti- 


mate (Z'OZ/NT). If the errors in this 
model were inid, then (1/NT)x 


V We ignore problems with unit roots in this paper. 
Results of West (1988) show that in time-series like the 
deflator, in which trend terms dominate, standard 
asymptotic results for GMM estimators are correct. 

Zarnowitz (1985), arguing that pooled time-series 
cross-section regressions are misspecified, has not rec- 
ognized that such regressions are yT consistent. He 
does, however, recognize that OLS standard errors are 
inconsistent for these models. 
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UN LIZ} 6) gêi Zi, would converge to 
a fixed matrix M, which is a consistent 
estimator of (Z'OZ / NT). If we merely had 
moving average errors, with no correlation 
of the errors across people, then with 
an MACK) a CLINT Seep 
Zi 8i x& oj k Zi +j Would be a consistent es- 
timator. UR that as we increase the order 
of the MA error, the number of terms that 
we compute increases very slowly, so com- 
putation of this matrix is feasible.!4 

However, if errors are correlated across 
people, (2k -- 1)N? terms must be esti- 
mated to compute a general covariance ma- 
trix based on the orthogonality conditions 
from (2). Unfortunately, this calculation 
would exhaust the number of degrees of 
freedom in the data. However, we can con- 
struct a covariance matrix estimator that is 
consistent even in the presence of aggregate 
shocks if we make the following assump- 
tions: 


(8) E(si iki ua) 


UU is V i,t,l,m, 
= s.t. || m|x k; 
0, otherwise. 
(9) E(el seni uu) 
Bu Vi,j,t,l,m, 
= s.t. | —m|xk; 
0, otherwise. 


This amounts to assuming that the data are 
not conditionally heteroskedastic and that 
no forecaster is systematically better than 
any other (ie., o;'s are the same for each 
individual). "5 


Newey and West (1987) discuss the limits on the 
growth of the number of terms that can be estimated. 
These assumptions are somewhat restrictive, but 
they seem reasonable. Although conditional heteroske- 
dasticity exists in inflation, it is unlikely to exist with 
price-level forecasts. The findings of McNees (1975) 
and Zarnowitz (1967) that the accuracy of forecasters 
does not differ systematically supports assumption (9). 
Neither assumption can be formally tested because of 
the combination of aggregate shocks and moving aver- 
age errors. 
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Given (8) and (9), we need only estimate 
the (k -1) o’s and (k +1) 6’s, a feasible 
operation. Under these assumptions, our es- 
timate of Q will have the form 


QR... R 
RQ .. R 
(10) Gel FS AU; 
R R œo Q 
where 
(11) Q 
% c Tk 0 0 0 0 
91 0g Ok- OQ, 0 0 0 
Oy Ogi 2 O0; o 0 0 
=|0 oo 0, % C| 0 0 
0 0 a Oy 0g 0 0 
0 0 . 0 0 0 ao 
0 0 2a 0 0 0 .. a og 
and 
(12) R 
So ô, à 0 0 0 0 
8, 8, Sr 29.20 0 0 
NE So ê & 0 0 
=|0 & 8, 8g 8, 0 0 
0 0 fa. Xx ôo 0 0 
0 0 0 0 0 d 
0 0 0 0 0 bj. ôo 


Given this estimate of ©, we can compute 
Boum and its covariance matrix and test the 
restrictions implied by rationality.' No pre- 


'®Note that we must assume that e/_,, and &/,,, , 
come from the same distribution. Thus we have sym- 
` metry rather than skew-symmetry in the R matrix. 
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vious study has noted this form for find: 
In fact, the limited panel-data work testing 
the rationality of forecasts has completely 
ignored this point.!” 


IV. Data 


This study uses both panel survey data 
and aggregate time-series data. The survey 
data come from the ASA-NBER survey of 
economic forecasters who are members of 
the ASA Business and Economics Section. 
The panel is comprised of professional fore- 
casters rather than research economists 
whose livelihood does not depend upon the 
accuracy of their forecasts.'® Since they re- 
port to the survey the same forecasts that 
they sell on the market, these respondents 
have an economic incentive to make accu- 
rate predictions and we can assume that 
their survey responses:are reasonably accu- 
rate measures of their expectations. Fore- 
casters receive their surveys near the end of 
the first month of each quarter. They report 
their perceptions of the previous quarter's 
GNP deflator and their predictions for the 
current quarter and the following four quar- 
ters.? They respond by the end of the sec- 
ond month of the quarter.?? 

To test rationality, we compare individual 
quarterly. forecasts with the deflator an- 
nouncements released 45 days after the end 
of a given quarter. The deflator data used in 


"Since the dependent variable is the same for each 
forecaster, we cannot estimate a fixed time-effects 
model in this case. However, if there were cross-sec- 
tional variation in the dependent variable (as is true in 
the panel-data consumption literature), a fixed time- 
effects model could be estimated using the i.n.i.d. co- 
variance estimator. With such a short panel, this is 
probably the preferred approach. For further discus- 
sion of this issue, see Runkle (1989). 

In fact, fewer than 5 percent of the respondents to 
the December 1970 survey were academics (Zarnowitz, 
1974). 

They also have to regularly forecast all the major 
components of GNP in order to be included in the 
survey. 

P jt is not mezningless to ask about the deflator 
from the previous quarter since the 45-day NIPA re- 
lease is made after the survey is mailed out. In fact, 
many respondents have already completed their forms 
when those numbers are released; we assume that 
those data are not in the forecasters’ information set. 
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FicunE 1. INITIAL AND FinaL GNP DerLaTOR Data, 1968-1986 


this study are often systematically revised. 
Forecasters generally do not know in ad- 
vance (i.e., at time £) the nature of system- 
atic data revisions occurring after the date 
on which they make their forecasts but be- 
fore the date on which the NIPA announce- 
ménts are made (i.e., time £ +1). Often, the 
ASA-NBER survey asks forecasters for pre- 
dictions that are further in the future than 
the next annual benchmark revision of the 
GNP deflator series. In these cases, fore- 
casts differ systematically from announced 
values solely because of data revisions. We 
deal with this problem by excluding from 
the data all forecasts with horizons that 
extend beyond the date of systematic data 
revisions. We include only those observa- 
tions for which there were no benchmark 
revisions from the time the survey was taken 
until the 45-day NIPA announcement was 
made. To do otherwise would severely bias 
our results against the hypothesis of ratio- 
nality. 

We assume that forecasters knew the 
GNP deflator announcements made in pre- 


vious quarters. We allow for the fact that 
P,, the current quarters GNP deflator, is 
not necessarily known when ,P,,, is an- 
nounced by assuming only that forecasters 
know their own perception of P,, which is 
,P,. Note that the longer the forecast hori- 
zon, the more data we would lose because 
of our data screens. For this reason, we 
concentrate on one-step-ahead deflator 
forecasts and examine the rationality of 
price-level forecasts rather than inflation 
forecasts. , 

Many previous studies may be biased 
against the hypothesis of rationality because 
they assume that the final revisions of all 
variables contained in agents’ information 
sets were known when forecasts were made. 
Figure 1 shows the effect of using revised 
rather than initial data. It shows the diver- 
gence between the 45-day announcement of 
the GNP deflator and the final data for the 
GNP deflator (as of September 1987), 
rescaled to reflect the benchmark revisions 
of 1976 and 1986. This figure shows that 
there are large systematic differences be- 
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tween these two price series.*! Not only the 
level, but also the rate of growth of the 
series is affected. 

Although we compare the forecasters’ 
predictions to initial GNP deflator data in 
our study, it is important to note that using 
revised data in tests of unbiasedness is not 
necessarily inappropriate. Whether revised 
data should be used depends on two issues: 
First, do forecasters try to predict the initial 
or the revised data? Second, are there sig- 
nificant and predictable data revisions? 

Obviously, if forecasters try to predict the 
revised data, those data should be the de- 
pendent variable in tests of unbiasedness. 
However, in the forecasts studied by 
Zarnowitz (1967) and McNees (1986), pre- 
dictions were, on average, closer to the ini- 
tial announcement than to the revision. This 
suggests that forecasters are, on average, 
trying to predict the initial announcement. 

Regardless of which data the forecasters 
try to predict, data revisions should have 
little effect on tests of rationality unless they 
are significant and systematic. Since initial 
announcements of many variables, such as 
real GNP, are rational forecasts of the final 
data for those variables, the choice of de- 
pendent variable probably would have little 
effect on tests of the rationality of predic- 
tions of those variables. But we show in 
. Section V, Part C that the choice of data is 
very important in testing unbiasedness of 
forecasts of the GNP deflator, as one would 
expect from the evidence in Figure 1. 

Although there are some circumstances 
in which it is permissible to use revised data 
for tests of unbiasedness, it is never permis- 
sible to use data unavailable to the forecast- 
ers as an independent variable in tests of 
efficiency. Thus, for example, if we wish to 
test whether forecasters properly adjust for 
their own past error (P, —, 5P; ,..,), we 
should use the latest available revision of 


?! Note that Zarnowitz (1985) recognized that price- 
level data could not be compared across major bench- 
mark revisions. No other author has recognized this. 
We discuss problems with Zarnowitz's data in Section 
V, Part B. 

See Mankiw and Shapiro (1986). 
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P,_, before the forecaster made his predic- 
tion. We do this in our study. If we used 
final data for this test, the test would neces- 
sarily be incorrect because the final revision 
of P,- was not in the forecaster's informa- 
tion set when he made his prediction. 

We use survey data from the fourth quar- 
ter of 1968 through the third quarter of 
1986. We exclude periods containing the 
annual benchmark revisions, which usually 
occur in July, and forecasters who did not 
respond at least twenty times. Our sample 
contains 1613 observations. Some of our 
regressions use fewer observations because 
data are missing for other variables. 

Like the survey data, our time-series data 
are reported as quarterly averages. The 
GNP deflator data come from the Survey of 
Current Business. Our M1 data come from 
the database of the Federal Reserve Board 
of Governors. Our nominal oil price statis- 
tics come from the Commerce Department's 
Foreign Trade Statistics. 


V. Empirical Results 


We address three questions in our empir- 
ical investigation. First, what is the covari- 
ance structure of the errors? Second, are 
the forecasters’ predictions unbiased? Third, 
are the forecasters’ predictions efficient? 

All of the results presented are tests of 
rationality at the one-step-ahead horizon. 
Thus, our equation is 


(13) Prep = ot Peppy t AXi ei 
E(si., | Ia) =0. 


To test for unbiasedness, we estimate re- 
gressions without X;,. Our null hypothesis 
in that case is that ag=0 and a,=1. To 
test for efficiency, we include some addi- 
tional variable X;, in the regressions. Our 
null hypothesis in that case is that o — 0, 
20,771, a4 —0. 


A. The Covariance Structure 
of Forecast Errors 


Two important questions arise about the 
covariance structure of the forecast errors. 
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TABLE 1— TESTS FOR UNBIASEDNESS AND SERIAL INDEPENDENCE 
x? for 

Method ap a, a Hg MA Regressors N 

(D 5.853 0.9967 - 21.66 0 L, Piu 1613 
(1.319) (0.0008) - (0.00002) 

Q 5.853 0.9967 - 1.226 0 LP. 1613 
(5.321) (0.0031) - (0.5417) 

(3) 5.853 0.9967 - 0.900 1 1, Piet 1613 
(6.227) (0.0036) -= (0.6377) 

(4) 11.38 0.9943 0.3031 19.81 1 LP, 803 
(5.725) (0.0032) (0.0802) (0.0002) P,—-, B; 

(5) 7.078 0.9959  — 0.0021 0.875 1 L,,Pj oua 728 
(7.692) (0.0044) — (0.1106) (0.8315) Pri 1-2 Po 

(6) 7.756 0.9956 | —0.0055 1.341 1 si Pda 1111 
(6.720) (0.0039) (0.0695) (0.7194) iPja 1-1 Pine 


Note: Standard errors are in parentheses under coefficients; significance levels are 
given under x?-statistics. Hp: (æo = 0, a, =1, a; =0). 


First, are aggregate shocks important? Sec- 
ond, are forecast errors serially correlated? 
We address both issues in Table 1. Row 1 of 
Table 1 reports the results of a simple OLS 
regression to test the rationality of one- 
step-ahead forecast errors. The results in- 
dicate that the hypothesis of unbiasedness is 
strongly rejected. However, that rejection is 
misleading because of the magnitude of ag- 
gregate shocks. In this OLS regression, the 
average covariance of the forecast errors 
made in a given period by two different 
forecasters is 58 percent of the variance of 
the average forecast error. This large co- 
variance implies that a large percentage of 
forecast error variance comes from aggre- 
gate shocks, and thus we cannot assume 
that these errors are independent across 
forecasters. Row 2 reveals the importance 
of aggregate shocks. If we correct the stan- 
dard errors in the OLS regression to reflect 
the cross-forecaster error covariance, our 
test results change dramatically. We do not 
reject the hypothesis of unbiasedness using 
a GMM estimator with a covariance matrix 
that accounts for aggregate shocks. In fact, 
the estimated standard error for a, in- 
creases from 0.0008 to 0,0031 when we do 
so. (Note that cmm ^ Bos in this case, 


This test is similar to Figlewski and Wachtel's 
(1981). 


because the model is exactly identified, and 
the instruments are also the regressors.) All 
subsequent results account for the cross- 
sectional correlation of forecast errors that 
is created by aggregate shocks. 

For results presented in Rows 1 and 2, we 
assume that forecast errors are not serially 
correlated. If forecast errors are not serially 
correlated, then the regression coefficient 
on the lagged forecast errors should not be 
significant. In Row 4, we report the results 
of including the lagged error as a regressor. 
The y*-statistic for the null hypothesis 
(ag —0, a, —1, a,=0) clearly shows that 
that lagged error is correlated with the cur- 
rent forecast error. In fact, there is a very 
strong MA(1) correlation (the MA parame- 
ter is 0.3031). 

Rejecting serial independence might 
cause us to question the rationality of the 
price-level forecasts.” However, we need to 
conduct additional tests to see whether this 
serial correlation implies that we could im- 
prove the forecasters’ predictions using data 
they knew when they made their forecasts. 
We test whether two such pieces of data 
could improve our forecasts. First, as shown 
in Row 5, we test whether the second lagged 
error is significant in predicting future 





257 arnowitz (1985) draws this inference. 
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prices. The value for the y?-statistic is ex- 
tremely low, indicating that this error has 
no power to predict future prices. This sup- 
ports the hypothesis of rationality, since the 
second lagged errors were definitely known 
by forecasters when they made their predic- 
tions, ,P; ,,,. Second, as reported in Row 6, 
we test whether the perceived lagged fore- 
cast error helps predict future prices. Since 
the survey elicits not only the predictions 
P; ıı but also ,P;, (the forecasters’ per- 
ceptions of P,), we can construct the fore- 
casters’ perceptions at time ¢ of the one- 
step-ahead forecast errors they made from 
time £ —1 to time f. Since these perceived 
errors are certainly in the forecasters' infor- 
mation sets, they should not help to predict 
future prices. The y*-statistic in Row 6 
shows that the perceived forecast errors do 
not improve price forecasts. Thus, only the 
unperceived part of lagged forecast errors 
helps to predict future prices. (Remember 
that P, is not known when the forecast 
Pi +1 is made.) The presence of MA(1) 
errors arising from unperceived lagged er- 
rors is compatible with the hypothesis of 
rationality. All subsequent regressions as- 
sume an MA(1) error structure. 


B. Unbiasedness Tests 


In Row 3, we present our OLS results 
assuming MA(1) errors and correcting stan- 
dard errors for the effect of aggregate 
shocks. The estimate of a, is 5.853 (with a 
standard error of 6.227), and the estimate of 
a, is 0.9967 (with a standard error of 
0.0036). The value of the x? test for unbi- 
asedness is 0.900. Since this test statistic has 
a probability value of 0.6377, we cannot 
reject the null hypothesis of rationality. 

Such strong support for rationality seems 
incompatible with Zarnowitz's (1985) rejec- 
tion of unbiasedness using the same data. 
The only obvious differences between 
Zarnowitz's study and ours are that he looks 
at the unbiasedness of inflation forecasts 
and does not pool the data. But there is 


The survey data are reported in 1000's, which is 
why our estimated constant is so large. 
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another difference that explains his rejec- 
tion of unbiasedness: Zarnowitz uses re- 
vised data. He uses as his actual inflation 
series the final available data before the 
benchmark revisions of January 1976 and 
December 1980. We find that the annual 
July data revisions of the GNP deflator se- 
ries by the Department of Commerce ap- 
pear to have large systematic components. 
Therefore we use as actual data the final 
values available before each annual July re- 
vision. 

Using revised data, Zarnowitz finds the 
striking result that, for one-quarter-ahead 
inflation forecasts, the hypothesis of ratio- 
nal expectations can be rejected for 27 per- 
cent of the forecasters at the 5 percent level 
and for 69 percent of the forecasters at the 
10 percent level. Over the 1968 to 1979 
period, he finds a mean error of —0.16 
percent for one-quarter-ahead. inflation 
forecasts. Unfortunately, these results arise 
from Zarnowitz's use of revised rather than 
initial data.” 

We reconstructed the data that Zarnowitz 
used for P,,, from different issues of Busi- 
ness Conditions Digest and adjusted the sam- 
ple so that all our regressions would include 
the same observations. Using Zarnowitz’s 
approach, we then estimated individual re- 
gressions for each forecaster in our sample 
using both Zarnowitz’s data and our unre- 
vised data. The difference in the results is 
striking.” The following table summarizes 
our findings. 


Number (and Percentage) of Individual 
Forecasters for Whom Unbiasedness 
Is Rejected 


] Percent 
Level 


5 Percent 


Data Set Level 


Zarnowitz 27(45.0 percent) 13 (21.7 percent) 


Initial Data — 8(13.3 percent) 1( 1.6 percent) 


This table shows that using initial data as 
the dependent variable results in test statis- 
tics that are far more favorable to the hy- 


26Zarnowitz recognized that the use of revised data 
might affect his inference, but he believed the effect 
would not be large. 
We present a complete set of these individual 
regressions in the Appendix. 
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pothesis of unbiasedness. We believe that 
these data are a more accurate reflection of 
what the forecasters were trying to predict, 
and of what their information sets con- 
tained, tham are the revised data used by 
Zarnowitz. 

Our use of levels instead of inflation does 
not account for the difference in the results 
because rejections at the 5 percent level 
increase from 27 percent to 45 percent of 
the sample when we use the price level 
rather than the inflation rate in our regres- 
sions. Clearly our use of initia] data ac- 
counts for the difference of our results from 
Zarnowitz's and our failure to reject ratio- 
nality. 

Testing whether the initial announce- 
ments of the deflator are rational forecasts 
of the revised data that Zarnowitz used 
further indicates the difference between our 
price data and Zarnowitz's. If there were no 
systematic revisions in the data, the 45-day 
GNP deflator announcement should be a 
rational forecast of the revised data. In that 
case, if we regress Zarnowitz's data on a 
constant and the 45-day announcement, the 
coefficient on the constant should be zero 
and the coefficient on the announcement 
should be one.” But when we computed 
that regression, the test statistic for the hy- 
pothesis that those coefficients had their 
assumed values was 15.92. Since this test 
statistic should be distributed as a x2 ran- 
dom variable under the null hypothesis, we 
must reject the notion that the 45-day de- 
flator announcements are rational forecasts 
of the data used by Zarnowitz.” 

As we mentioned before, using revised 
data may not have much of an effect on 
tests of unbiasedness if the initial an- 
nouncement was a rational forecast of the 
final data, as was probably true with 
Zarnowitz's other data series. But for the 
GNP deflator, the choice of dependent vari- 


28Mankiw, Runkle, and Shapiro (1984) devised this 
test. l 

2 We also adopted the suggestion of an anonymous 
referee to test whether this result holds true in each 
half of the data. It does. We reject the hypothesis that 
the 45-day announcement is a rational forecast of 
Zarnowitz's data at the 5 percent level in each half- 
sample. 
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able is important. We cannot determine 
a priori whether forecasters are trying to 
forecast the initial or the revised data, but ` 
the fact that their forecasts are unbiased 
predictors of the initial data and biased 
predictors of the revised data suggests that 
they are trying to predict the initial an- 
nouncement. Thus, it seems more appropri- 
ate to use the 45-day announcement as the 
dependent variable. 

' To the best of our knowledge, all of the 
other authors in the inflation-forecast litera- 
ture have used revised data as the indepen- 
dent variable for their tests and as a condi- 
tioning set in their tests of rationality.?? 
Since the revised data are not in the fore- 
casters’ information sets when they make 
their forecasts, other researchers’ tests of 
rationality are biased toward rejection. 


C. Efficiency Tests 


Since we cannot reject unbiasedness, we 
must test the further implication of rational 
expectations that forecasts be efficient, that 
is, that no readily available information 
could have improved forecast accuracy. This 
involves testing the hypothesis that o = 0, 
a, —1, and a, — 0 in equation (13). Tables 
2, 3, and 4 give the results of our efficiency 
tests. 

First note that none of the regressions 
reported in Table 2 rejects the coefficient 
restrictions implied by efficiency (Hj). In 
Rows 1 through 5, respectively, the follow- 
ing variables are shown not to improve price 
forecasts: the forecaster's perception of the 
price level at the time of his forecast (, P, ,), 
the lagged price level (P,_,), the forecaster's 
previous one-step-ahead prediction (,_, P; ), 
the lagged value of the forecaster's error in 
perceiving the price level (P, , —, P; ,..,), 
and the forecaster's perception of another 
forecaster’s lagged error (, P; , —, , P; ,). Re- 
markably, none of these implications of ra- 
tionality is rejected. 


30Zarnowitz (1967) compares the annual forecasts of 
GNP and several other variables to their initial an- 
nouncements for the years 1953-1963. He recognizes 
the importance of studying price forecasts (p. 138), but 
does not do so in that study. 
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TABLE 2— TEsrs or RESTRICTIONS IMPLIED BY EFFICIENCY 

















x? for 

Method ay ay az Ho MA:  Regressors N 

(1) 4.542 0.8097 0.1906 — 3.847. 1 LP» 1599 
(6.929) (0.1012) — (0.1024) (0.2785) Pit 

#2) 7.243 0.8797 . 0.1200 4.012 1 [zB 1079 

(7.646) (0.0645) — (0.0659) (0.2602) P, 

(3) 741 09112 0.0859 2.331 1 LP ats 1119 
(7.132) (0.0676) (0.0678) (0.5066) iaPua 

(4) 7.876 | 0.9956 0.0200 — 1.259 1 LiBaup 797 
(7.034) (0.0041) — (0.2546) (0.7390) P, a pm Phi 

(5 8.343 0.9953 | —0.0823 2.478 1 1, Pitt 1457 
(6.985) (0.0040) (0.0701) (0.479) Pit 1-1 Pj 


Note: Standard errors are in parentheses under coefficients; significance levels are 
given under x^?-statistizs. Hp: (ag = 0, 2, =1, a2 = 0). 


TABLE 3—FurTHER TESTS OF EFFICIENCY 


. x? for 

Method ao a az Hy . MA Regressors N 

163) 3.092 0.9992 —0.0009 1.0567 1 l, Piro 1613 
(7.549) (0.0056) (0.0017) (0.7875) Pont 

(2) 0.7887 1.0012 -0.0016 1.6601 1 L,PF;,,5 1569 
(7.523) (0.0055) — (0.0016) (0.6458) Po 4-2 

3) 4481 1.0000  —0.0014 1.964 1 ^ AL,P,,p 1613 
(6.011) (0.0045) — (0.0012) (0.5799) M1,.1 

(4) 4.704 1.0000 | —0.0014 1.868 1 1,P,,,j 1569 


(6.214) (0.0047) (0.0013) 


(0.6002) Mi, > 


Note: Standard errors are in parentheses under coefficients; significance levels are 
given under x?-statistics. Hy: (æg 0, a, — 1, a2 — 0). 


TABLE 4—JoiNT Test OF EFFICIENCY 


Pip = Oy + Oy Pietas Py talPN 
+ a P; =P + asPo 1-1 
+ agMl, it Eir 
a 21.882 
(11.451) 
a, 0.806 
! (0.073) 
a 0.191 
(0.073) 
as 0.022 
(0.011) 
a, 0.005 
(0.082) 
as ; 0.006 
(0.003) 
a — 0.005 
(0.004) 
No. of MA Terms 1 
x?-Statistic 8.288 
' Significance Level 0.308 
N “576 


Note: Standard errors are in parentheses. 


LE 


Two variables that varied greatly, and 
presumably had a large effect on the price 
level during the sample period, are the 
money supply and the price of oil. Eco- 
nomic forecasters were accused of systemat- 
ically underestimating the effects of these 
variables on price behavior during the 1970s 
and 1980s. Table 3 examines whether fore- 
casters fully adjusted their predictions to 
changes in the money supply and oil prices. 
This table presents regressions that include 
the previous two lagged values of nominal 
crude oil prices (P5, ,, P5, ;) and M1 
(M1,., M1, 3) as dependent variables. 
The coefficient restrictions implied by ratio- 
nality are not rejected in any of these re- 
gressions. : 

Although the results in Tables 1, 2, and 3 
show strong support for forecast rationality, 
a joint test of forecast rationality using the 
most important variables from those tables 
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TABLE 5—~TEsTs OF FALSE EFFICIENCY RESTRICTIONS 

Method Qo a, a x’ for Ha MA Regressors N 

(1) 8.602 0.7093 0.2908 10.76 1 L,Pij,.p 1129 

. (7.272) (0.0970) (0.0978) (0.0131) B. 

(2) 0.6950 0.0567 0.9431 2353.7 1 L,Pj.p 1172 
(1.380) (0.0195) (0.0196) (0.0000) t RA PA 

(3) 4.268 0.5457 0.4457 110.2 1 Ll,Pj,,p 1178 
(4.349) (0.0441) (0.0438) (0.0000) 14+1F i142 

(4) 10.12 0.9949 0.6891 19.64 1 1,,Piray, 1120 
(5.760) (0.0033) (0.1719) (0.0002) P,- Pin 





Note: Standard errors are in parentheses under coefficients; significance levels are 
given under y*-statistics. Hy: (as = =0, a,;=1, a,=0). 


would provide even more convincing evi- 
dence that these forecasts are rational. 
Table 4 gives the results of a test of ratio- 
nality in which the P,,, was regressed on a 
constant, the one-step-ahead forecast 
Pirer the lagged price level P, ,, the 
lagged perceived forecast error ,P, 


it 
1-1P;,,, the second lagged forecast error 
P.-1—;-2P;,1-1, the lagged nominal crude 
oil price Po, ;-1 and the lagged money sup- 
ply M1, , The x? test statistic for this 
regression "shows that the hypothesis of ef- 
ficiency cannot be rejected.?! 

Given the strong support that the results 
in Tables 1—4 have provided for the hypoth- 
esis of forecast rationality, we might wonder 
whether these tests have any power. Table 5 
addresses this issue. It displays the results 
of regressions that include variables that 
were not in a forecaster's information set 
when pe made his price forecasts. In fact, 
the y?-statistics show that we can reject the 
test for forecast rationality for each of these 
variables. Row 1 of Table 5 shows regres- 
sion results including the contemporaneous 
(but unknown) price level (P,,) as a regres- 


? Although the money supply and oil prices certainly 
do not exhaust the list of information available to the 
forecasters, the fact that forecasters seem to have 
efficiently used this information suggests that the fore- 
casts are rational. Mullineaux (1980) found similar 
results in testing whether the mean Livingston fore- 
caster efficiently incorporated data about money 
growth. 


sor. Row 2 shows the results of including 
the forecaster’s perception of the price level 
at the end of the period (,,,P,,,). Row 3 
results include the forecaster’s next one- 
step-ahead prediction (, , ,P;, ;). Results in 
Row 4 include the forecaster’ S contempora- 
neous error in perceiving the price level 
(P, —,P, 1). The test of efficiency is rejected 
in every case. Thus, these tests have suffi- 
cient power to reject invalid restrictions. 

The results in Row 4 are particularly in- 
structive. The forecaster's perceived fore- 
cast ib from the previous period 
GP, , —,.,P,) is orthogonal to the current 
forécast error (see Table 1, Row 6. How- 
ever, since his unperceived forecast error 
CP, —, P, ,) is not orthogonal to his forecast 
error (see Table 5, Row 4), neither is the 
actual lagged forecast error, because it is 
the sum of the perceived and unperceived 
components (see Table 1, Row 4).32 

The results in Tables 1-5 strongly con- 
firm the hypothesis that survey forecasts are 
rational. The variables expected to improve 
price forecasts do so, and the variables not 
expected to improve price forecasts do not. 
The coefficient restrictions rationality im- 
plies cannot be rejected when the standard 
errors are correctly computed. 

The rational expectations hypothesis im- 
plies that only differential private informa- 


3? Boschen and Grossman (1982) first made this dis- 
tinction between perceived and unperceived forecast 
errors. 
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tion should explain differences among indi- 
vidual forecasts. Our final set of tests checks 
whether differences among the forecasts of 
different forecasters can be explained by 
publicly available information. We tested 
whether the lagged growth rate of M1 or 
the lagged growth rate of cil prices could 
explain the difference between any given 
forecaster’s prediction error and the mean 
prediction error for that period. If either of 
those variables is correlated with idiosyn- 
cratic error components for some forecast- 
ers, then those forecasters did not fully ac- 
count for that publicly available information 
in making their forecasts.) In only one of 
the 60 individual regressions is the rate of 
money growth significant at the 5 percent 
level in explaining the forecaster's idiosyn- 
cratic errors. In only 4 of the 60 individual 
regressions is the rate of oil price growth 
significant at the 5 percent level in explain- 
ing those errors. This suggests that hetero- 
geneity in individual forecasts is not due to 
differential accuracy among forecasters in 
predicting the effects of money growth or 
oil price growth on future prices. 


VI. Conclusion 


Our results indicate that survey respon- 
dents' forecasts of the GNP deflator are 
both unbiased and efficient—and therefore 
rational. We demonstrate the importance of 
accounting for aggregate shocks in order to 
conduct correct statistical inference in panel 
data models. We find that failure to account 
for such shócks when estimating the covari- 
ance matrix for the regression estimates 
leads to false rejection of the hypothesis of 
rationality. We develop a covzriance matrix 
consistent in the presence of aggregate 
Shocks that leads to strong affirmation of 
rationality. We also demonstrate that al- 
though a strong moving average component 
is present in agents' one-step-ahead fore- 
cast error, it does not imply irrationality, 
because it arises from imperfect informa- 


The idiosyncratic component is ei , — Ent 
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tion regarding the current price level. Inter- 


- estingly, this implies that the class of ratio- 


nal expectations models in which only un- 
anticipated inflation can cause output to 
deviate from trend can explain persistence 
in output. Also, we show that using revised 
rather than initial price data to test the 
rationality of forecasts can greatly affect the 
results of those tests. In fact, Zarnowitz, 
who uses the same forecast data we do, 
rejects rationality for inflation forecasts 
solely because he uses revised data. Finally, 
we find that we cannot reject the strong 
implication of rational expectations that 
forecast differences should only result from 
asymmetric information. Readily available 
data do not explain the differences among 
individual forecasts. 

Yet our results are not conclusive evi- 
dence for rational expectations. We find 
that expert forecasters are rational, but 
other people may not be. Still, our results 
and methodological discussion cast grave 
doubts upon the assumptions, methods, and 
conclusions in the existing literature. And, 
although the support we provide for the 
rational expectations hypothesis is limited, 
it takes on added importance when viewed 
from the perspective that almost the entire 
existing literature has rejected the rational- 
ity of price forecasts, even from professional 
forecasters. Hence, our results can be 
viewed as salvaging the possibility that the 
rational expectations hypothesis is empiri- 


-cally valid, and reopening the debate on this 


subject. 

Finally, our finding of the importance of 
aggregate shocks should prompt other 
economists who explore macroeconomic 
questions with panel.data to reexamine their 
statistical assumptions. It seems unlikely that 
the residuals in micro consumption and la- 
bor studies are independent within each 
cross section. Failure to adjust for this de- 
pendence could lead to incorrect statistical 
inference.” 


“Altug and Miller (1990) find aggregate shocks very 
important in their study of consumption and labor 
supply. 
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APPENDIX 
Individual Test Results Using Zarnowitz’s Data 
NBER LD. a a "Ho NBER LD. ay 
3 2.4898 1.0011 2.8829 Te — 8.2951 
(14.961) (0.00974) (0.23558) (12.343) 
8 4.4746 0.99956 1.8702 14 8.3869 
(22.851) (0.01478) —— (0.39255) (10.713) 
15 11.962 ^ . 0.99445 54328 | 21 — 1.1597 
(7.9520) — (0.00496) (0.06611). (21.361) 
22 — 3.9781 1.0052 3.6053 23 20.768 
(18.836) (0.01255) —— (0.16486) (14.039) 
27 24.443 0.98641 5.6445 — 30 19.143 
(10.298) (0.00590) —— (0.05947) (20.582) 
31 ' 20.897 0.99162 12272 ` 32 9.2309 
(11.312) (0.00670) ^ (0.00216) (13.606) 
34 :—0.72462 1.0037 7.5493 38 15.030 
(14.304) (0.00909) (0.02295) (13.080) 
39 14.890 0.99332 4.9309 40 18.354 
(11.189) (0.00688) (0.08497) (13.503) 
42 1.7853 1.0026 9.9783 43 30.891 
(11.301) (0.00716) (0.00681) (11.097) 
44 — 6.3401 1.0059 2.0942 46 8.4090 
(16.299) (0.00979) (0.35096) (16.835) 
47 2.5519 0.99761 0.23846 48 10.198 
(20.115) (0.01140) (0.88760) (16.415) 
49 — 6.8452 1.0079 28.547 51 26.053 
(8.0293) — (0.00522)  (0.0000006) (7.5482) 
54 — 13.488 1.0125 18.141 57 10.081 
(11.293) (0.00725) ^ (0.00011) (15.853) 
59 9.6887 0.99712 6.6950 60 17.905 
(13.193) (0.00866) (0.03517) (8.8023) 
61 15.960 0.99276 4.8359 62 9.8688 
(9.3530) | (0.00537) (0.08911) (14.246) 
64 4.8450 . 0.99910 6.8349 65 11.691 
(10.514) (0.00692) (0.03280) (9.0874) 
66 13.732 0.99389 2.6122 67 22.335 
(10.254) (0.00553) (0.27087) (8.8707) 
68 29.936 0.98663 6.8446 69 , 14.398 
(13.878) (0.00732) ^ (0.03264) (11.620) 
70 8.5655 0.99709 7.1223 72 9.0130 
(7.0671) — (0.00398) (0.02841) (11.390) 
73 20.168 0.99007 13.373 : 75 23.365 
(12.249) (0.00801) (0.00125) (12.945) 
77 — 15.027 1.0112 1.5137 78 2.2050 
(18.470) (0.01204) —— (0.46913) (11.446) 
82 18.430 0.99119 6.5981 84 — 11.607 
(8.9620) (0.00517) (0.03692) (11.200) 
86 7.0478 0.99869 7.0460 87 8.8439 
(9.9970) — (0.00598) (0.02951) (10.752) 
89 15.341 0.99269 4.2050 93 27.050 
(11.190) (0.00666) ^ (0.12215) (15.571) 
96 2.4145 0.99318 0.28821 98 11.779 
(15.160) (0.00915) (0.36580) (7.5815) | 
102 14.569 0.99180 0.77244 108 — 31.562 
(22,167) (0.01409) (0.57962) (22.838) 
109 —21.115 1.0176 11.217 112 ` = 24.921 - 
(20.057) (0.01248) (0.00367) (19.960) 
125 — 5.9854 1.0002 3.3509 138 — 17.728 . 
(26.976) (0.01684) — (0.18722  . (13.384) 
144 —16.815 ' 1.0114 2.1684 145 . — 6.3399 


(14.334) (0.00898) (0.33816) (22.023) 


ay 


1.0087 
(0.00796) 
0.99883 
(0.00709) 
1.0056 
(0.01251) 
0.99066 
(0.00757) 
0.99200 


. (0.01011) 


0.9970 
(0.00890) 
0.99281 
(0.00853) 
0.99225 
(0.00745) 
0.98747 
(0.00605) 
0.99806 
(0.01120) 
0.99800 
(0.01102) 
0.98454 
(0.00452) 
0.99654 
(0.01026) 
0.98964 
(0.00477) 
0.99707 
(0.00728) 
0.99563 
(0.00518) 
0.98815 
(0.00490) 
0.99324 
(0.00678) 
0.99754 
(0.00663) 
0.98999 
(0.00751) 

1.0015 
(0.00724) 
0.99486 
(0.00598) 
0.99406 
(0.00598) 
0.98559 
(0.00925) 
0.99478 
(0.00461) 
1.0250 
(0.01432) 
1.0160 


`- (0.01215) 


1.0132 
(0.00845) 

1.0055 
(0.01328) 
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Ho 


11.229 
(0.00364) 
19.312 
(0.00006) 
4.0226 
(0.01338) 
3.0006 
(0.22307) 
1.3584 
(0.50703) 
9.4499 
(0.00887) 
7.2617 
(0.02649) 
4.1992 
(0.12251) 
20.689 
(0.00003) 
5.4088 
(0.06691) 
12.409 
(0.00202) 
11.952 
(0.00254) 
5.8440 
(0.05383) 
4.9027 
(0.08618) 
2.4453 
(0.29445) 
6.5628 
(0.03758) 
6.3599 
(0.04159) 
2.5578 
(0.27834) 
4.4448 
(0.10835) 
7.3882 
(0.02487) 
6.4330 
(0.04010) 
1.8743 
(0.39174) 
1.7829 
(0.41007) 
3.7746 
(0.15148) 
8.6409 
(0.01329) 
12.634 
(0.00181) 
2.0212 
(0.36401) 
5.8484 
(0.05371) 
1.5882 
(0.45199) 


732 THE AMERICAN ECONOMIC REVIEW SEPTEMBER 1990 | 








NBER LD. ay p? Hg NBER LD. ao a Ho 
148 — 35.524 1.0289 6.4511 158 — 3.1309 1.0053 5.9090 
(41.462) (0.02586) (0.03973) (20.486) — (0.01267) (0.05210) 





Note: Standard errors are in parentheses under coefficients; significance levels are given under y~-statistics for Hp. 


Individual Test Results Using Initially Released Data 








NBER LD. E a, Hy NBER LD. ao ay Ho 
3 9.1382 0.99450 0.47508 7 0.48552 1.0005 0.91773 
(14.126) (0.00920) (0.78857) (10.056) (0.00648) (0.63200) 

8 6.7419 0.99550 0.12371 14 11.003 0.99423 5.8201 
(21.129) (0.01302) — (0.94002) (7.8468) (0.00519) (0.05447) 

15 3.3783 0.99820 0.28922 21 — 0.38588 1.0013 0.37485 
(7.4004) — (0.00462) (0.86536) (15.913) (0.00932) — (0.82909) 

22 — 13.857 1.0102 1.4730 23 12.695 0.99288 2.0525 
(16.682) (0.01111) — (0.4788) (9.2274) | (0.00497) (0.35835) 

27 : 17.579 0.98852 8.4831 30 1.5236 0.99892 0.08735 
(7.7145) — (0.00442) — (0.01438) (14.447) (0.00710) — (0.95726) 

31 19.276 0.99011 6.4269 32 —3.4191 1.0037 2.4887 
(8.9184) (0.00528) (0.04022) (12.352) (0.00808) (0.28812) 

34 4.6144 0.99801 1.1179 38 3.3336 0.99895 1.5414 
i (11.326) (0.00720) — (0.57182) (11.519) (0.00751) — (0.46268) 
39 8.80857 0.99580 1.3134 40 12.607 0.99394 2.0482 
(8.7244) (0.00536) (0.51857) (10.164) (0.00561) (0.35912) 

42 — 5.4555 1.0048 2.2748 43 19.672 0.99235 11113 
(9.8857) | (0.00626) (0.32066) (10.432) (0.00569) ^ (0.00386) 

44 — 11.053 1.0070 0.70276 46 — 4.6930 1.0045 1.1088 
(14.223) (0.00854) (0.70371) (14.882) (0.00990) — (0.57441) 

47 —3.2375 0.99910 2.0598 48 — 0.20145 1.0031 5.7632 
(18.572) (0.01052) — (0.35705) (14.478) (0.00972) (0.05604) 

49 — 5.6613 1.0048 6.0616 51 14.644 0.99033 7.3626 
(6.0361) — (0.00392) (0.04828 ` (6.2969) — (0.00377) (0.02519) 

54 —1.1542 1.0018 2.2354 57 — 5.4562 1.0048 1.5203 
(8.6148) — (0.00553) (0.32704) (13.974) (0.00904) (0.46760) 

59 —1.8680  . 1.0027 1.7603 60 9:2780 0.99340 6.7461 
(11.181) (0.00734) — (0.41471) (6.7636) — (0.00367) (0.03428) 

61 7.5849 6.99623 0.96154 62 — 3.8786 1.0023 0.28313 
(8.5865) | (C.00493) (0.61831) (9.3386) — (0.00477) (0.8680) 

64 . 2.2385 0.99869 0.08927 65 4.3618 0.99774 0.47456 
(9.5383) (0.00628) (0.95635) (6.7932) | (0.00387) (0.78877) 

66. 5.1299 0.99726 0.55304 67 11.565 0.99224 4.8101 
(6.9177) (0.00373) (0.75842) (7.6704) (0.00424) (0.09026) 

68 19.760 - 0.98953 3.9391 69 12.249 0.99186 3.3008 
(10.024) (0.00529) — (0.13952) i (8.4623) (0.00494) (0.19200) 

70 3.4932 0.99804 0.41833 72 . 5.3485 0.99709 0.43405 
(5.4228) (0.00305) (0.81126) ' (83018) (0.00483) (0.80491) 
73 7.1518 0.99709 5.6098 75 23.011 0.98691 7.2293 ` 
(9.7200) — (0.00636) (0.06051) (8.5596) — (0.00496) (0.02693) 

T! — 4.7178 1.0019 0.94257 78 2.0963 0.99917 0.26855 
(14.526) (0.00947) (0.62420) (10.034) (0.00635) (0.87435) 

82 11.719 0.99338 3.2729 84 2.6116 0.99839 0.18663 
(6.4776) (0.00374) (0.19467) (8.3672) (0.00447) (0.91091) 

86 10.445 0.99364 1.9437 87 1.8489 0.99634 5.6542 
(7.6236) — (0.00456) (0.3783 (11.063) (0.00611) (0.05918) 

89 72372 0.39551 0.73126 93 17.439 . 0.99021 1.2036 
(8.9314) — (0.00532) (0.69376) (16.295) (0.00968) — (0.54782) 

96 6.0837 0.99435 5.5642 98 6.4943 0.99638 1.2734 


(9.1806) ^ (0.00554) (0.06191) (6.2992) (0.00383) (0.52903) 
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NBER LD. a a, Ho NBER LD. ay "m Hy 
102 15.938 0.98915 0.79484 108 —20.179 1.0159 5.3174 
(21.261) (0.01352) (0.67205) (21.714) — (0.01362) — (0.07004) 
109 —15.376 1.0120 4.8020 112 —9,5870 1.0046 1.3252 
(17.518) (0.01090) (0.09063) (17.037) (0.01037) (0.51552) 
125 0.11363 0.99449 9.0306 138 —94007 1.0054 0.80104 
(25.318) (0.01580) (0.01094) (11.147) (0.00704) — (0.66997) 
144 —89352 1.0045 2.0037 ^ 145 —2.8352 1.0020 0.05842 
(11.625) (0.00728) (0.36719) (20.514) (0.01237) (0.97121) 
148 —34.150 1.0261 3.9021 158 24112 0.99915 0.31752 
(39.627) (0.02471) (0.14212) (17.144) (0.01060) — (0.85320) 


Note: Standard errors are in parentheses under coefficients; significance levels are given under y?-statistics for Hg. 
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Margin Requirements, Volatility, and the Transitory 
Component of Stock Prices 


By Gixas A. HARDOUVELIS* 


Official margin requirements in the U.S. stock market were established in October 
1934 to limit the amount of credit available for the purpose of buying stocks. 
Since then, higher or rising margin requirements are associated with lower stock 
price volatility, lower excess volatility, and smaller deviations of stock prices from 
their fundamental values. The results hold throughout the post-1934 period and 
are not very sensitive to the exclusion of the turbulent depression years from the 
sample. Thus margin requirements seem to be an effective policy tool in curbing 
destabilizing speculation. (JEL 313, 520). 


Federal regulation of securities margins 
is mandated by Congress in the Securities 
tchange Act of 1934. The stock market 
perience of the late 1920s led Congress to 
e conclusion that credit-financed specula- 
m in the stock market might create exces- 
'e market volatility through a pyramiding- 
pyramiding process.’ Congress reasoned 
at the imposition of margin requirements 
uld constrain the amount of borrowing 
id prevent excessive market volatility, and 
bsequently gave the Federal Reserve ju- 
idiction over the level of margin require- 
ents. Since 1934, the Federal Reserve 
anged the initial margin requirements in 


*Department of Finance, Rutgers University, New 
inswick, NJ 08903; and Research Department, Fed- 
] Reserve Bank of New York, NY 10045. I would 
> to thank the seminar participants at U. C. Berke- 
, Boston College, Columbia, Ohio State, Rutgers, 
Securities and Exchange Commission, the Central 
nk of Greece, and the Federal Reserve Bank of New 
rk, as well as Paul Bennett, Arturo Estrella, Ken- 
h Froot, Steve Peristiani, James Poterba, Anthony 
drigues, and George Sofianos for helpful discussions 
comments; and Valerie Laporte for editorial assis- 
ce. Special thanks go to James O’Brien, Robert 
ller, and two anonymous referees for very thorough 
nments. The views expressed in this artic.e do not 
ect the views of the Federal Reserve Bank of New 
rk or the Federal Reserve System. 

The pyramiding-depyramiding process is reviewed 
Kenneth Garbade (1982) and can be summarized as 
ows: In the absence of broker-enforced margin re- 
rements, optimistic investors with relatively low de- 
es of risk aversion might borrow large amounts of 
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stocks twenty-three times, and the current 
rate is 50 percent (see Table 1).? 

Do initial margin requirements curb spec- 
ulative excesses in the stock market and 
reduce stock price volatility? This question 
has recently gained new importance among 
regulators and students of financial market 
developments following the October 1987 


funds to buy stock, causing a price rise that could not 
be justified by economic fundamentals. The price rise 
might then feed on itself if these speculators used their 
increased wealth to buy more stocks, thus driving prices 
even higher. This pyramiding effect could be followed 
by a market collapse if after an initial price drop, 
brokers and other creditors were to ask for more 
collateral on their loans to speculators. If some specu- 
lators could not provide the additional collateral, credi- 
tors would sell the stocks they kept as collateral, driv- 
ing prices still lower. This outcome would generate 
further calls for collateral, more liquidations, and addi- 
tional price declines. 

An initial margin requirement of, say, 60 percent 
implies that at least 60 percent of the value of the new 
stock purchase should come from the investor's own | 
capital. If the stock price rises after the initial pur- 
chase, investors can withdraw the differential from 
their margin account or can use it to buy additional 
stock on 60 percent margin. If the price declines after 
the initial purchase, investors are not required to add 
funds to their margin accounts unless their equity 
position falls below the so-called maintenance margin, 
which has remained at 25 percent since 1934. Brokers 
typically add a spread over the.official maintenance 
margin that varies across customers and across time. 
See George Sofianos (1968) for a detailed description 
of contemporary margin regulation in the cash and 
derivative markets. 
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TABLE 1—IniTIAL MARGIN REQUIREMENTS 











| Rate Rate 
Effective (Percent) Effective (Percent) 
10/15/34 45 01/16/58 50 
02/01/36 55 08/05/58 70 
11/01/37 40 10/16/58 90 
02/05/45 50 07/28/60 70 
07/05/45 75 07/10/62 50° 
01/21/46 100 11/06/63 70 
02/01/47 75 06/08/68 80 
03/30/49 50 05/06/70 65 
01/17/51 75 12/06/71 55 
02/20/53 50  . 11/24/72 65 
01/04/55 60 01/03/74 50 
04/23/55 70 


Sources: New York Stock Exchange Fact Book, 1987, 
p. 54; Board of Governors of the Federal Reserve 
System, Annual Report, various issues. 

*Rate incorrectly reported in Fact Book. 


abrupt collapse of stock prices and of the 
market mechanism.’ Clearly, theory alone 
cannot provide a definite answer. Those who 
believe that speculation is stabilizing be- 
cause it deepens the market and increases 
liquidity are likely to view margin require- 
ments as harmful. Those who believe that 
an unchecked market is often subject to 
destabilizing speculation are likely to think 
that margin requirements could prevent 
Speculative excesses. The question can only 
e resolved empirically. 

| Surprisingly, there is little, if any, previ- 
ous empirical work that focuses on the role 
of margin requirements in curbing specula- 
tive excesses. Earlier researchers have con- 
centrated primarily on the effect of a change 
in margin requirements on the level of stock 
prices and have found that an increase in 
margin requirements tends to stop and 
sometimes reverse a previous run-up in 
prices. Stock price volatility is first men- 


?See, for example, the "Interim Report of the 
Working Group on Financial Markets," submitted to 
the President of the United States, May 1988. See also 
‘Franklin Edwards (1988) and Arturo Estrella (1988). 
“See Jacob Cohen (1966); Walter Eckardt and 
Donald Rogoff (1976); Corwin Grube, Maurice Joy, 
and Don Panton (1979); James Largay (1973); or James 
Largay and Richard West (1973). Dudley Luckett (1982) 
finds that margin requirements negatively affect the 
ratio of debt to equity in broker-dealer customer ac- 
counts. 
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tioned by Thomas Moore (1966), who ar- 
gues that margin requirements are an in- 
effective tool for controlling volatility. 
He observes that although margin require- 
ments have changed a number of times since 
1934, the volatility of stock prices has re- 
mained relatively stable. James O’Brien 
(1984) discusses the central issue of specula- 


tive excesses but foregoes an empirical in- 


vestigation on the grounds that short-term 
speculative excesses have not been a charac- 
teristic of the post-1929 period. A detailed 
study by the Board of Governors of the 
Federal Reserve System (1984) takes a simi- 
lar position, making no attempt to correlate 
a measure of stock price volatility or a mea- 
sure of speculative excess with the size of 
margin requirements. Two timely articles in 
the literature—one by George Douglas 
(1969) and another by R. Officer (1973)—do 
find a negative association between the level 
of margin requirements and the volatility of 
stock prices. But these articles fall short of a 
complete analysis of the effects of margin 
requirements on stock volatility and do not 
examine excess volatility, the key variable of 
interest. 

This paper conducts an empirical investi- 
gation of the role of initial margin require- 
ments in curbing speculative excesses in the 
cash market. Section I discusses the theoret- 
ical connection between margin require- 
ments and destabilizing speculation. Section 
II confirms the previous evidence on the 
negative correlation between stock price 
volatility and the level of margin require- 
ments. The analysis exploits all the available 
data and controls for many excluded vari- 
ables (particularly those associated with the 
Fed's regulatory response to the economic 
environment) that could have caused a bias 
in the results of Douglas and Officer. The 
robustness of the negative association be- 
tween margin requirements and volatility is 
checked using two measures of volatility, 
one annual and the other monthly, plus a 
vector autoregressive (VAR) analysis. The 
VAR analysis shows that an increase in 
margin requirements is associated with a 
future drop in (1) stock return volatility, (2) 
stock returns, (3) the growth of trading vol- 
ume at the New York Stock Exchange 
(NYSE), and (4) the deflated amount of 
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borrowing for the purpose of buying stocks. 
Thus the evidence is consistent: with the 
hypothesis that margin requirements -con- 
strain the activities of investors whose be- 
havior raises market volatility. 

Section III focuses on the topic of princi- 
pal interest, the relationship between excess 
volatility and margin requirements. It em- 
ploys a regression test of excess volatility 
that is robust to common errors of misspeci- 
fication and allows for a time-varying dis- 
count rate. The test affirms the previous 
evidence on excess volatility and finds that 
excess volatility is weaker both during peri- 
ods of high and during periods of rising 
margin requirements. The natural question 
that follows is which type of excessive mar- 
ket behavior do margin requirements seem 
to reduce? By construction, the test of Sec- 
tion III excludes the popular rational specu- 


lative bubbles hypothesis as an explanation | 


of the excess volatility results. Hence, to 
answer: the question, one has to pursue hy- 
potheses of irrational behavior, and there is 
no well-accepted benchmark model for such 
behavior. 

- Section IV explores forms of irrational 
behavior that give rise to transitory compo- 
nents in stock prices (see Robert Shiller, 
1984, or Lawrence Summers, 1986). One 
implication of the existence of transitory 
components or “fads” is the presence of 
negative correlation between price-dividend 
ratios and subsequent multiperiod stock re- 
turns. Another implication is the presence 
of negative serial correlation in multiperiod 
stock returns. I examine both implications 
and find that the transitory components in 
‘stock prices become less pronounced during 
periods of high and periods of rising margin 
requirements. 

Section V offers some concluding re- 
marks. An appendix contains the detailed 
exposition of the excess volatility test of 
Section III and a description of the data. 


I. Margin Requirements and Destabilizing 
Speculation: Theoretical Linkages - 


The proposition that margin require- 
ments help curb destabilizing speculation is 
based on two implicit claims: The first claim 
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is that speculation by some groups of in- 
vestors can be destabilizing. The second 
claim is that margin requirements can im- 
pose an effective constraint on the market 
activities of speculators. The first claim is 
plausible but is not shared by all economists. 
For example, Milton Friedman (1953) ar- 
gues that speculation is destabilizing only if 
speculators on the average lose money by 
selling when assets are low in price and 
buying when assets are high. There are two 
counterarguments to Friedman's position. 
First, in a dynamic model with speculators 
that enter and exit the market, a few specu- 
lators can destabilize the market and subse- 
quently perish. Second, even in a static con- 
text with perfect competition, it is possible 
to construct theoretical models of desta- 
bilizing speculation featuring speculators 
that do not necessarily lose money on the 


_ average (see Oliver Hart and David Kreps, 


1986; Bradford DeLong, Andrei Shleifer, 
Lawrence Summers, and Robert Waldman, 
1987; or John Campbell and Peter Kyle, 
1988). These models show that speculation 
can destabilize prices, not that it necessarily 
destabilizes prices. Theory cannot predict 
the effect of increased speculation on the 
volatility of stock prices. 

The claim that margin requirements can 
impose a binding constraint on the behavior 
of destabilizing speculators is also plausible. 
Finance theory predicts that the less risk- 
averse investors hold more stocks and less 
cash in their portfolios and are therefore 
more likely to be constrained by the imposi- 
tion of margin requirements than are other 
investors. In practice, the margin require- 
ment would not be an effective constraint if 
speculators could sell other assets they own 
or somehow find a way around the margin 
restriction at no extra cost, or if corpora- 
tions could alter their financing behavior in 
order to attain the desired debt-equity ratio 
of their stockholders. Thus the importance 
of margin requirements as an effective con- 
straint is an empirical question. In the em- 
pirical analysis of Sections III and IV we 
allow for nonlinearities arising from the 
possibility that margins may be: more bind- 
ing during the times when margin require- 
ments change. 
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Il. Margin Requirements and Volatility 


This section contains an analysis of the 
rélation between margin requirements and 
volatility. The volatility: of stock returns, if 
measured properly, can capture unusual 
swings in stock prices caused by the pyra- 
miding-depyramiding effect that prompted 
Congress to establish margin rules some 50 
years ago. Since the pyramiding-depyramid- 
ing process is likely to last a few months, 
he volatility measure should be based on 
horizons that span a properly long time 
period. I employ an annual measure of 
volatility, but for robustness, I also use a 
monthly measure. Measures of volatility that 
are based on daily fluctuations of stock re- 
turns are not suited for analyzing the pyra- 
miding-depyramiding process. 

The section begins with a description of 
the two alternative measures of volatility. 
Then it investigates the economic variables 
that determine the Fed's propensity to 
change margin requirements. Finally, it pre- 
sents multiple regression and vector autore- 
gression analyses of the relation between 
margin requirements and volatility. 





ct 





A. Measures of Volatility 


Let r, denote the real rate of return from 
the last trading day of month t—1 to the 
last trading day of month ¢. The variable r, 
is the nominal return of the Standard & 
Poor's index including dividends minus the 
CPI inflation rate. To construct an annual 
measure of volatility, I first run the follow- 
ing regression: 

12 
(1) r,= Y a,SEASON,, 


i-1 


12 

| t» Bjr,-j t En 
j= 

where SEASON, is a monthly intercept. 

Then I define the annual measure of volatil- 

ity, o,r, as the standard deviation of the 

estimated residuals ê, over a calendar year: 


12 1/2 
(2a) o; zb (rye , 


j=l 
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where the index T denotes calendar years, 
the index j runs over the months of a calen- 
dar year, and é; is the average residual 
over the calendar year. The above definition 
of volatility allows the conditional mean of 
monthly stock returns to vary over time and 
across months. However, it assumes. that 
volatility itself is constant over the calendar 
year. This assumption is not problematic 
because here the aim is to find a good 
empirical measure of long and large swings 
in stock prices and not to arrive at the best 
empirical proxy of volatility per se. 

Margin requirements do not conveniently 
change at the end of a calendar year, and 
hence o, does not match well with the 
margin requirement. For this reason and in 
order to check the robustness of the results, 
I also use G. William Schwert’s (19882) 
monthly measure of volatility: 


Oy, = (11/2) "^ lê; 


where |é,| is the absolute value of the esti- 
mated residuals of regression equation (1). 
Under the hypothesis that the residuals in 
equation (1) are normally distributed, the 
adjustment factor (II /2)? in (2b)— which 
is approximately equal to 1.2533—makes 
On an unbiased statistic of the true stan- 
dard deviation. The variable o, is not a 
smooth measure, but for our purposes it 
suffices that it captures all thé unusual spikes 
in monthly returns. i 

Table 2 presents summary statistics of the 
two volatility measures for the period fol- 
lowing the establishment of official margin 


1 


(2b) 


SAn alternative annual measure of volatility would 
be the simple standard deviation of the monthly re- 
turns. This measure provides very similar results. 
Volatility can also be based on monthly excess nominal 
rates of return, that is, ex-post nominal returns minus : 
the known one-month Treasury bill rate at the begin- 
ning of the one-month holding period. Again, this 
measure is not very different because the volatility of 
nominal stock prices overwhelms both the volatility of 
the CPI and the volatility of the Treasury bill rate. 

Note that the variable of interest in this paper is 
actual volatility, not perceived volatility. Schwert 
(1988b) constructs a measure of perceived monthly 
volatility oe the fitted value of the regression of o>; 
On G,, , .j, Frj and SEASON, for j=1,...,12. 
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TABLE 2— VOLATILITY OF FEAL STOCK 
RETURNS-SUMMARY STATISTICS - 











9, On 
Sample 1935-1987 1934:11-1987:12 
Observations 53 638 

Mean | 0.044 . 0.045 
Standard Deviation 0.019 0.040 
Skewness 1.78 2.31 
Kurtosis . 4.51 9.69 
Minimum 0.018 0.000 
Maximum 0.120 0.332 

Py 0.44. 0.11 

p2 0.21 0.14 

P3 0.04 0.18 

P12 0.01 0.13 

P24 = — 0.02 

P36 - 0.01 

Pag - 0.01 
Rank Correlation —0.34 — 0.12 
t-statistic [— 2.60] [-3.07] 
Notes: o, is the standard deviation from January to 


December of the estimated residuals ê: from the re- 
gression: r, = X2 ,a,SEASON,, + X Birit €; 
where r, is the monthly real rate of return of the S&P 
index including dividends, and SEASON is a monthly 
dummy variable. o,, equals (7/2)/2|£| and is a 
monthly measure of volatility. P1, p2,..., P4g denote the 
autocorrelation at lags 1,2,...,48. Tke rank correlation 
is between the margin requirement cf month ¢ and o,, 
and between the average margin during a calendar 
year and ay. 


requirements. Both o, and o,, are nonnor- 
mal and autocorrelated, and o,, is more 
variable than o,. The table also provides a 
preview of the association between volatility 
and margin requirements. Given the non- 
normality of both measures, rank correla- 
tions are computed instead o? simple corre- 
lations. In both cases the rank correlation 
between volatility and margin requirements 
is negative. The t-statistics of the rank cor- 
relations are quite large but are only partly 
meaningful because they do not take into 
account the serial dependence of volatility. 
Figure 1 plots o,, together with the offi- 
cial margin requirement from 1927 through 
1987. The figure shows that in the late 
twenties and early thirties, when the official 
margin requirement was zero, volatility was 
unusually high. However, during this early 
period of zero official margin requirements, 
brokers imposed substantial margins, and 
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the zero official margin underestimated the 
true effective margin more severely than 
official margin of later periods did. Thus the 
inclusion of the late twenties and early thir- 
ties in the sample period biases the results 
in favor of finding a negative association 
between margin requirements and volatility. 
For this reason, the subsequent empirical 
analysis uses only the period: when official 
margin requirements were in effect (Novem- 
ber 1934 to the present). Naturally, the re- 
sults are now biased against finding a nega- 
tive association because a high volatility—low 
margin period carries zero weight in the 
regressions. 

Figure 1 brings out a second important 
point. Because margin requirements have 
changed very few times, it would be ex- | 
tremely difficult to uncover a potential rela- 
tion to volatility. Indeed, no such relation is 
obvious from the figure, especially during 
the postdepression years. Uncovering a po- 
tential relation between the two variables 
requires careful analysis and appropriate 
control for third factors that affect volatility. 


B. The Federal Reserve's 
Regulatory Behavior 


One factor complicating the empirical 
analysis is the Federal Reserve's reaction 
function. The Fed's behavior in regulating 
margins may have resulted in a spurious 
negative or positive correlation between 
margin requirements and volatility. This 
would be the case, for example, if the Fed 
had altered margin requirements in re- 
sponse to an economic variable that had 
itself been responsible for the change in 
volatility or in response to the change in 
volatility itself. 

The Fed's official reasons for changing 
margin requirements are recorded in the 
various issues of the Annual Report of the 
Board of Governors of the Federal Reserve 
System. The Fed typically attributes its deci- 
sion to increase margin requirements to a 
rapid increase in stock prices and suspected 
speculation and to a rapid expansion in 
stock market credit. Sometimes high trading 
volumes, inflationary pressures, and an ex- . 
panding economy were also given as reasons 
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FIGURE 1. OFFICIAL MARGIN REQUIREMENT AND STOCK VOLATILITY 


for an increase in margins. In deciding to 
decrease margin requirements, Fed officials 
usually cited a drop in stock market credit 
and the disappearance of those factors that 
had led to an earlier increase. Stock market 
volatility was never mentioned as a reason 
for changing margin requirements. The fol- 
lowing excerpt from the 1951 Annual Report 
is characteristic of the explanations supplied 
by the Fed after an increase in margin re- 
quirements: 


Although the total amount of credit in 
use in the stock market had not as- 
sumed heavy proportions, there had 
been some increase during the preced- 
ing months together with increases in 
the volume of trading and in prices of 
securities. The expanding business and 
economic situation appeared to be en- 
couraging stock market activity and 
speculation, and the Board of Gover- 
nors believed that in the existing cir- 
cumstances a further substantial price 
advance supported by a rapid expan- 
sion in stock market credit was a dis- 
tinct possibility. The increase in mar- 
gin requirements was effected as a 
preventive measure. [p. 81] 


Table 3 presents estimates of a hypotheti- 
cal Fed decision rule to change margin re- 
quirements using an ordered-response logit 
model with three states. Briefly, assume that 
the Fed’s unobserved disposition to alter 
margin requirements, Z,, has a logistic dis- 
tribution. Assume also that a decrease in 
margin requirements occurs when Z, « B'X, + 
c,, that no change in margin requirements 
occurs when 8'X, +c; < Z, < B'X,- c, and 
that an increase in margin requirements oc- 
curs when f'X,-- c, « Z, X, represents a 
set of indicator variables and $’, c,, and c, 
are parameters to be estimated. The likeli- 
hood function to be maximized is 


T 
(3) TFA, £6) 


t=] 
x [F(8'X,  c;) - F(B'X, + ¢,)] 
XI F(B'X, c], 


where F denotes the logistic cumulative dis- 
tribution function.” The model is estimated 


"See G. S. Maddala (1983, pp. 46-49) for more 
details. The maximization of the likelihood function 
was performed using GAUSS. 
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TABLE 3—THE FEDERAL RESERVE’s REACTION FUNCTION: ORDERED-RESPONSE LoGiT 
NOVEMBER 1934—DEcEMBER 1987 








Annual Horizon Monthly Horizon 
Variable Coefficient — T-Statistic Variable Coefficient T-Statistic 
€ | —3.96 -2.91 Cy — 4.26 — 8.40 
Cz 5.60 3.34 Cy 4.53 8.81 
P 79.11 2.70 noa 17.34 240 
MCR,., 42.90 1.73 MCR,..,; 15.27 2.39 
0,1 —0.57 —-0.02: Om, 1-1 — 1.63 —0.21 
9,4 119.35 0.72 T, 52.73 1.00 
Y. — 48.75 —1.55 Yi 12.06 0.77 
YL,.4 7.52 1.03 VL, 4 0.25 0.20 
Observations 638 638 
—2 In(Likelihood) — 191.20 200.67 
LR z: real rate of stock return from end of month t —2 to end of month t —1 
(nominal return rens dividends minus CPI inflation rate). 
Pi = average of r, 12s Ägir 


MCR,..; =growth rate of the ratio of broker-dealer credit to mS over - the 
capitalized value of the NYSE from month t —2 to month t — 


MCR,_, «average of MCR, _ 12, MCR,_1,--- 
= CPI inflation rate from month ¢—2 to month t —1. 


Tii 
LE = average Of 7,..15, T; 1p... 
Yi-1 

ti; 
Y, s average of Y, 15, Yin. 
VL,., 
VL, , = average of VL, VL, ay... 
G,,-., See equation (2a) of the text. 
9,,-1 = See equation (2b) of the text. 


MCR, _,. 


ES 
= growth rate of the industrial production index from month £ ~2 to month 


SY 
` &growth rate of the NYSE trading volume from month t —2 to month t = 
VL,- i. 


The dependent variable takes the value of 1, 2, or 3 depending on whether the official 
margin decreased, stayed the same, or increased during month t. 


over the sample period from November 1934 
through December 1987, but the results are 
similar when we include the first change in 
official margin requirements of October 
1934. The included indicator variables are 
the lagged real rate of return of stock prices, 
the lagged percentage change in stock mar- 
ket credit (defined as the dollar amount of 
dealer and broker credit to customers for 
the purposes of buying stocks deflated by 
the capitalized value of the NYSE), the 
lagged stock return volatility measures of 
Table 2, the lagged rate of inflation, the 
lagged growth in the industrial production 
index, and the lagged growth in trading 
volume at the NYSE, With the exception of 
volatility, all these variables were men- 
tioned by the Fed as reasons for changing 


margin requirements. The model is esti- 
mated twice, once using the previous 
month's values of the indicator variables 
and o, as the measure of stock market 
volatility, and a second time using the previ- 
ous year's average monthly values and c, as 
the measure of stock market volatility. In 
the latter estimation, oy is defined over a 
rolling twelve-month interval that ends one 
month before the observation of the depen- 
dent variable. 

The results show that lagged stock re- 
turns and the lagged rate of changes in 
stock market credit were the only statisti- 
cally significant variables influencing the 
Fed's decision to change margin require- 
ments. An increase in the growth of stock 
prices (i.e., the level of stock returns) or in 
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the growth of stock market credit raises the 
probability of an increase in margin require- 
ments. As expected, volatility did not affect 
the decision to change margin require- 
ments.® 

^ Tf the Fed's response to a recent growth 
in stock prices is effective and the market 
subsequently declines, it will be difficult to 
uncover a negative association between 
margin requirements and stock market 
volatility. The reason is that volatility is low 
in bull markets and high in bear markets. 
Thus, after the Fed raises the margin re- 
quirement and the market declines, the bear 
market raises volatility and creates a spuri- 
ous positive relation between the change in 
margin requirements and the change in 
volatility. Hence it is important to control 
for the effect of a rising or declining market 
in our subsequent investigation. The nega- 
tive relationship between the level of stock 
prices and stock price volatility is analyzed 
extensively by Andrew Christie (1982). 
Christie also provides evidence supporting 
Fischer Black's (1976) hypothesis that the 
low debt-equity ratios during periods with 
high stock prices cause volatility to be low. 


C. Evidence from Regression Equations 


Let us turn now to the effect of margin 
requirements on stock price volatility, an 
issue analyzed previously by Douglas (1969) 
and Officer (1973). Douglas controls only 
for the variation in dividends in his regres- 
sion equation, and Officer for the variation 
in the industrial production index. Neither 
paper provides a complete analysis because 
each paper's reference to margin require- 
ments is only incidental. Here I run a more 
complete regression that includes additional 
factors affecting volatility and that uses all 
available observations on volatility. The ad- 
ditional variables include the following: first, 


*The results are qualitatively similar when I use the 
level of stock prices relative to trend instead of the 
level of stock returns. Also, binary logit models show 
that the decision to increase margins is symmetric to 
the decision to decrease margins (margin credit is 
slightly more significant in the case of a decrease in 
margins). 
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the two variables to which the Fed was 
found to respond, namely, the lagged level 
of stock returns and the lagged growth of 
stock market credit; second, the average 
level of inflation and the average growth of 
the industrial production index. These vari- 
ables capture the effect of the economic 
environment on volatility and avoid a possi- 
ble spurious correlation between margin re- 
quirements and volatility. For example, in 
addition to changing margin requirements, 
the Fed may change monetary policy. The 
new monetary policy may affect the econ- 
omy, and the state of the economy may in 
turn affect volatility. In such a case a nega- 
tive correlation between margin require- 
ments and volatility may arise not because 
the change in margins was effective but be- 
cause the simultaneous change in monetary 
policy affected volatility. Third, the volatility 
of the growth rate of the industrial produc- 
tion index serving as a proxy for the volatil- - 
ity of dividends. 

The regression equations are estimated 
using Ordinary Least Squares, but I use the 
method of Lars Hansen (1982) with the 
weighting scheme of Whitney Newey and 
Kenneth West (1987) to adjust the coeffi- 
cient standard errors for conditional het- 
eroskedasticity and the presence of a mov- 
ing average (MA) process in the error term. 
The autocorrelations of the regression 
residuals determine the choice of the lag 
length in the MA correction. 

Table 4A uses nonoverlapping annual ob- 
servations and relates o, ; to the average 
official margin requirement of the calendar 
year, My. Two sets of regressions are re- 
ported, one in which the margin require- 
ment is the only explanatory variable and a 
second set with all the control variables 
added. The estimation is performed over a 
variety of subsamples in order to check the 
sensitivity of the results to particular subpe- 
riods. Figure 1 suggests that a negative asso- 


?In Hardouvelis (1988c), I include lagged volatility 
to control for the possibility—despite the evidence of 
Table 3—that the Fed responds to lagged volatility. 
The inclusion of lagged volatility creates an errors-in- 
variables problem, yet the results are very similar to 
the present results. 
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TABLE 4A—MARCIN REQUIREMENTS AND VOLATILITY —ANNUAL OBSERVATIONS 
T, = Bo + B; M, + Bri, + BucrMCR, + Br, + ByY, + Boya Y, + v, 


R2 


Sample Nobs Bm Br BMcR By By Boy SEE pı p2 P3 P4 es q 
1935-87 53  —0.048* 0.11 0.018 030 0.04 -0.15 -018 -003 1 
(0.024) 
—0.078* .—0.377* —0.546* —1.522 —0.001 0.447 0.35 (015 0.28 -—0.13 -0.27 -019 -0.03 1 
(0.020) — (0.127) (0.152 (0.92 (0.264) (0.262) i 
1947-87 41 —0.037* 0.07 0.013 0.10 —0.18 —0.19 -001 0.10 0 
(0.017) - 
—0.053* —0.203  —0.296 0.156  —0.718* 0.125 0.26 0.012 0.08 —0.11 —023 0.00 -0.07 0 
(0.023) (0.167) (0.196) (0.861) (0.342) (0.289) 
1935-81 47  -0047 0.11 0.018 0.32 0.08 —0.12 -0.18 -0.08 1 
(0.026) 
—0.075* —0.358* —0.510* —1.40 0.002 — 0.514* 0.35 0.015 0.28 —0.08 —028 -0.20 0.01 1 
(0.021) (0.134) (0.165) (0.96) (0.222) (0.232) 
1935-55 21  —0042 0.04 0.023 0.337 016 -013 -026 -019 1 
(0.034) 
—0.043  —0.588* —0.749* —4.90* 0.528* 0.640 0.51 0.017 0.18 —030 —0.16 -011 0.01 0 
(0.023) (0.124) (0.159) (1.24) (0208) (0.361) 
1956-87 32  -0047* 0.11 0.014 016 —0.221 —026 -008 012 0 
(0.018) 
—0.058* —0.225  —0.188 -0.054 —1.11° 0.195 0.30 0.012 0.18 -0.18 -0.12 -0.07 -0.03 0 
(0.027 (0175 (0.251) (1.07) C481) (0.413) 


Notes: øy is the volatility of monthly real rates of return of the S&P index over a calendar yea: (see Table 2). M, 7, MCR , 77, and Y denote the 
average margin requirement, the average real rate of return of the S&P index, the average monthly growth of the ratio of margin debt to the 
value of the NYSE, the average rate of inflation, and the average growth in the industrial production index over a calendar year. o(Y,) is 
the volatility of the monthly growth in the industrial production index over a calendar year defined in a manner similar to g,. Nobs denotes 
the number of observations, R? the coeffic:ent of determination adjusted for degrees of freedom, SEE the regression standard error, and 
Py s Ps the residual autocorrelations at lags 1,...,5. Newey-West (1987) corrected standard errors are in parentheses. The correction is for 
conditional heteroskedasticity and for a moving average of order q. Asterisk, *, denotes statistical significance at the 5 percent level. 


TABLE 4B—MaRGIN REQUIREMENTS AND VOLATILITY—MONTHLY OBSERVATIONS 
Om, = Liza; SEASON; + B,,M, + Br, .,* BucrMCR,_1+ BaT, + B,Y, * Beya Vn + Uu, 





2 
x) 
Sample Nobs Bm B. — Bucr Br By By R SEE pm m P Pe PM HD 4 
1934: 638 -0.037* 0.02 0.040 0.11 0.12 0.16 0.00 —0.07 21.8* 12 
11-87:12 (0.018) [0.026] 
—0.049* —0.233* —0.126* 0.087 -—0.073  0.291* 0.08 0.039 0.04 0.05 0.12 0.08 —0.09 23.6* 12 
(0.016) — (0.035) (0.044) (0.366) (0.092) (0.085) [0.015] 
1947: 492 —0.027* 0.02 0.034 0.03 0.12 0.11 0.01 —0.11 17.2 3 
1-87:12 (0.013) [0.102] 
—0.034* —0.204* —0.088 0.047 -—0.325* 0.077 0.07 0.033 —(0.03 0.06 0.08 0.01 —0.11 22.7* 0 
(0.012) (0.053) (0.051) (0.388) (0.129) (0.151) [0.020) 
1934: 566  —0.039* 0.02 0.039 0.12 0.13 0.19 0.11 —0.08 24.7* 12 
11-81:12 (0.019) [0.010] 
—0.049* —0.231* —0.336* 0.091 —0.072  0.274* 0.08 0.038 G.05 0.04 0.14 0.10 —0.11 26.1* 12 
(0.017) (0.039) (0.047) (0.381) (0.094) (0.082) [0.006] 
1934 254 —0.033 0.02 0.046 0.14 0.14 0.18 0.10 —0.09 28.4* 12 
11—535:12 (0.027) [0.003] 
—0.045* —0.232* —0.118* —0.159 —0.029  0.242* 0.07 0.045 0.08 0.05 0.13 0.08 —0.12 28.1* 12 
(0.022) (0.042) (0.051) (0.417) (0.106) (0.094) [0.003] 
1956: 384 -—0.031* 0.03 0.0355 0.G5 0.11 0.14 0.01 —0.10 16.1 5 
1—87:12 (0.015) [0.138] 
—0.031* —0.209* —0.095 0.634 -—0.261 0.066 0.08 0.034 —0.03 0.05 0.10 0.02 —0.11 17.5 3 
(0.015) — (0.064) (0.070) (0.640) (0.152) (0.203) [0.093] 





Notes: C, is a monthly measure of volatility of the monthly real rate of return of the S&P index (see Table 2). SEASON; is a monthly dummy 
intercept, M, r, MCR, vr, and Y are the official margin requirement, the real rate of return of the S&P index, the growth in the ratio of margin 
debt to the value of the NYSE, the rate of inflation, and the growth in the industrial production index. ¢(Y,,) is a monthly measure of 
volatility for Y defined in a similar manner as Sm. p;,...,924 denote the residual autocorrelations at lags 1,...,24. Newey-West (1987) 
corrected standard errors for conditional heteroskedasticity and a moving average of order q are in parentheses. Significance levels are in 
brackets below each x (11) statistic. Asterisk, *, denotes statistical significance at the 5 percent level. 


ciation may be due entirely to the depres- cur in the first half and eleven in the second 


sion years. Thus I reestimate each equation 
using the usual postwar sample from 1947 
to the present. I also partition the sample in 
the middle, at the end of 1955, so that 
eleven changes in margin requirements oc- 


half. Finally, I show the results for the 
1935-81 sample in order to exclude the 
period of existence of a futures market in 
the S&P index. Margin requirements in the 
futures market are much lower than the 
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TABLE 4C—CHANGE IN MARGIN REQUIREMENTS AND CHANGE IN VOLATILITY 
SAMPLE: 22 CHANGES IN MARGIN REQUIREMENTS, 1936-1974 


Ag, , 7 ag t am AM, t a, AF, + auc AMCR , +a, AT, + a, Ay, ay An Y) e, 


an a, &MCR ou 
— 0.009 
(0.028) 
—0.158* — 0.070* —0.984* — 0.367 
(0.060) (0.033) (0.344) (0.214) 


(0.032) 


Degrees 
of 
ay ayy R SEE Freedom 
0.387 —0.02 0.021 19 
(0.319) 
0.041 0.408 0.17 0.019 15 
(0.392) 


Notes: A denotes the change of a variable from period (t — 12, t — 1) to period (t +1,t +12), where r is the month 
when a margin change occurs. Ao, ,, Ar,, AMCR ,, A7r,, Ay,, and Ao(Y, ,) are defined this way. AM, is the change 
in margin requirements during month ¢. Asterisk, *, denotes statistical significance at the 5 percent level. 


post-1974 cash market margin of 50 percent, 
creating an effective margin requirement 
lower than 50 percent. 

Table 4A shows a statistically significant 
negative relationship between margin re- 
quirements and volatility. The results are 
stronger when the control variables are in- 
cluded in the regressions, as was expected. 
For example, consistent with our earlier dis- 
cussion, volatility is negatively associated 
with stock returns; thus, excluding stock re- 
turns would bias the correlation between 
margin requirements and volatility in the 
positive direction. The negative assóciation 
between margin requirements and volatility 
is observed in every subperiod, which is 
quite impressive. After all, the variability of 
the official margin is minimal and checking 
for subperiod relationships is asking too 
much of the data. : 

Table 4B repeats the analysis of Table 4A 
but uses monthly observations and relates 
the monthly measure of volatility, o,s, to 
the margin requirement M,. The regres- 
sions also include twelve separate monthly 
intercepts. The hypothesis that these inter- 
cepts are equal is rejected. The overall re- 
sults are very similar to the results of Table 
4A, an outcome that is reassuring. Margin 
requirements show a significant negative re- 
lation to volatility, and this negative relation 
is present in every subperiod. 

The overall results are weaker than the 
findings of Douglas but stronger than those 
of Officer. Douglas relates the level of 
volatility to the level of margin require- 
ments but uses four-year averages of these 
variables and looks at a cross section of 


r 


stocks. Officer relates the change in volatil- 
ity to the change in margin requirements. 
His sample begins before the establishment 
of official margins and ends in 1970. Since 
Officer finds a weaker relation than I do, it 
is interesting to uncover the source of our 
differences. Table 4C repeats Officer’s exer- 
cise with my data and the post-1935 sample 
period. The table isolates the 22 cases since 
1935 when a change in the official margin 
requirement took place, and regresses the 
change in the annual measure of volatility 
from the year spanning months t—12 to 
t —1 to the year spanning months ¢+1 to 
t +12 on the change in the margin require- 
ments of month t. The first row of Table 4C 
contains the results of the regression used 
by Officer: the only explanatory variables 
are the change in the margin requirement 
and the change in the volatility of the 
monthly growth of the industrial produc- 
tion index. The results are weaker than 
Officer's. The second row adds the extra 
proper controls and shows that, indeed, 
margin requirements have a very strong and 
statistically significant effect on volatility. 
Clearly, Officer's weak results were due to 
misspecification. He excluded factors that 
affect both volatility and margin require- 
ments. 


10Officer does not describe his timing very well, and 
it is possible that he may have examined the difference 
between the volatility that spans months t —5 to t 6 
and the volatility that spans months f —6 to 1 +5. Also, 
the R? coefficients in his regressions are very high, 
which is puzzling. 
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TABLE 4D—-MAarGIN REQUIREMENTS AND VOLATILITY- VECTOR AUTOREGRESSIONS 
SAMPLE; 1935:10-1987:12 ~ 
INDEPENDENT VARIABLES: 12 LAGS OF o, ,, M,, r,, AND MCR, PLUS 
12 MONTHLY INTERCEPTS 











Independent Variables 


D M Tm 
ependent aa l MNEEES 
Variable SUM | x*12 SUM xa 
Onn —0.025* 20.4 0.55*  30.8* 
(0.012) [0.059] (0.13) [0.002] 
M 0.96* 10046*  —0.13 7.6 
(0.012) — [0.000] (0.07) [0.819] 
r — 0.020 15.9 0.08 132 
(0.015) [0.195] (0.15) — [0.351] 
MCR —0.000 60.3* —0.14 20.9 
(0.013) [0.000] (0.11) [0.051] 


r MCR 
SUM | xy:02) SUM VOD R? SEE 
~0.29*  503* | -031* 178 0.17 0.037 
(0.13) — [0.000] (0.12) [0.123] 
0.27* 168 0.14 134 0.94' 0.034 
(0.11) [0.159] (0.00 [0.337] 
032  282*  -002 304* 0.06 0.046 
(0.17) [0.005] (0.15) [0.002] 
0.78* 1282* 0.28* 32.0* 0.32 0.039 
(0.15 [0.000] (0.14) [0.001] 





Notes: o,,, M, r, and MCR are defined in Table 3. SUM denotes the sum of the coefficients of the twelve lags of 
each independent variable. White (1980) heteroskedasticity-consistent stendard errors are in parentheses. The 
x (12) statistic tests that the coefficients of the 12 lags are jointly zero. Significance levels are in brackets. Asterisk, 


*, denotes statistical significance at the 5 percent level. 


D. Reduced Form Evidence: 
Vector Autoregressions 


The negative relation between margin re- 
quirements and volatility does not necessar- 
ily imply causation from margin require- 
ments to volatility. However, the opposite 
causation from volatility to margin require- 
ments is not supported by the evidence and 
is not intuitive. As I mentioned earlier, there 
is not a single instance when the Fed cited 
volatility even as a remote cause for its 
decision to change margin requirements. 
Also, the ordered-logit results of Table 3 
show no such relation. Furthermore, sup- 
pose for the sake of the argument that the 
Fed observed an increase in volatility and 
became concerned about excessive specula- 
tive activity. Under such a scenario, the Fed 
would ordinarily increase margin require- 
ments; only a decision to decrease the re- 
quirements would justify the negative asso- 
ciation between the two variables. Despite 
these arguments, Schwert (1988b) criticizes 
an earlier version of this paper on the 
grounds that the negative association be- 
tween the two variables reflects the ef- 
fects of volatility on margin requirements. 
Schwert regresses volatility on twelve of its 
own lags and on twelve lags and twelve 
leads of the change in the official margin. 
Finding that lags of margin changes have a 


small negative effect while leads of margin 
changes have a large negative effect on 
volatility, he concludes that margin require- 
ments do not affect volatility but volatility 
affects margin requirements. However, 
Schwert's conclusion is hasty. Further exam- 
ination shows that the link from lagged 
volatility to current margin requirements re- 
flects third factors. 

Table 4D presents multivariate autore- 
gressions with four variables and twelve lags. 
The variables are the level of margin re- 
quirements, the monthly measure of stock 
volatility, the monthly real rate of return of 
stocks, and the growth in the ratio of mar- 
gin credit to the value of the NYSE. Thus 
Table 4D adds the two control variables 
that are significant in Tables 3 and 4B to 
the usual bivariate relation between margin 
requirements and volatility. Observe that 
the Schwert results are now reversed: There 
is no effect from volatility to margin re- 
quirements, but there is an effect from mar- 
gin requirements to volatility. The table also 
suggests an explanation for Schwert's coun- 
terintuitive results: Consistent with our ear- 
lier discussion, stock returns affect volatility 
negatively and quite strongly, and margin 
requirements positively. Thus the negative 
effect of volatility on margin requirements 
that Schwer: finds may simply reflect the . 
positive effect of stock returns on margin 
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FIGURE 2. IMPULSE RESPONSE TO A POSITIVE SHOCK IN OFFICIAL 
MARGIN REQUIREMENTS 


requirements. Of course, other influences 
are also present. The table shows multidi- 
rectional feedback between the variables—a 
finding that makes individual results hard to 
interpret. Also, the reduced form coeff- 
cients of a VAR model cannot be given a 
precise structural interpretation in general. 
In the case of margin requirements in par- 
ticular, the VAR model treats margin re- 
quirements as a continuous variable that 
depends on its own past lags, which does 
not have a meaningful interpretation. 

One way to visualize the dynamic effects 
of margin requirements on volatility and on 
other variables of interest is to trace through 
time the effects of a shock to margin re- 
quirements using a previously estimated 
VAR system. Such an exercise can provide 
a useful final summary of the direction of 
the various responses, although in the ab- 
sence of standard errors, it cannot assess 
the strength of these responses. To perform 
the exercise, I added to the previous list two 
new variables: the volatility of the industrial 
production index and the growth in trading 
volume. The first variable was significant in 
the regressions of Tables 4A and 4B when 
the full sample was used. The second vari- 


able is an interesting output variable to 
examine. Computational limitations did not 
permit the inclusion of more variables. 

Figure 2 presents the impulse response of 
stock volatility and real stock returns to a 
temporary shock in the official margin re- 
quirement. The margin requirements shock 
itself persists for about two years and dies 
out completely after ten years. The return 
to zero is a reflection of a mean-reverting 
tendency of the official margin during the 
sample period. Reàl stock returns respond 
negatively for about two years. Volatility 
also responds negatively and for a longer 
time. The size of the initial volatility re- 
sponse is comparable to the size of the 
regression coefficients in Table 4B. Note 
that most of the previous literature on mar- 
gin requirements concentrated essentially on 
one of the two pieces of information in the 
figure—stock returns—and ignored volatil- 
ity. 

Figure 3 provides some more interesting 
evidence. It shows the impulse response of 
margin credit and trading volume. Both 
variables fall after an increase in margin 
requirements. Thus taken together, Figures 
2 and 3 provide evidence consistent with the 
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Note: Margin credit is the ratio of the broker-dealer credit to customers over the 


capitalized value of the NYSE. 


hypothesis that margin requirements con- 
strain the activity of investors whose behav- 
ior raises market volatility." 


III. Margin Requirements 
and Excess Volatility 


The previous section showed that margin 
requirements are negatively correlated with 
stock price volatility. However, volatility per 
se is not a direct measure of speculative 
excess. A better measure is excess volatility, 
defined as volatility that cannot be ac- 
counted for by the variation of current and 
future dividends and discount rates. Al- 
though the regressions of Tables 4A, 4B, 
and 4C controlled for the variation of fun- 
damental determinants of stock prices, they 
did not impose the present valve relation on 
the data and thus cannot be interpreted as 
referring to excess volatility. This section 


UI have repeated most of the empirical work of this 


section using an index of small stocks—the two small- 
est deciles of the NYSE in terms of capitalized value. 
The results for small stocks are stronger than the 
results for the S&P index and were reported in 
Hardouvelis (1988c). Some of those results can also be 
found in Hardouvelis (1988b). 


investigates the relation between margin re- 
quirements and excess volatility, employing 
a regression-based test used previously by 
Louis Scott (1985) and Kenneth Froot 
(1987). 


A. Theoretical Framework 


The test of excess volatility is based on a 
very common test for the rationality of ex- 
pectations, the so-called unbiasedness test. 
Consider a variable y, and let f denote an 
earlier forecast of y and u the correspond- 
ing forecast error: y,;2f,tu,,, The 
variable f is a rational (optimal) forecast of 
y based on publicly available information if 
the forecast error u is uncorrelated with the 
forecast f. The unbiasedness test can be 
performed by running the regression 


(4) Ui 1m Bot Bife t Eri 


and testing the hypothesis that By — 0 and 
B,* 0. Froot (1987) observed that the na- 
ture of the rejection of B, — 0 has an inter- 
esting interpretation: 8, « 0 implies excess 
volatility, while 8,» 0 implies insufficient 
volatility (see Appendix A for details). For 
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my purposes, it is sufficient to show that the 
rejection of the rationality hypothesis is 
more severe in periods with low or decreas- 
ing margin requirements. Nevertheless, I 
adopt Froot's interpretation to claim that 
excess volatility worsens in those periods. 
To apply the test to stock prices, I follow 
Kenneth West (1988) and Froot (1987) and 
fix investors' holding period horizon to, say, 
one period. Note that the period length can 
be varied arbitrarily for empirical purposes. 


For this holding period, the present value , 


model implies that the stock price is the 
discounted present value of the expected 
nominal dividend, E,D,,,, plus the ex- 
pected proceeds from selling the stock at 
the end of the period, E,P,,,: 


(5) P, (E,P,, ,"FE,D,,) / (10 ER, y), 


where E,R,,, is the nominal discount rate 
or required rate of return. Put differently, 
equation (5) claims that the required rate of 
return equals the expected rate of return, 
ruling out opportunities for expected prof- 
its. In the test's framework, P, serves as the 
forecasting variable f. The variable y would 
then be the present value of the actual 
proceeds of holding a stock for one period 
and selling it at the end of the period: 


(6) x19 (Part Diri) /( F ERA). 
Let us now express the required rate of 
return as the geometric sum of the observ- 
able nominal risk-free rate, i,, and a risk 
premium, £,6,,: 

() 1*ERí 7 (i) £,8,41)- 


The actual nominal rate of return, R,,,, is 
as follows: 


(8) 1+ Ra = (Sat D)/P. 
Identities (6), (7), and (8) lead to 


(9) Vet = PAL+R,4,)/A+i) C+, 6,41) 
= PL Risi m i, 7 E,6,41)- 


From equations (5) and (9), the forecast 
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error u,,, ÍS 
(10) u= Ya P, 
= PA R417 i, E01). 


Using equation (10), one could proceed to 
regress u,,, on P,, as in equation (4). One 
problem with the test is the nonstationarity 
of the level of stock prices. Since P, appears 
both on the right-hand side and the left- 
hand side of the regression of u,,, on P, 
and is nonstationary, the regression results 
become meaningless. This problem can be 
resolved, however, if we can find a varia- 
ble x, such that P, / x, is stationary. We can 
then regress u,,,/x, on P,/x,. 

Suppose we find a nonzero correlation 
between u,,, and P,—that is, either excess 
or insufficient volatility. What are the eco- 
nomic reasons for such a phenomenon? 
Robert Flood, Robert Hodrick, and Paul 
Kaplan (1986) argue that one popular expla- 
nation in the literature, rational speculative 
bubbles, cannot account for such an empiri- 
cal finding. A bubble component in a stock 
price, 5,, is rational if it is expected to grow 
at the required rate of return E,R,,, (see 
Hardouvelis, 19882). Thus a rational bubble 
is of the form 


b, í = (O0 E, R,41)b, + Vi 

with E,v,,,= 0, 
where v,,, is a white noise shock to the 
bubble. The presence of a rational bubble 
adds to the forecast error u,,, = y, — P, 
the term 


b, ZU E,R,41) = b, 


= Uj41/(1+ ER), 


which is white noise and is uncorrelated 
with P,. 


B. Empirical Evidence on Excess Volatility 
Let us begin by testing for deviations from 


rationality by regressing the forecast error 
u,,, Of equation (10) on the stock price P,. 
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Following Scott (1985) and Froot (1987), I 
deflate the stock price by a measure of 
dividends to transform it into a stationary 
series. In our framework, the choice of divi- 
dends as the deflator series provides statisti- 
cal power because deviations from funda- 
mentals affect the price-dividend ratio. I use 
a smoothed dividend measure, a three-year 
rolling average. The price-smoothed divi- 
dend ratio has even more power to detect 
deviation from fundamentals. Economic 
theory claims that in the absence of bub- 
bles, the price-dividend ratio (as well as the 
price-smoothed dividend ratio) is stationary. 
John Campbell and Robert Shiller (1987) 
argue that it is also stationary in practice. In 
order to test the null hypothesis of nonsta- 
tionarity, I regress the first difference in 
P,/d, on its lagged level and on twelve 
lagged first differences over the sample pe- 
riod from 1930 to the present. The t-statis- 
tic of the lagged level is —3.2, which is 
significant at the 5 percent level (see Wayne 
A. Fuller, 1976, p. 373).? 

The basic regression equation is as fol- 
lows: 


(11a) (R, DU idi) 
= Bo + Bi P.-~/ d, X) + &5 


where k is the number of months in the 
holding period horizon, i,_, is a k-month 
yield to maturity of a Treasury security ob- 
served at the end of month t — k, P, , is 
the stock price at the end of month : — k, 
and d, , is the average dividend over the 
period from t — k —35 through t — k. Note 
that the risk premium, which is unobserv- 


12Schwert (1987) cannot reject the null hypothesis of 
nonstationarity in the postwar data. Similarly, I was 
unable to reject the null hypothesis for the price- 
smoothed dividend ratio in Schwert’s 1947-87 sample 
or in the 1940-87 sample. The inability of the test to 
reject the null in these smaller samples is probably an 
indication of the test’s low power (see John Cochrane, 
1988, for related criticisms) and is not a problematic 
issue. The price-smoothed dividend ratio is more sta- 
tionary than price itself. Put differently, a plot of the 
price-smoothed dividend ratio across time does not 
reveal any obvious explosive behavior, and for the 
purposes of this paper this result is all that matters. 
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able, is missing from the above regression 
equation. Thus the error term contains noise 
plus the term E, 4,0/(P, , /d,. ,). This may . 
bias the coefficient estimates, but we show 
later that the bias is against finding excess 
volatility. Note also that in (112a), the price- 
dividend ratio is not forecasting future ex- 
cess returns but the product of excess 
returns with the price dividend ratio. In 
Section IV, I will analyze the price-dividend 
ratio as a predictor of future excess stock 
returns; the latter regression has a slightly 
different interpretation. 

Table 5À presents the regression results 
of equation (11a) for holding period hori- 
zons that range from one month to five 
years. To check the sensitivity of the results 
to the inclusion of the turbulent depression 
years, equation (11a) is reestimated exclud- 
ing the 193Cs from the sample. The table 
presents both sets of estimates. The estima- 
tion uses nonoverlapping observations for 
the horizons of one month, three months, 
and one year, and overlapping annual ob- 
servations for the remaining horizons.” All 
reported standard errors are corrected for 
conditional heteroskedasticity. In the hori- 
zons of two to five years, the procedure of 
Newey and West (1987) is used to correct 
the standard errors for the moving average 
generated by the overlapping of the data. 

The Chi-squared statistic for By — 0 and 
P, 0 tests the joint null hypothesis of fore- 
cast rationality and a zero risk premium. As 
expected, this joint hypothesis is rejected. 
The slope coefficient B, is negative, indicat- 
ing the presence of excess volatility (assum- 
ing the risk premium is constant). Surpris- 
ingly the results for the postdepression 
sample show even stronger excess volatility. 
In the full sample, the one-month horizon 
estimate of f, is insignificant and is similar 
to the evidence of Froot. The results for 


The results are similar when monthly overlapping 
Observations are used. These results are available on 
request. 

V Froot uses the CRSP data on the return of the 


NYSE index. He also reports results for a horizon of - 


one year, but in the absence of appropriate data on the 
one-year risk-fre2 rate, he rolls over one-month T-bill 
rates in order to construct excess one-year returns. 
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TABLE SA—ARrE Stock Prices ExcEssivELY VOLATILE? 


P. 
(R,- i. z ‘| - moz : Jes 
d tk d, -k 























Horizon Sample Nobs Bo Pi R? SEE DW Ho: Bg = 8120 
Full Sample 
1 month 1929-87 707 0.41* — 0.012 0.00 1.25 1.81 9.67* 
(0.19) (0.08) [0.008] 
3 months 1929-87 233 0.99* —0.026 0.00 2.02 1.65 11.02* 
(0.43) (0.018) [0.004] 
1 year 1930-87 58 5.30* —0.160*. — 0.04 4.82 1.83 10.70* 
(1.78) (0.070) [0.005] 
2 years 1931-87 57 11.80* -0.371* 0.12 6.91 0.86 18.05* 
(3.14) (0.130) [0.000] 
3 years 1932-87 56 16.53* —0.497* 0.16 8.16 0.45 20.34* 
(4.49) (0.191) [0.000] 
4 years 1933-87 55 21.89* — 0.623* 0.19 9.34 0.44 22.69* 
(5.45) (0.223) [0.000] 
5 years 1934-87 54 19.34*  — 0.367 0.03 12.40 0.75 15.49* 
(7.88) (0.307) [0.000] 


Postdepression Sample 





1 month 1949-87 575 0.57* —0.017* 0.01 1.11 1.85 22.31* 
(0.18) (0.008) [0.000] 

3 months 1940-87 189 1.46* — 0.040* 0.02 1:77 1.67 21.83* 
(0.44) (0.018) [0.000] 

1 year 1941-87 47 6.63* —0.189* 0.08 3.97 2.03 17.59* 
(1.98) (0.076) [0.000] 

2 years 1942-87 46 13.41* —0.376* 0.19 5.07 1.02 34.32* 
(3.22) (0.128) [0.000] 

3 years 1943-87 45 20.83* —0.576* 0.31 5.84 0.77 64.37* 
(4.17) (0.179) [0.000] 

4 years 1944-87 44 28.35* — 0.766* 0.35 7.20 0.65 69.55* 
(5.28) (0.230) [0.000] 

5 years 1945-87 43 30.78* —0.710* 0.16 10.77 0.88 38.68* 
(7.90) (0.334) [0.000] 
P, t~l P, fi 

e(t) pot (27 Jus 
Sample Nobs Bo By R? SEE DW 
Risk-Proxy 1 1929-87 706 0.77* 0.016* 0.02 . 1.05 2.03 


(0.16) (0.006) 

1929-87 706 Monthly 0.043*  —0.01 1.06 2.05 
Seasonals (0.002) 

Risk-Proxy 2 1929-87 706 0.00 0.012* 0.23 0.03 1.88 
(0.001) (0.001) 





Notes: R, is the nominal return on stocks including dividends from the end of month t — k to the end of month t. 
i,_, is the yield to maturity of a k-month security at the end of month ¢— k. Returns are not annualized. P,_, is 
the stock price at the end of month t — k and d,. is the average dividend over the 36-month period before t — k. 
O,.., is the risk premium at the end of month t — 1. The 1-month, 3-month, and 1-year horizons use nonoverlapping 
Observations. The 2- through 5-year horizons use overlapping annual observations. Corrected standard errors are 
inside the parentheses. The correction is for conditional heteroskedasticity and for the moving average generated by 
the overlapping data. R? is the coefficient of determination adjusted for degrees of freedom, SEE is the regression 
standard error, and DW the Durbin-Watson statistic. * and # statistical significance at the 5 percent and 10 percent 
level in a two-tailed test. The last column shows chi-squared statistics with their significance levels in brackets. 
Risk-Proxy 1 is the absclute value of the residuals of the one-month horizon regression (multiplied by 1.2533); its 
two regressions are estimated using Cochrane-Orcutt. Risk-Proxy 2 is the spread between the annualized yields of 
Moody's Baa and Aaa corporate bonds at t — 1, times P, ., /d,_,. This regression is estimated in first-differenced 
form. 
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the remaining horizons of three months and 
one to five years are significant and repre- 
sent entirely new evidence. Scott’s evidence 
. refers to an even longer horizon and is 
similar, although he did not specifically in- 
terpret his rejection of the present value 
model as originating from excess volatility. 

Table 5A also presents the results of 
regressing proxies of the excluded 
E, .,0/(P, ,/ d, .,) term on the indepen- 
dent variable P, , /d,_,. The first risk proxy 
is based on the volatility of the residuals of 
regression equation (11a) when the holding 
period is one month. The volatility measure 
is proportional to the absolute value of the 
residuals, but the results are similar when a 
rolling standard deviation is used instead.'? 
For robustness, I also use a second risk 
proxy based on the product of the price- 
dividend ratio and the spread between the 
end-of-month Moody's Baa and Aaa long- 
term corporate bond yields. All regressions 
uniformly show that the excluded risk term 
is positively correlated with the indepen- 
dent variable, and thus its exclusion biases 
the estimated coefficients against finding ex- 
cess volatility. 

The use of market-determined multi- 
period discount rates makes the excess 
volatility test of equation (11a) immune to 
the misspecification problems that arise 
when a single period discount rate is cumu- 
lated to construct a multiperiod discount 
rate. Flood, Hodrick, and Kaplan (1986) 
argue that if misspecification errors plague 
the single period discount rate, these errors 
cumulate in the constructed multiperiod 
discount rates and result in progressively 
stronger rejections of the present value 
model as the holding period horizon length- 


"Robert Malkiel (1975) and Robert Pindyck (1984), 
among others, claim that the volatility of stock prices 
and the equity premium are positively correlated. 
Robert Merton (1980) uses a similar working assump- 
tion. However, Kenneth French, William Schwert, and 
Robert Stambaugh (1987) find a very weak link be- 
tween expected volatility and realized excess stock re- 
turns. As Poterba and Summers (1986) point out, the 
weak link may be due to simultaneity: There is oppo- 
site negative feedback from stock returns to volatility, 
an issue that we discussed extensively in Section I], 
Part B. 
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ens. They use this insight to claim that 
West's (1988) findings on excess volatility 
are due to such misspecification errors. They 
point out that rejections of the present value 
model grow stronger with the length of the 
holding period horizon in West's test, and 
take this as evidence that the cause of the 
rejection is misspecification rather than ex- 
cess volatility. In contrast, the rejections of 
the present value model in Table 5A are 
uniform across holding period horizons. In 
Table 5A returns are not annualized and 
the estimated slope coefficients f, increase 
proportionately with the holding period 
horizon. Annualized returns would result in 
approximately equal slope coefficients.ó In 
short, the result on excess volatility appears 
robust and cannot easily be attributed to an 
unknown misspecification error on the dis- 
count rate. Hence, a novel aspect of our test 
is the greater confidence it provides in trac- 
ing the rejection of the present value model 
to excess volatility. 


C. Do Margin Requirements Matter? 


To see the effects of margin requirements 
on excess volatility, I run the following re- 
gression: 


(11b) (R, p i, 4) (P, / di) 
= Bo + Bu HP, -x / diy) 
+ BLL(P, 7d, ,) + £ 


where H, (L,) is a dummy variable that 
takes the value of unity if the average mar- 
gin requirement from the end of month 
t — k to the end of month ¢ is larger than 
(lower than or equal to) 50 percent. If mar- 
gin requirements curb speculative excesses, 
excess volatility should be more prominent 
during periods of low margin requirements. 
One should then observe that the coeffi- 
cient B, is more negative than the coeffi- 
cient By. 


‘The slopes line up closer to each other when all 
overlapping monthly observations are used. See 
Hardouvelis (1988c). 
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TABLE 5B—ManGiN REQUIREMENTS AND THE Excess VOLATILITY OF STOCK PRICES 
P, P, Pi 
Panel A: (R, — tofa) = fo TP + ets] +u, 
d, d, d, 
P, P, P, 
Panel B: (R, — i, ez = por pi F=) «ros Z=) + eunea = +, 
d, d, dık dy_y 
Panel A Panel B 
Horizon Bu BL R? SEE Hyfgy-fB, Bi B, By R? SEE 
Full Sample 

1 month — 0.013 —0.006 0.00 125 0.24 —0.013: — 0.022* 0.006 0.01 1.25 
(0.009) (0.014) [0.626] (0.008) (0.008) (0.012) 

3 months — 0.026 —0.027 0.00 2.03 0.00 — 0.028 0.040*  —0.032 0.02 2.01 
(0.019) (0.028) [0.959] (0.018) — (0.016) (0.025) 

1year —0.214*  —0341* 0.11 4.65 4.60* —0.179* 0.161*  —0.003 0.12 4.63 
(0.075) (0.110) [0.032] (0.071) — (0.054) (0.067) 

2 years —0.460*  —0.745* 0,30 6.20 TTE —0.371* 0.240* 0.008 0.21 6.55 
(0.121) (0.180) [0.005] (0.149) — (0.107) (0.094) 

3 years —0.599*  —0.986* 0.40 6.92 8.69* —0.505* 0.2394  —0.002 0.22 7.86 
(0.167) (0.231) [0.003] (0.243) — (0.133) (0.132) 

4 years —0.740* .—1.20* 0.43 7.83 10.73* —0.576* 0.087 —0.068 0.18 9.40 
(0.183) (0.212) [0.001] (0.284) — (0.116) (0.143) 

5 years —0.53124  —0.921* 0.13 11.73 4.67* —0.661* —0.074 —0.138 0.20 11.19 
(0.285) (0.336) [0.031] (0.338) — (0.141) (0.144) 

Postdepression Sample 

] month —0.0184  —0.000 (001 1.11 0.10 —0.018* 0.023* 0.000 0.01 1.10 
(0.009) (0.014) [0.750] (0.008) — (0.009) (0.011) 

3 months —0.038*  —0.033 001 1.78 0.13 — 0.043* 0.049*  —0.030 0.05 1.74 
i (0.019) (0.025) [0.714] (0.018) — (0.014) (0.026) 

1l year —0.231*  —0.297* 0.09 3.95 1.46 —0.224* 0.145* 0.023 0.16 3.78 
(0.082) (0.114) [0.227] (0.077) (0.048) (0.058) 

2 years —0.444*  —0.583* 0.24 4.91 4.20* — 0.346* 0.121 —0.046 0.25 4.90 
(0.118) (0.140) [0.040] (0.118) — (0.084) (0.063) 

3 years —0.667*  —0.890* 0.40 5.42 9.83* —0.506* 0.089 —0.108 0.37 5.60 
(0.155) (0.156) [0.002] (0.180) — (0.091) (0.086) 

4 years —0.8727 —115* 043 6.71 10.04* — 0.661* 0.039 —0.129 0.36 7.13 
(0.202) (0.207) [0.002] (0.264) — (0.098) (0.121) 

5 years —0.828*  —1.13* 020 10.49 3.86* — 0.868* 0.032 —0.129 0.38 876 
(0.304) (0.342) [0.049] (0.342) — (0.131) (0.142) 








Notes: See the notes of Table 5A. H, (L,) is a dummy variable that takes the value of unity if the average margin 
requirement from 1 —k through f¢ is greater than (less than or equal to) 50 percent. POS, (NEG,) is dummy 
variable that takes the value of unity if from t — k to t the margin requirement increased (decreased). The column 
Hy: By = B, contains a chi-squared statistic with its significance level in brackets. 


Panel A of Table 5B contains the regres- 
sion results. Consistent with the hypothesis 
that margin requirements curb speculative 
excesses, B, is more negative than 8y. Fur- 
thermore, the hypothesis that B, equals Bj 
is rejected for all the holding period hori- 
zons from one to five years. These results 
do not depend on the low margin require- 
ments of the depression years. They are 
present in the postdepression sample as well. 


Note that the dummy-variable methodology 
provides qualitative but not quantitative evi- 
dence on the effects of margin requirements 
on excess volatility. Given that margin re- 
quirements changed only 23 times, it would 
be very hard to extract robust quantitative 
information from the data, especially for 
horizons longer than one or three months. 
An alternative way of partitioning the 
sample to see the effects of margin require- 
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ments is as follows: 
(11c) (R,— i 4)(P. / dex) = Bo 
+ BP LZ d, 4) 
+ BpPOS,( P,-~/ d, .,.) 
+ ByNEG,( P, ,/ d, 4) + &5 


where POS, (NEG,) is a dummy variable 
that takes the value of unity if the margin 
requirement increased (decreased) from the 
end of month t — k to the end of month f. 
This formulation provides evidence of possi- 
ble nonlinearities in the effect of margin 
requirements on excess volatility. For exam- 
ple, if investors find partial ways around the 
margin requirements constraint in the longer 
run, then during times when margin re- 
quirements increase, the constraint would 
be more binding than usual and excess 
volatility would be lower. One would also 
expect to find stronger evidence over the 
shorter horizons because such nonlinearities 
are not likely to last very long. Intuition also 
suggests that stronger nonlinear effects 
should be observed when margin constraints 
become more restrictive; that is, we ought 
to observe a strong Bp and a weaker By. 
Panel B of Table 5B contains the regres- 
sion results. Regression coefficient f, re- 
mains negative and significant. Regression 
coefficient B, reflects the extra effect dur- 
ing periods when an increase in margins 
occurs. It is positive and significant for hori- 
zons up to two years, implying that excess 
volatility is lower during these times. The 
strength at short horizons is consistent with 
expectations. Regression coefficient B, re- 
flects the extra effect during periods when a 
decrease in margins occurs. As expected, it 
is negative but insignificant. The results are 
similar in the postdepression sample. 


IV. Margin Requirements and the Transitory 
Component of Stock Prices 


The previous section showed that excess 
volatility of stock prices becomes less pro- 
nounced during periods with high or in- 
creasing margin requirements. Since this ex- 
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cess volatility cannot be accounted for by 
rational bubbles, it is interesting to isolate 
the type of irrational behavior that margin 
requirements appear to mitigate. This sec- 
tion examines one type of irrational behav- 
ior, investment "fads." 

The fads hypothesis claims that investors 
often act out of herd instinct and slowly 
drive prices away from fundamentals. Alter- 
natively, noise traders with highly autocor- 
related misperceptions may also drive prices 
slowly away from fundamentals. Because 
these deviations are small over short hori- 
zons and take a long time to die out, ration- 
al investors cannot arbitrage them away. 
Summers (1986) argues that deviations of 
this nature may be hard to detect even with 
hundreds of vears of data. In his stylized 
example the stock price contains a transi- 
tory component with an autocorrelation co- 
efficient close to unity. Such a transitory 
component is practically indistinguishable 
from a random walk, yet it generates long 
swings in stock prices away from fundamen- 
tals. Fama and French (19882) observe that 
although irrational long swings in stock 
prices are hard to detect over the short run, 
they generate negative autocorrelation in 
multiperiod returns. In another paper, Fama 
and French (1988b) also contend that irra- 
tional long swings in stock prices generate a 
negative correlation between future multi- 
period stock returns and current price- 
dividend ratios. The intuition for both cor- 
relations is straightforward: If the current 
Stock price is irrationally high, it implies a 
high price-dividend ratio and a high current 
return. Later cn, as the price slowly reverts 
to its fundamental value, multiperiod re- 
turns become negative; that is, they move in 
the opposite direction from current price- 
dividend ratios and current returns. Fama 
and French (19882, b) present evidence of a 
negative correlation between these vari- 
ables."" 


For tests of serial correlation in stock returns that 
are based on secoad moment statistics, see Andrew Lo 
and Craig McKinlay (1988), O'Brien (1984), or Poterba 
and Summers (1968) Related papers are by Bruce 
Lehman (1988) and Campbell and Shiller (1988). 
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Certainly, as previous authors emphasize, 
negative correlation between future multi- 
period returns and either current price- 
dividend ratios or current multiperiod re- 
turns do not necessarily reject the market 
efficiency hypothesis. A negative correlation 
can also be generated by a model with 
time-varying risk premia as follows: An un- 
usually high current stock price—one that 
generates high current price-dividend ratios 
and high current returns—may rationally 
reflect a low risk premium; and the low 
subsequent stock returns would simply re- 
flect a low realized reward for the small 
amount of risk that market participants had 
correctly expected to assume. Thus, if high 
or rising margin requirements reduce the 
above negative correlations, one can con- 
clude that margin requirements reduce ei- 
ther irrational price swings or the rationally 
perceived riskiness in stocks. Under either 
interpretation, however, margin require- 
ments are effective: They reduce the size of 
transitory components and hence excess 
volatility, and/or they reduce the market’s 
riskiness. 

Before turning to the effect of margin 
requirements, let us replicate the previous 
evidence on the presence of a negative cor- 
relation between future returns and both 
current price-dividend ratios and current 
returns. I estimate the following regression 
equations: 


(12a) R, oe Bo 
+ Bi La /4,-x) + Ey; 


(13a) r, ag t ar, au tU, 

where R, is the realized nominal rate of 
return including dividends from the end of 
month t — k to the end of month t, r, is the 
realized real rate of return from t — k to t 
(R, minus the CPI inflation rate), k denotes 
the length of the forecasting horizon in 
months, and P, ,/d, , is the price- 
smoothed dividend ratio at the end of month 
t — k. Note the similarity between equation 
(12a) and equation (11a) of Section III. In 
(11a) the dependent variable is the product 
of R,—i,_, with P._,/d,_,, not R,— i, , 
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alone. Equation (11a) is derived from a ba- 
sic definition of excess or insufficient volatil- 
ity; a negative B, implies excess volatility. 
Equation (12a), however, is based on intu- 
ition, and a negative B, implies the pres- 
ence of transitory components in stock 
prices that contribute to excess volatility. 
Turning tó equation (13a), note that it is 
expressed in terms of real rates of return in 
order to accord with the formulation of 
earlier authors, but the results are similar 
when excess nominal rates are used instead 
(see Hardouvelis, 1988b). 

Panels A and B of Table 6A. present the 
results for equations (12a) and (132), re- 
spectively. Panel A replicates the Fama and 
French (1988b) results: 8, is negative and 
significant both in the full sample and in the 
postdepression sample. Panel B, however, 
finds very little evidence of a negative auto- 
correlation in multiperiod stock returns. 
Whatever negative autocorrelation exists is 
due to the depression years. The results are 
similar to those of O'Brien (1987), who also 
used the S&P index. O'Brien shows that the 
coefficients become significant only when 
the sample is extended back to the 1870s. 
Poterba and Summers (1988) justify the lack 
of a strong negative autocorrelation by ap- 
pealing to the test’s low power.!? 

To see if margin requirements have an 
effect on the correlations of Table 6A, I 
follow the same procedure as in equations 
(11b) and (11c) of Section III. I begin by 
dividing the sample period into regimes of 
high and low margin requirements. I run 
the following regression equations: 


(12b) Bog Bot By HP, / d, 4) 
+P LAP ,/4, 4) t 85 


r,^a,tay H,, tao, L,r, tu, 


(13b) 


181n Hardouvelis (1988c), I show that small stock 
returns exhibit stronger negative autocorrelations over 
longer horizons, at three, four, and five years. The 
relative strength of the coefficients of small stocks in 
the multiperiod return regressions is consistent with 
the evidence of Fama and French (19882), who rank 
the NYSE stocks by size into portfolio deciles. 
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TABLE 6A—Is THERE A TRANSITORY COMPONENT :N STOCK Prices? 
E P, t-k 
Panel A: R,—i,_,= Bg t+ By > + €, 
d, 
Panel B: T= Gg ck Gar, m B, 
Panel A Panel B 
Horizon Sample Nobs g8,x100 R? SEE DW Sample. Nobs a R? SEE DW 
Full Sample 

1 month 1929-87 707 —0.086# 0.01 0.060 1.78 1926-87 743 0.103 0.01 0.059 2.61 
. (0.049) (0.070) 

3 months 1929-87 233 —0.176* 0.02 0.089 1.83 1926-87 246 —0.087 0.01 0.124 1.99 
(0.0888) (0.102) 

1 year 1930-87 58 —1.17* 0.13 0.215 1.99 1928-87 60 0.007 | —0.02 0.224 1.95 
(0.38) (0.140) 

2 years 1931-87 57  —247* 0.27 0.295 120 1930-87 58  -—01774 0.02 0.312 122 
(0.50) (0.097) 

3 years 1932-87 56  —3.35* 0.35 0.339 0.59 1932-87 56  —0.268 0.07 0.364 0.56 
(0.77) (0.170) 

4 years 1933-87 55. —4.79* 042 0.426 0.93 1934-87 54 -0.152 0.01 0.474 0.78 
(0.97) (0.202) 

5 years 1934-87 | 54  —3.62* 0.16 0.599 1.08 1936-87 52  -—0.016 | —0.02 0.540 0.66 
(1.36) (0.179) 

Postdepression Sample 

1 month 1940-87 575  -—0.089* 0.02 0.042 1.91 1940-87 575 0.061 0.00 0.043 1.99 
(0.026) l (0.051) 

3 months 1940-87 189 —0.237* 0.04 0.071 1.76 1940-87 191 0.192* 0.03 0.079 1.89 
(0,075) (0.077) 

1 year 1941-87 47  —1.16* 0.17 0.168 2.06 1941-87 47 0.064 —0.02 0.169 1.92 
(039) —— (0.168) 

2 years 1942-87 46 ~2.36* 0.35 0217 1.07 1942-87 46  -—0.109 | —0.01 0.260 0.93 
(0.53) (0.114) 

3 years 1943-87 45  —3.76* 0.51 0.254 0.88 1943-87 45 0.070 | —0.02 0.333 0.74 
(0.73) (0.206) 

4 years 1944-87 44  —5.19* 0.55 0.325 1.01 1944-87 44 0.248 0.04 0.403 0.65 
(0.95) (0.185) 

5 years 1945-87 | 43  —5.42* 0.42 0.437 0.99 1945-87 43 0.199 0.00 0.501 0.54 
(1.32) (0.145) 


Notes: R,, i-k» P-k, and d,_, are defined in Table 5A. r, is the real rate of return of the S&P-500 index from 
period t — k through period £, defined as the nominal sock return R, minus the CPI rate of inflation. See the notes 
of Table 5A for remaining definitions. Nonoverlapping observations are used for horizons up to 1 year. Overlapping 
annual observations are used for horizons 2 through 5 years. Inside the parentheses are standard errors corrected 
for conditional heteroskedasticity and for the moving average created by the data overlapping. 


where H, (L,) is a dummy variable that 
takes the value of unity when the average 
initial margin requirement from t—k 
through ¢ is larger than (lower than or 
equal to) 50 percent. Panel A of Table 6B 
shows the results of equation (12b). By is 
less negative than B,, which suggests that 
transitory components are less pronounced 
. in periods of high margin requirements. The 
hypothesis that B,,- f, is rejected. This 
results holds in the full sample and in the 


postdepression sample as well; thus it is not 
due to the depression years alone. Panel B 
of Table 6B shows the results for equation 
(13b). In the full sample, there is strong 
evidence that a,, is not as negative as a,. 
The hypothesis that œ; — o, is rejected at 
multiperiod horizons. This result, however, 
is entirely due to the depression years. Con- 
trary to the results for B,, and fj, there is 
no evidence of a difference between qy and 
a, in the postdepression sample. 
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TABLE 6B—MaRGIN REQUIREMENTS AND THE TRANSITORY COMPONENT OF STOCK PRICE 


] P P, 
Panel A: R, ~i,-, = Bo + PHH, FEM + B,L, i.c" 


1—k 


Panel B: r,"-agtag Hor, 4t at Lr, 4 V, 
Panel A Panel B 
Horizon By X100 8,x100 R? SEE Ho: By=B, ey ar R? SEE Hgyag-ao, 
Full Sample 
1 month —0.092# —0.109 0.01 0.060 0.59 0.108 0.101 0.01 0.059 0.00 
(0.054) (0.070) [0.443] (0.067) — (0.084) [0.948] 
3 months —0.186* -~0.207# 0.01 0.089 0.21 0.199* —0.221# 0.04 0.122 9.07* 
(0.093) — (0.166) [0.647] (0.084) — (0.132) [0.003] 
1 year ~1.43*  —2.03* 0.20 0.206 721* — 0.024 0.028.  —0.03 0.226 0.04 
(0.37) (0.49) [0.007] (0.188) — (0.192) [0.838] 
2 years —2.88* | —4.19* 0.44 0.259 13.55* 0.0085  —0.303* 0.04 0.309 3204 
(0.41) (0.58) [0.000] (0.145) — (0.104) [0.073] 
3 years —3.79*  —5.50* 0.56 0.280 17.06* 0.003 | —0.486* 0.15 0.347 5.51* 
(0.61) (0.74) [0.000] (0.217) — (0.130) [0.019] 
4 years —5.33*  —7.44* 0.59 0.355 20.32* 0.163  —0.445* 0.13 0.445 9.75* 
(0.71) (0.81) [0.000] (0.208) — (0.168) [0.002] 
5 years —437* ~6.46* 0.26 0.562 7.49* 0.171  —0.388* 0.06 0.519 21.14* 
(1.29) (1.64) [0.006] (0.143) (0.118) [0.000] 
Postdepression Sample 
1 month —0.093*  —0.100* 0.02 0.042 0.18 0.098 0.039 0.00 0.043 0.40 
(0.029) — (0.041) [0.671] (0.065) — (0.070) [0.529] 
3 months —0.240* —0.244* 0.04 0.071 0.01 0.186* 0,241 0.03 0.079 0.06 
(0.82) (0.108) [0.917] (0.082 (0.211) [0.807] 
1 year —140*  —1.77* 0.19 0.165 2.738 0.013 0.170 | —0.04 0.170 0.28 
(0.42) (0.56) [0.099] (0.194) — (0.263) [0.598) 
2 years —245*  -3,55* 0.43 0203 7.12* —0.057 —1.02* 0.01 0.258 7.33* 
(0.49) (0.63) [0.008] (0.124) — (0.328) [0.007] 
3 years —4.26*  —5.48* 0.62 0.223 15.83* 0.004 0.567  —0.01 0.333 141 
(0.59) (0.68) [0.000] (0.218) — (0.435) (0.236) 
4 years —5.75* | —720* 0.64 0292 14.81* 0.203 0.768 0.03 0.404 0.79 
(0.77) (0.86) [0.000] (0.208) — (0.539) [0.374] 
5 years —6.00*  —7.84* 0.50 0.409 7.95* 0.213 0.038 — —0.01 0.507 0.30 
(1.13) (1.47) [0.005] (0.146) — (0.343) [0.586] 


Note: See the definitions in Tables 5A, 5B, and 6A. 


Let us now turn to the possibility that 
margin requirements may be particularly in- 
fluential at times when they change, espe- 
cially when they become more constraining. 
I estimate the following regression equa- 
tions: 


(12c) R,—i, -x= Bg + B(P a / di) 
+ BpPOS(P, , / d, .,) 
+ ByNEG,(P, ,/ d, ,) + En 
(13c) 


r, = gt OT, 


+ apPOS,r, , + «y NEG,r, 4,3 Uy, 


where POS, (NEG,) is a dummy variable 
that takes the value of unity when the level 
of initial margin requirements increases (de- 
creases) from the end of month t — k to the 
end of month t. Panel A of Table 6C con- 
tains the regression results of equation (12c). 
Observe that fj, continues to be negative 
and significant. The interesting finding is 
that B, is positive and significant, suggest- 
ing that transitory components are less pro- 
nounced during times that margin require- 
ments increase. This is true both in the full 
sample and in the postdepression sample. 
By is not different from zero, implying the 
absence of nonlinearities when margin re- 


758 


Panel A: R, — i, e= Bg + s| 


THE AMERICAN ECONOMIC REVIEW - 


t-k 


t-k 


P . 
i t B,POS, d 


SEPTEMBER 1990 





P, 


t~k 


esee 





Note: See the definitions in Tables 5A, 5B, and 6A. 
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TABLE 6C—MarGIN REQUIREMENTS AND THE TRANSITORY COMPONENT OF STOCK PRICES 


P, id 
dtes 


Panel B: r, 7 ag t a7, k t 4p POS, r, p + au NEG,r, y + u, 
Panel A Panel B 
Horizon B,X100 8,x100 Bgyx100 R? SEE ai a, ay R? . SEE 
É Full Sample : : 

1 month —0.088s  0.077* 0.013 | 0.01 0.060. 0.099 0.374* 0.158 0.01: 0.059: 
(0.049) — (0.032) (0.043) (0.072) (0.150) (0.359) ee 

3 months ~0.182#  0.160* | —0.118 0.03 0.088  —0.111 0.516* 0.496 0.01 0.124 
(0.108) (0.061) (0.087) (0.105) (0.175) — (0.385) . 

1 year ~ 1.26* 0.66* 0.01 0.18. 0.209 0.02 0.080 -0538  -0.02 0,225 
(0.38) (0.23) (0.23) (0.179) (0.269 . (0.520) f 

2 years —2,52* 1.06* 0.12 0.35 0.280 —0.253* 0.748" © —0.137 0.15 -0.290 
(0.58) (0.42) (0.34) (0.125) (0.246) (0.165) : 

3 years —3.52* 1.21* 0.20 0.42 0.321  —0.466" 0,282 0,332 0.07 0.363 
(0.91) (0.53) (0.48) (0.185) — (0.261) :: ` (0.252) ub US 

4 years —4.94* 0.62 0.15 041 0.430 —0.059 —0261: -0.057  -—0.02 0.480 
(1.24) (0.45) (0.55) ` (0.223) (0.296). — (0.243) 

5 years ~5.74*  . 001 — 0.06 0.43 0.494 0.063 0.052 . —0210 ^ —0.04 0.545 
(1.43) (0.69) (0.56) (0.177) (02149  . (0.175) 

Postdepression Sample 

1 month —0.092* 0.085* 0.009 0.02 0.042 0.059 0.464*  —0.128 0.00 0.043." 
(0.026) — (0.031) (0.040) (0.052) (0.186) (0.439) l 

3 months —0.246* 0.191* —0.104 0.07- 0.070 0.187* 0.325. —0.136 0.03 0.079 - 
(0.073) (0.062) (0.087) (0.084) (0.228) (0.200) 

1 year ~1.30* 0.60* 0.08 0.24 0.161  —0.075 0.617* - 0.069 0.01 0.166 

` (0.38) (0.18) . (0.20) i (0.200) (0.267 (0.378). 

2 years —232* 0.684 | —0.07 0.40 0.209  —0.096 0.659*  —0.240 0.10 0.246 
(0.52) (0.34) (0.24) (0.157) (0.27) (0.243) 

3 years —3.71* 0.60 —0.15 0.54 0.247 0.117 -0.066 | —-0.0504  —0.07 0.341 
(0.83) (0.41) (0.37) (0.277) (0.269) (0.356) 

4 years —5.32* 0.56 0.03 0.55 0.326 0.370* —0.269 | —0.089 0.01 0.409 
(1.27) (0.43) (0.55) (0.148) (0.182) (0.323) 

5 years — 6.46* 0.58 0.06 0.60 0.364 0.200 0.044 | —0.049  —0.03 0.513 
(1.49) (0.57) (0.59) (0.152) (0.198) (0.174) 


quirements are relaxed. Panel B of Table 
6C shows the results of estimating equation 
(13c). Here the evidence is stronger than in 
Panel B of Table 6B, where I separated 
regimes of high and low margin require- 
ments. An increase in margin requirements 
has a nonlinear effect, which is present 
across the whole sample. The coefficient ap 
is positive and significant at'short horizons 
both in the full sample and in the postde- 
pression sample. 

Overall, the evidence from the price- 
smoothed dividend ratios shows that the 
presence of transitory comporents is miti- 


gated both during times that margin re- 
quirement are high and during times that 
they increase. The strength of the evidence 
from the autocorrelations of multiperiod 


' stock returns is much weaker, however, and 


depends primarily on the influence of the 
depression years. In judging the strength of 
the latter results, one should recall the claim 
made by Poterba and Summers that tests of 
the null hypothesis of no serial correlation 
in returns have low power against the alter- 
native of fads. In view- of the low power, the 
lack of averwhelming statistical significance 
is not very surprising. Yet observe that de- 
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spite their low power, autocorrelations do 
' detect nonlinear effects that are also 
present in the postdepression sample. 


V. Conclusion 


- Despite the limitations imposed on the 
analysis by the infrequent changes in the 
official margin requirements, the paper un- 
covered an interesting empirical regularity: 
Higher: initial margin requirements in the 
cash market are associated with a reduction 
in both actual and excess stock price volatil- 
ity and with a reduction in the size of long- 

run stóck price swings away from funda- 
mental values. The results are observed 
across the full sample and are not very 
sensitive to the ‘exclusion of ‘the turbulent 
depression years.. . -` 

The case for interpreting the link be- 
tween the level of official margin require- 
ments and market speculative excesses in a 
causal way is strengthened by the presence 
of nonlinear effects at times when margin 
requirements increase. During those times, 
margin requirements apparently impose a 
more severe binding constraint on the be- 
havior of destabilizing speculators and hence 
decrease stock price deviations away from 
fundamental values and reduce excess 
volatility. I conclude that, consistent with 
the aim of the Security and Exchange Act of 
1934, initial margin requirements in the cash 
market seem to mitigate destabilizing specu- 
lation. 

The results of the paper do not necessar- 
ily argue for active Fed intervention on a 
month-to-month basis in order to control 
volatility at high frequencies. The core find- 
ing is that margin requirements dampen 
long swings in stock prices that increase 
volatility at low frequencies and that origi- 
nate from phenomena such as fads, the 
pyramiding and depyramiding of stock 
prices. A minimum policy prescription from 
the results is that policymakers should re- 
frain from eliminating official margin re- 
quirements in cash equity markets. Even 
when margin requirements are kept con- 
stant for a long period of time, they could 
be effective in diminishing destabilizing 
speculative activity. The usual intuition that 
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sophisticated investors are able to find ways 
around the margin restriction at a very small 
extra cost in the long run does not apply to 
every investor. In an evolving market with 
investors entering and exiting every period, 
margin requirements can be quite effective 
in constraining | the activities of new entrants 
in the market.” 

Since the stock market crash of October 
1987, the role of derivative markets in in- 
dex-based contracts has become a major 
topic in the public policy debate. Futures 
and options markets in stock indices are 
praised for providing liquidity and hedging 
capabilities to large institutional investors, 
but the same markets are also suspected of 
creating excessive volatility that spills Over 
to the cash market.? To date, margin re- 
quirements in derivative markets are much 
lower than margin requirements in the cash 
market. The reason is that the primary aim 
of margins in derivative equity markets has 
been to reduce the probability of contrac- 
tual defaults and the risk of a derivative 
market breakdown, assuming that stock 
price volatility is a given exogenous factor. 
The cash market experience of the last fifty 
years suggests, however, that long-term 
volatility is not exogenous and could be 
affected adversely by the low margin re- 
quirements in these markets. 


APPENDIX A: TESTING FOR 
EXCESS VOLATILITY 


Let o denote the variance of a variable i, p the 
correlation between u and f, and q the correlation 
between y and f. Following Froot (1987), let us take a 
mean-preserving spread around the forecasting vari- 
able f, keeping oj? and q constant. Then, if the mean 
squared prediction error 0,2 increases (decreases), f is 


P Before making any stronger policy recommenda- 
tions, it is desirable to examine the effects of contem- 
porary margin policy in other countries. In the U.S. 
margin requirements have not changed since 1974, but 
in other countries, such as Japan, margin requirements 
are actively administered today. In a detailed study of 
the Japanese margin experience, Hardouvelis and 
Peristiani (1989-90) report that during the 1980s an 
increase in margin requirements had an economically 
and statistically significant negative impact on the 
volatility and momentum of Japanese stock prices. 

ÜSee Lawrence Harris (1988). 
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defined to be an excessively (insufficiently) volatile 
forecast of y. This definition leads to the definition in 
the text as follows: First, note that q00, = = Cov(y, f) 
= mess + peyo,, and thus qo, =o, t po,. Next, 
qu / doy - = Zaf — gay). Substituting e, + po, “for qo, 

c / day simplifies to —2po,. Thus a negative (posi- 
es correlation between u and f implies excessive 
(insufficient) forecast volatility. If we have data on y 
and f, we can easily determine whether f is a rational, 
an excessively volatile, or an insufficiently volatile fcre- 
cast of y by running the regression u = Bo Bif, te 
and checking to see whether fj, is zero, negative, or 
positive respectively. 

How does the voluminous literature on volatility 
tests relate to the regression test above? As Froot 
(1987) discusses in detail, in this literature f is the 
stock price and y is the perfect foresight price. Volatil- 
ity tests can be interpreted as rejecting the rationality 
hypothesis that the correlation coefficient p between 
the forecast f and the forecast error u is zero (Scott, 
` 1985). If u and f are uncorrelated, then o = of + a2, 
and hence o; > af. The volatility literature uses direct 
estimates of cj and of and finds that the previous 
inequality is violated, That is, empirically, 


o; < of eo? +2po,0, <0 


© plim f, = po, /0y € -1/2(o, ,/0j) 


Thus in terms of the unbiasedness test (4) of the 
text, volatility tests have indirectly found that the 
slope coefficient B, is significantly smaller than 
—1/Xo, /o,J. Clearly, the unbiasecness test (4) is 
more general in discovering excess volatility, for it only 
requires that £, be smaller than zero. Next, note that a 
second volatility test of the rationality hypothesis is to 
test the inequality c? > o. In terms of our unbiased- 
ness test (4), violation of this inequality is equivalent to 


a, «ole of t2po,ay <0 
| e plim B, = po, /oj < -1/2. 


Again, B, significantly smaller than — 1/2 is a more 
stringent condition than f, significantly smaller than 
zero. 


APPENDIX B: DATA AND SOURCES 


The S&P Composite data are drawn from the 1988 
yearbook of the Ibbotson Associates and represent 
end-of-month values from 1926 through 1987. The data 
come in two forms: with and without reinvested divi- 
dends, a distinction that allows the construction of a 
monthly dividend series. Currently, the index includes 
S00 of the largest stocks, but before March 1957 it 
consisted of 90 of the largest stocks. 

The source for data on the one-month T-bill yield to 
maturity is also Ibbotson Associates. Data on the 
three-month and one- to five-year Treasury yields to 
maturity come from Stephen Cecchetti (1988) for the 
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period January 1929 through December 1946, and from 
Houston McCulloch in Shiller and McCulloch (1987) 
for the period January 1947 through February 1987. 
The McCulloch data refer to zero coupon yields. Fi- 
nally, the end-of-month Moody's Aaa and Baa corpo- 
rate bond yields of Table 5A are drawn directly from 
Moody's Investor Services. 

Data on the Consumer Price Index are taken from 
Ibbotson Associates, and on the industrial production 
index from the following sources: (i) for the period 
1926-1946, Industrial Production, 1986 edition, Board 
of Governors of the Federal Reserve System; (ii) for 
the period 1947~October 1987, Citibase data banks; 
(iii) for November and December 1987, International 
Financial Statistics, April 1988 edition. 

Sources for data on broker and dealer margin credit 
are (i) the series entitled *Customer Net Debit Bal- 
ances" in Board of Governors of the Federal Reserve 
System, Banking and Monetary Statistics, 1943, Table 
no. 143; and Banking and Monetary Statistics: 
1941—1970, 1976, Table no. 12.23; and (ii) the ser- 
ies entitled “Credit Extended to Margin Customers," 
which appears in various issues of the Federal Reserve 
Bulletin under the "Stock Market Credit" table. The 
first series runs from November 1931 through June 
1978, and the second series runs from March 1967 
through December 1987. The two series are not identi- 
cal. To avoid an abrupt jump in July 1970, I multiplied 
the second series by the factor of 1.43, which is the 
average ratio of the first to the second series during the 
overlapping interval from March 1967 through June 
1970. Data on the value of all New York Stock Ex- 
change Stocks are end-of-month and come from NYSE 
publications. 
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Business Cycle Models with Endogenous Technology 


By GeEorGE W. STADLER* 


This paper compares real and monetary business cycle models with and without 
endogenous technical change. If technology is endogenous, the properties of these 
models change significantly. In particular, both real and monetary models yield 
very similar output processes if growth is endogenous, and changes in aggregate 
demand can result in permanent changes in productivity, employment, and 
output. The effect of depreciation of technology is examined, and the pattern of 
real wage movements over the cycle when money wages are fixed but technology 
is changing is briefly considered. (JEL 131). 


This paper examines the implications that 
endogenous technical change has for busi- 
ness cycle theory. Conventionally, both real 
and monetary models of the business cycle 
assume that the evolution of technology is 
exogenous to the economic system.! The 
monetary models ignore technical change, 
thereby implying that the demand-side dis- 
turbances that give rise to economic fluctu- 
ations in these models have no impact on 
technology. The real business cycle models 
assume that random changes in or “shocks” 
to productivity or technology cause output 
fluctuations in perfectly competitive envi- 
ronments. These temporary or permanent 
shocks to productivity are modeled as ran- 
dom drawings from a probability distribu- 


* Department of Economics, University of Newcas- 
tle upon Tyne, NE1 7RU, England. I am indebted to 
Zwi Hercowitz, David Laidler, Bennett McCallum, 
Michael Parkin, Ian Wooton, an anonymous referee, 
and in particular to Peter Howitt for helpful comments 
and suggestions. Any errors are solely my responsibil- 
ity. 

lTypical real business cycle models are those of 
John Long and Charles Plosser (1983) and Finn Kyd- 
land and Edward Prescott (1982). Robert King et al. 
(1988a) argue that the neoclassical model of capital 
accumulation (augmented by shocks to productivity) is 
the basic framework for real business cycle analysis, 
and discuss its limitations. The monetary models fall 
into two groups: the misperception models originating 
with the work of Robert Lucas (1973) and Robert 
Barro (1976), and models that emphasize nominal 
rigidities, such as Stanley Fischer (1977) or John Taylor 
(1980). 
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tion: they are independent of economic 
forces and conditions and are thus exoge- 
nous to the economic environment. 

However, there exists a substantial theo- 
retical and empirical literature that empha- 
sizes the dependence of technological 
progress on economic factors, implying that 
technology is, to a large degree, endoge- 
nous. The work of Kenneth Arrow (1962) 
points to the importance of learning by do- 
ing, and W. A. Eltis (1971) emphasizes the 
role of expenditure on research and devel- 
opment in generating economic growth. Ja- 
cob Schmookler's (1966) pioneering study 
on the interrelationship between market 
conditions and innovation led him to con- 
clude that “the belief that invention, or the 
production of technology generally, is in 
most instances a noneconomic activity, is 
false."? James Utterback (1974) made a 
similar finding and reported: 


Market forces appear to be the pri-' 
mary influence on innovation. From 60 
to 80 percent of important innovations 
in a large number of fields have been 
in response to market demands or 
needs. The remainder have originated 
in response to new scientific or tech- 
nological advances and opportunities. 

[p. 621] 


?Schmookler (1966, p. 208). Other studies have also 
found that economic factors have a significant influ- 
ence on technology—Morton Kamien and Nancy 
Schwarz (1982) provide a survey of the literature. 
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Thus, exogenous changes in the scientific 
. base (the “shocks” of real business cvcle 
theory) do not appear to be the cause of 
most of the changes in productive tech- 
niques. 

Conventional real and monetary business 
cycle theory (with exogenous technology) has 
ignored this body of literature. It is there- 
fore interesting to see how real and mone- 
tary business cycle models with endogenous 
technology differ from the conventional 
models, and to what extent the results ob- 
tained from conventional mcdels are sensi- 
tive to the assumption that technical change 
is independent of economic conditions.? 

There is a second motivation for examin- 
ing models of business cycles with en- 
dogenous technology. Charles Nelson and 
Charles Plosser (1982) first observed that 
most economic time-series appear to be 
nonstationary and are best modeled as con- 
taining stochastic trends. This implies that 
some fraction of innovations in output is 
permanent and alters the long-run trend of 
output. Traditionally, monetary innovations 
were considered to have only a transitory 
impact on output: the influence of a mone- 
tary shock dissipates over time, leaving the 
trend of output unchanged. Thus, models 
based on monetary innovations (or demand- 
side innovations generally) have difficulty in 
accounting for nonstationarity. This has fos- 
'tered the growing belief that monetary 


3A number of recent papers recognize the impor- 
tance of endogenous technology in accounting for eco- 
nomic growth, for example, Lucas (1988), Edwari 
Prescott and John Boyd (1987), and Paul Romer (1986), 
but these papers do not examine endcgenous technol- 
ogy in a cyclical context. However, Robert King and 
Sergio Rebelo (1986) examine the impact of productiv- 
ity shocks in a real business cycle model with endoge- 
nous growth, and their key conclusions are similar to 
those implied by the model of Section II, Part C below. 
This paper differs from King and Rebelo in that, first, 
they model the growth process as resulting from invest- 
ment in human and physical capital, while I assume 
that changes in technological knowledge depend on the 
level of economic activity, specifically tbe level of labor 
input; second, this paper allows a role for money and 
aggregate demand shocks; third, it explicitly compares 
real and monetary business cycle models with and 
without endogenous growth. The model and conclu- 
sions of King and Rebelo's paper are briefly outlined in 
Section 3 of King et al. (1988b). 


THE AMERICAN ECONOMIC REVIEW 


SEPTEMBER 1990 


models of the business cycle are unable to 
provide a satisfactory explanation of output 


fluctuations and should be rejected in favor ' 


of real business cycle models.* 

This paper shows that this inference is 
false. If technology is modeled as endoge- 
nous, a monetary business cycle model ex- 
hibits properties that are very different from 
those of the conventional models of Robert 
Lucas and Robert Barro or John Taylor and 
Stanley Fischer. There is a long-run non- 
neutrality of money in models with endoge- 
nous technology: monetary shocks have a 
permanent impact on output, and output is 
nonstationary, even in the absence of exoge- 
nous shocks to the supply side. The reason 
is that changes in demand that raise the 
level of output can, through a number of 
channels, exert a permanent influence on 
the supply side. For example, changes in the 
utilization of factor inputs when demand 
changes can result in reorganization and the 
acquisition of new skills; or a higher level of 
output may make innovation more prof- 
itable and result in the allocation of more 
resources to R&D. Furthermore, even real 
business cycle models with endogenous 
technology have somewhat different proper- 
ties than conventional real business cycle 
models. 

The following section develops a general 
model that contains both real and monetary 
disturbances. Unanticipated changes in the 
money supply have real effects in this model 
economy because nominal wages are as- 


^For example, see Nelson and Plosser (1982, p. 159). 
Even Barro (1984) has suggested that the lasting con- 
tribution of the rational expectations revolution seems 
to lie in the development of the real, not monetary, 


business cycle theory. Bennett McCallum (1986), how- ` 


ever, assesses the evidence against monetary models 
and concludes that it is insufficient to justify a rejection 
of this class of models. Furthermore, Kenneth West 
(1988) has shown that a purely monetary model with 
overlapping wage contracts can, for certain parameter 
values, yield an output process that has a root that is 
below but near unity. This implies that it may be 
impossible to discriminate in finite samples between a 
stationary monetary model and a nonstationary real 
business cycle model. By contrast, this paper demon- 
strates that with endogenous growth a purely monetary 
model can give rise to a nonstationary output process 
with a unit or greater-than-unit root. 
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sumed to be set in advance by one-period 
contracts. This is not a crucial assumption. 
Very similar results would be obtained if we 
assumed that prices, rather than wages, are 
sticky, or if a Lucas-type misperception 
framework had been adopted. Technology 
is partly endogenous because it is assumed 
that technical knowledge advances through 
learning by doing and depends on the level 
of labor input and on the level of labor 
productivity achieved in previous periods. 

Within the general model are nested vari- 
ous sub-models that enable us to compare 
real and monetary models with and without 
endogenous technology. In Section II atten- 
tion is focused on four sub-models; a pure 
monetary model with endogenous technol- 
ogy, a typical real business cycle model with 
exogenous technology, a real business cycle 
model with endogenous technology, and fi- 
nally a hybrid model with transitory real 
and monetary shocks but no learning curve. 
The properties of the models are con- 
trasted, and the long-run properties of the 
general model are examined. 

The effects of depreciation of technology 
are examined in Section III. A monetary 
model with endogenous technology and a 
real business cycle model with exogenous 
technology are considered, where both the 
endogenous and exogenous technologies are 
subject to depreciation. The properties of a 
conventional real business cycle model 
change considerably if technology is subject 
to depreciation. 

The fourth section considers the implica- 
tions of learning for the pattern of real 
wages over the business cycle, when money 
wages are fixed by contract. Normally, a 
fixed money wage results in countercyclical 
movements in real wages, since prices move 
procyclically. However, once learning is in- 
troduced, the resulting pattern of real wages 
over the cycle becomes unclear. Even with 
nominal contracts, real wages may exhibit 
very little countercyclical movement. 

The final section contains a summary and 
conclusions. 


I. A General Model with Endogenous Technology 


This section develops a simple model of 
output fluctuations with endogenous tech- 
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nology. Following Arrow, the concept of 
technical change I adopt comes from the 
basic premise that learning is the product of 
experience and takes place during activity. 

I consider a simple closed economy. The 
supply side of the commodity market con- 
sists of a large number of competitive firms, 
all producing an identical good as output, 
that take the market price as given. Trading 
arrangements are sequential: at the begin- 
ning of each period, before the current pe- 
riod’s price level is known, households enter 
into one-period nominal wage contracts with 
firms, agreeing to provide whatever amount 
of labor the firm requires at that wage rate. 
Thus, in order to negotiate the wage con- 
tract, agents must form expectations of both 
the price level and the representative firm’s 
demand for labor before they know the 
value of the money supply (and aggregate 
demand) and before the real shocks hitting 
the economy are observed. 


A. Aggregate Supply 


The representative firm maximizes its 
stream of discounted expected profits: 


ur Ag 
e[z,- eai. 
j=0 Pj 


0«B«1 


V — max E, 
Un 





where LÍ is employment of labor by the 
representative firm, Y/ is the output of the 
representative firm, W, is the money wage, 
P is the price level, and B is a discount 
factor. The production technology is as- 
sumed to be Cobb-Douglas: 
(1) Y'-k(1) zg-?r, 0«o«1 
where k is a constant and F, is a strictly 
positive stochastic shock that contains two 
components: (i) a permanent component, 
£,, which can be thought of as a shock to 
technology, and (ii) a temporary productiv- 
ity shock, 7,, so that in natural logarithms 
the evolution of F, is given by 


(2) fh-htn 
fcd E 
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where lowercase letters denote natural loga- 
rithms. Both ņ, and £, are stationary 
stochastic processes with zero mean, con- 
stant variance, and zero covariance. 

Z, is a scale factor that represents accu- 
mulated technical knowledge. It evolves ac- 
cording to 


Y 
(3) nad (Li) 


The evolution of technical knowledge de- 
pends on the level of aggregate labor input 
(L,_,) employed the previous period, and 
on the level of aggregate labor productivity 
attained the previous period. The greater 
the level of labor input, the greater is the 
scope for learning and acquisition of new 
Skills. A higher level of labor input also 
requires more intensive use of factors fixed 
in the short run, thus raising the incentive 
to eliminate waste and bottlenecks. How- 
ever, some change in productivity can occur 
independently of changes in labor input, for 
example, through reorganization of the pro- 
duction process and firm structure to achieve 
greater efficiency. The benefits of such 
changes are likely to persist for some time 
while the improved structures are in place, 
and to capture this source of produc- 
tive improvements, I assume that technical 
knowledge also depends on the level of la- 
bor productivity attained in the previous 
period. For simplicity, it is assumed that Z 
is disembodied knowledge that does not de- 
preciate (i.e., knowledge in books that is 
easily acquired by new entrants to the labor 
market). I abstract from physical capital (al- 
though Z is a kind of capital). 


SAn alternative but more complex way of introduc- 
ing endogenous technology is to assume the firm faces 
an innovation production function and that innovation 
comes about through the application of resources to 
R&D. The results derived from such a model are 
similar to those presented below (see George Stadler, 
1988). An alternative and simpler way of modeling 
learning is to assume that technical knowledge just 
depends on past levels of output, as in Stadler (1986). 
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Z depends on aggregate labor input and 
productivity. Since’ the individual firm is 
small, I assume it ignores the effect of its 
own employment decision and level of pro- 
ductivity on aggregate employment and pro- 
ductivity. The learning process can be 
thought of as a by-product of the output 
process. It is not necessarily under the con- 
scious control of management. Further- 
more, as labor moves between firms, learn- 
ing skills become dispersed, so that aggre- 
gate labor input and productivity determine 
knowledge. 

Assuming that the evolution of knowl- 
edge is exogenous to the firm enables us to 
treat the firm's problem like. a one-period 
problem, and the firm's only decision vari- 
able is Li. Taking the first-order condition 
for labor enables us to derive the represen- 
tative firm’s demand for labor (in loga- 
rithms): 


i In( ak) 1 
py Amer unc) 
tati Í 


The labor supply function is assumed to be 


(5) Ly=e*(W,/P,)” 0«$, «1. 


At the beginning of the period the money 
wage is set at its expected market clearing 
level, that is, the wage at which the ex- 
pected demand for labor during the period 
ahead equals the amount households expect 
to supply. The amount of labor households 
expect to supply will depend on the ex- 


SConcavity of the production function (both labor 
and knowledge have diminishing marginal products) 
and the treatment of learning as an externality ensure 
that a competitive equilibrium exists each period with 
firms treating the state of technical knowledge as ex- 
ogenously given. If learning were internalized, a com- 
petitive equilibrium might not exist. Since there is 
implicitly no limit to learning, utilization of labor yields 
increasing returns over time and the growth rate of 
output will rise over time. The assumptions that under- 
pin this result are examined in Section II, Part E. 


VOL. 80 NO. 4 


pected real wage. Taking the expectation of 
equation (5) yields, in logarithms, 


(6) E, 4l; = 6, + óY(w,— pf), 


where pj = E, ,p, is the rational expecta- 
tion of the price level. The rational expecta- 
tion of the firm's demand for labor formed 
at the beginning of the period is obtained by 
taking the expectation of equation (4), and 
using equations (2) and (3) to obtain 





, In( aek 
(7) E, lp = \ Lg 
l-a 
1 1 
+ C era) + i-a 


since E, ,z,—z, and E, ,f,—f, , be- 
cause Y,.,, L,.,, Z- and f,_, are con- 
tained in the information set available at 
the beginning of period ¢, and E, (7,— 
E, ,£,70. Equating the expected supply 
and expected demand for labor given by the 
above two equations yields the money wage 
written into the labor contract: 


In(ak) —(1— a)9, 


(8) W,— prc 1+(1-a)¢, 
(1—2)2, " fu 
1+(1-a)¢, 1-(1—-2a)6; 


Substituting equation (8) into equation (4) 
for w, gives the firm's employment of labor: 


(9) = agta,(p, ~ pr) 


+ Z, + af, e— zf, 











In(ak) do, + $, 
a ee 
^. 1-(1—2a)94, 
1 
“1 l~a 
i 1 
72 ix a. 
1 
as 


~ (12a) «(17 a), 
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Substituting (9) into a logarithmic form of 
the production function given by equation 
(1) and aggregating gives us the aggregate 
supply function: 


(10) y? q- bo * bi( p, pf) 
t baz, +b3(&,+1,) + bafi 


bo — In(k) + ado 


b,—-a/(1—a) 

b,=(1-a)+ aa, 

b,-1/(1—a) 
(1+ 2) 


aaa alae: 


where q is the natural logarithm of the 
number of firms in the economy. Hence 
aggregate supply will increase in response 
to a positive price surprise, a positive real 
shock, or a rise in technical knowledge. 


B. Aggregate Demand and Expectations 


I assume that aggregate demand is given 
by the quantity theory equation with veloc- 
ity constrained to unity: 


(11) Y! = M,/P,, 


where Y7 is real aggregate demand and M, 
is the money stock. 

The money supply is assumed to follow a 
random walk with positive trend. The trend 
is set by the central bank so as to avoid an 
ongoing deflation in a growing economy, 
given that the velocity of money is constant. 
The money supply rule is public knowledge 
and, in logarithms, is 


(12) 


where u is a constant trend and e, is a 
zero-mean stochastic error with constant 
variance. 

When signing wage contracts, agents have 
at their disposal the past values of all rele- 


m,-m, tutt, 
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vant variables (which enables them to cal- 
culate z,), including f, ,. Writing (11) in 
natural logarithms, rearranging terms, and 
taking expectations gives us the rational ex- 
pectation of the price level: 


(13) Be = me yf, 


where y“ = E, y? is the rational expecta- 
tion of aggregate demand formed at the 
beginning of the period. Equations (11) and 
(13) imply 


(14) (p; př) (m, -m;) e = yf). 


Expected market clearing implies that y/* 
= y;*, so that one can take the expectation 
of equation (10) to obtain yZ*: 


(15) ye = y =q + bo t biz, + buf, 


since E,_,(p, — pf) — 0, f,_ is contained in 
the information set available at the begin- 
ning of the period, and the expected values 
of the real shocks (m, and €,) are zero. 
Actual market clearing implies yf = y;, so 
that subtracting equation (15) from equa- 
tion (10) gives an expression for (y? — y^), 
the difference between actual and expectec 
aggregate demand: 


yd — yf* = b p, — pf) + b(n, + &). 


Substituting this expression into equation 
(14) and noting that m, — m; — e, yields an 
expression for the difference between the 
actual and expected price level: 





(16) (m-»r)- ih 


Thus prices diverge from their expected val- 
ues each period because of unanticipated 
real and monetary shocks. 


C. The Output Process 
The level of output each period is ob- 


tained by substituting equation (16), which 
gives the price surprise into equation (10), 


le, a b(n, TÉ. 
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the aggregate supply function: 
(17) y,94-*bg*a[e, — b(n, + &)] 
+ bz, T b4(m, + é) + b f, 


Output depends on technology, both on the 
level of technical knowledge, z,, and on the 
exogenous shocks to technology. However, 
real shocks are partly offset through price 
level effects. A positive value of c, or £, 
raises output and requires the price level to 
fall below its expected value to clear the 
commodity and money markets. The fall in 
the price level raises the real wage, thus 
counteracting the effect of the productivity 
shocks on employment and on output to 
some extent. Output also depends on unan- 
ticipated changes in the money supply, e,, 
and, as in all demand-side models of the 
business cycle with nominal wage contract- 
ing, an unanticipated rise in the money sup- 
ply causes the price level to rise above its 
expected value and hence depresses the real 
wage and raises output. 

The output process of equation (17) is 
nonstationary. The nonstationarity arises 
from two sources. First, as can be seen from 
equation (2), f,_, is nonstationary because 
it consists of accumulations of permanent 
stochastic shocks. Second, z, is nonstation- 
ary for, as can be seen by solving equation 
(3) backward, it depends on the cumulative 
sum of past levels of output and employ- 
ment. The properties that each of these two 
sources of nonstationarity imposes on the 
output process become clearer in the next 
section, where a number of sub-models, 
containing some but not all the properties 
of the general model, are considered. 


II. A Comparison of Real and Monetary Models 
with Endogenous and Exogenous Technology 


Nested within the general model of equa- 
tion (17) are at least four sub-models: a 
purely monetary model that contains no real 
shocks but with endogenous technology; real 
models with and without endogenous tech- 
nology; and a hvbrid model with monetary 
shocks and transitory real shocks but no 
learning. Each of these models is consid- 
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ered in turn, beginning with the monetary 
model with endogenous technology. 


A. A Monetary Model with Endogenous 
Technology 


This subsection considers a purely mone- 
tary model that contains no real shocks. The 
only disturbances to the economic system 
are unanticipated changes in the money 
supply, so that this model is a special case 
of the general model with F,=1 for all t 
(Le., 7, — £ — 0 for all t). The difference 
between actual and expected prices now 
only depends on innovations in the money 


supply: 
1 
1+b, a 





(Pp, - py) = 
and the output path is given by 
(18) y,7 qt by + ae, + bz,. 


This output process is nonstationary and 
has a greater-than-unit root. This can be 
seen more clearly if one takes the first dif- 
ference of output, uses equation (3) and an 
aggregate version of equation (9), and, after 
some manipulation, writes y, as a moving 
average process: 


pel 


(19) y, (19 7)'yo ae, + L (14 7)eg 


j=0 
t-2 . 

*e X T1633 
j=0 


| 1 
e EXCEPT 


l-« 
* xac 
Cy 7 (b; — a;)(y - À)a 


* b,(y — A)ag — a4(y — A) by; 


h, T»0; 


c1-7 [ra *(1- a)(y -A)], c>0, 
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where yg is a reference value in the past 
and I assume e, = 0 for simplicity. 

Equation (19) demonstrates that output 
depends on deterministic components of 
growth, captured by the constant, cg, and 
the term (14- 7)'yy. With endogenous tech- 
nology the economy will grow even in the 
absence of stochastic shocks, because uti- 
lization of labor results in an ongoing learn- 
ing process that increases technical knowl- 
edge, leading to higher productivity and 
output in future periods. 

Output also depends on a nonstationary, 
stochastic growth component that consists 
of the cumulative sum of money-supply 
shocks that cause errors in price expecta- 
tions. However, these errors have a perma- 
nent effect on output. Initially, a positive 
innovation in the money stock raises output 
by ae,. In the absence of further shocks, 
output falls the following period, but not 
back to its previous level, for through learn- 
ing a monetary innovation of e, raises out- 
put by c,e, the next period. Thereafter, the 
impact of the shock gradually increases over 
time; in j periods ahead of it, output will 
have increased by (14- 7)/^!c,e,. Thus, in- 
novations in output are, to some degree, 
persistent, but the degree of persistence de- 
pends on the horizon from which the inno- 
vation is viewed: with endogenous technol- 
ogy, the impact of an innovation in output 
slowly rises over time.’ The reason for this 
is that any shock that causes an innovation 
in output adds a stimulus to an ongoing, 
endogenous growth process and raises the 
time profile of output permanently above 


"Empirical studies have found differing estimates of 
persistence of innovations in output. John Campbell 
and Gregory Mankiw (1987) find that an unanticipated 
change in real GNP of 1 percent should change one's 
long-run forecast of output by more than 1 
percent—between 1.3 and 1.9 percent. Other studies, 
such as John Cochrane (1988) and Mark Watson (1986), 
have found less persistence. Cochrane finds that at 
most 0.4 percent of an innovation in GNP is perma- 
nent, and Mark Watson finds 0.6 percent using an 
unobserved components model, but considerably more 
(1.7 percent) using an ARIMA model, 
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the level it would have achieved in the ab- 
sence of the shock. A monetary shock at 
time £ that raises employment and output 
above their steady-state values will, through 
learning, cause technical knowledge to rise 
above its steady-state value by time t — 1. 
This rise in technology will increase the 
productivity of and demand tor labor above 
their steady-state values at time ¢ +1, bid- 
ding up the expected real wage and level of 
employment. Hence, as a result of a mone- 
tary innovation at time £, through learning, 
output will be above its steady-state value at 
time ¢ +1 as well. However, the rise in labor 
input and output at time ¢+1 causes a 
further increase in technical knowledge 
above its steady-state value by time t +2. 
. This results in a further rise in labor pro- 
ductivity and employment at time ¢ +2, en- 
abling further acquisition of technical skills 
by time ¢ +3, which raises productivity and 
employment in that period, and so on. Thus, 
the endogenous growth process not only 
acts as a propagation mechanism for aggre- 
'gate demand shocks but also amplifies their 
impact over time. 

In contrast to conventional business cycle 
models, the sub-model captured by equa- 
tion (19), like the general model it is de- 
rived from, has a greater-than-unit root in 
the output process (of value (1-- 7)) and is 
not difference stationary. Since (19) is in 
logarithmic form, the greater-than-unit root 
implies that in natural levels the growth rate 
of output will increase over time. This arises 
because, as discussed in Section II, Part E 
below, the growth rate of output essentially 
depends on the level of output. 

Furthermore, the natural rate of unem- 
ployment is likely to depend on the history 
of aggregate demand in this model. The 
“natural” level of employment at any point 
in time can be regarded as the level of 
employment that prevails when there is no 
discrepancy between labor’s actual and ex- 
pected real wage. A sufficient condition for 
this is that there be no real or monetary 
surprises that drive a wedge between the 
actual and expected price level; that is, e, = 
£, =n, = 0. Aggregating equation (9) and us- 
ing (3) to solve backward for z, we can 
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obtain the following expression for 1,: 


t 
(20) l, =Q9+ agt 8Zo 45A L Y-j 
j=1 


t 
* aly IE à) y» Ls 
j=l 


where zy is a reference value in the past. 
Thus, the natural level of employment at 
time f is a stochastic variable; it depends on 
the cumulative sum of past values of em- 
ployment and on the cumulative sum of past 
values of output, which, from equation (19), 
depends on the cumulative sum of innova- 
tions in aggregate demand (and, in the more 
general model, on exogenous real shocks as ' 
well). The demand for labor and the equi- 
librium level of employment are conse- 
quently not invariant with respect to the 
history of aggregate demand in the econ- 
omy. The natural rate of unemployment, 
being the difference between the supply of 
labor and the equilibrium level of employ- 
ment, is likely to change in response to 
permanent changes in employment and the 
demand for labor.’ This suggests that in an 
economy with endogenous technology, 
changes in aggregate demand are likely to 
alter the natural rate of unemployment. 

The results obtained from this model 
stand in strong contrast to the results de- 
rived from conventional monetary business 
cycle models. In both the misperception 
models and in the wage-contracting models, 
money has a transitory influence on eco- 


8a general rise in the demand for labor resulting 
from a higher level of technical knowledge can reduce 
the natural rate of unemployment if it induces firms to 
establish training programs in areas of skill shortages. 
This could reduce the mismatch of skills required by 
employers and those possessed by the unemployed, 
hence lowering structural unemployment. A rise in 
demand for labor can also lead to greater expenditure 
by firms on recruitment and advertising, so leading to 
improved information flows and possibly lower search 
unemployment. 


VOL. 80 NO. 4 


nomic activity: the impact of a monetary 
innovation dies awav over time, and, in the 
. long run, money is neutral. However, when 
technology is dependent on demand condi- 
tions, monetary shocks have a permanent 
impact on output. There is a long-run non- 
neutrality of money in models with endoge- 
nous technology. 


B. 4 Real Business Cycle Model 
with Exogenous Technology 


A second sub-model nested within the 
general model of equation (17) is a pure 
real business cycle model with technology 
evolving exogenously. Money has no impact 
on economic activity, and the model can be 
derived from the general model of equation 
(17) by assuming that not only the velocity 
but also the quantity of money is constant, 
so that the level of nominal aggregate de- 
mand is constant. In this case, unexpected 
price level changes are caused by technol- 
ogy shocks that alter real output. As is well 
known, the price level is countercyclical in 
such a model. 

In the absence of endogenous technology, 
without a learning curve, Z,=1 for all t. 
Proceeding as before, the price surprise term 
reduces to 


(21) (p-p TA [5s /(1+b,)] (m+ €), 


for by assumption ¢,=0 for all t, and the 
value of output is 


(22) y, 7 q + by — ab;(n, + &) 
+ b3(n, + &) + bif, 


Output depends on exogenous real shocks. 
As in the general model, a positive innova- 
tion in 5, or £, has a positive effect on 
output. The positive output effect is offset 
to some extent by the effect of the fall in the 
price level on the real wage. Taking the first 
difference of equation (22) and using (2) to 
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solve backward yields 


t—1 


(23) y -Jo t" tE tb b éj 


j=1 


where yọ is a reference value at time t=0 
and I have assumed that ng = ég = 0. 

Output is clearly nonstationary in this 
model, for it depends on the cumulative 
sum of permanent shocks to technology, the 
£'s. As in other real business cycle models, 
exogenous changes in technology cause per- 
manent shifts in output, for the influence of 
the £’s does not decay over time. The initial 
impact of a positive innovation in £ is, from 
equation (22), to raise output by b,(1— a@é,. 
In the following and all future periods, it 
raises output by b,£,, so that its long-run 
impact exceeds the short-run impact (b, > 
(1—a)b,), for in the long run the offsetting 
rise in the real wage is absent. 


C. A Real Business Cycle Model 
with Endogenous Technology 


A third model of interest is a pure real 
business cycle model with endogenous tech- 
nology, that is, a model in which learning 
occurs but money has no real effects. The 
production function of the representative 
firm is given by equation (1), and derivation 
of the model proceeds as in the case of the 
general model above, but with e, = 0 for all 
t, so that again the level of nominal aggre- 
gate demand is held constant. Thus, the 
only difference between this model and the 
preceding one is that technical knowledge, 
Z,, rises over time in response to labor 
input and increases in productivity. 

In the absence of monetary shocks, the 
difference between actual and expected 
prices depends only on real shocks and is 
given by equation (21). The output equation 
is very similar to that of the general model: 


(24) Y= 4G + bo = ab,(€, + n.) 


+ bz, + b(é, + 7.) + bilia: 
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Using equations (3) and (9) enables us to 
write, after some manipulation, 


Q5) y=(1+7)'yo 


t~-1 


+ X (1+r)cg + 0, +ë, 
i-o : 


t—2 . 
+ C5 L (1+ TYE, j1 
j=0 


t—2 


+ C3 L (1 F TY j-i 
j=0 


t—1 


tc, 2. (14 7)! f, 


j=0 


t-2t-2 


$602: (19 7Y£ 


i-0j-i 
€57b5A + byt [b5(25—a,) *a;b,] (y); 
€47b5À; C4=[b2(4,—a3) -a5b,](y-A); 


where fy is a reference value in the past 
and I assume 7, = é = 0. 

Output depends on the cumulative sums 
of the exogenous shocks and is nonstation- 
ary. It thus bears some resemblence to the 
conventional real business cycle model of 
equation (23). Nevertheless, there are two 
important differences between this model 
and the real business cycle model with ex- 
ogenous technology. First, like the pure 
monetary modél (and like the general 
model), this model has a greater-than-unit 
root, the value of the root being (1-- 7). The 
conventional model of equation (23) has a 
unit root, and is difference stationary, while 
the model with endogenous technology is 
not difference stationary. As in the mone- 
tary model, in a real business cycle model 
with endogenous technology the growth rate 
of output increases over time. 
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. Second, with endogenous technology, 
transitory shocks to productivity (the 7’s) 
have a permanent effect on output. Again, 
learning acts as a propagation mechanism 
for both demand-side and supply-side dis- 
turbances to output. A temporary rise in 
productivity stimulates learning and causes 
a rise in Z, shifting the long-run trend of 
output upward. Thus, in a model with en- 
degenous growth, output will have a 
stochastic trend even if the only distur- 
bances hitting the economy are transitory 
changes in productivity. (This is also the 
main finding of Robert King and Sergio 
Rebelo, 1986.) 


D. A Hybrid Model 


A final model that deserves consideration 
is one that contains both monetary shocks 
and transitory real shocks to technology but 
has no learning curve. Hence, Z, — 1 for all 
t, and F, =e”, so that the firm's production 
function becomes 


Y= k(Li)"e™, 
and the output process is governed by 
(26) y= Qtbot ale, z b3n,] + ban,- 


The trend, or expected future value of out- 
put conditional on current information, is 
E,y,;7qtbg i1, 

since E,e,,,; = E,n,,;=0, for these are ran- 
dom variables with zero mean. It is at once 
apparent that this process is stationary, for 
output will only vary about the point (q + 
bo). This is true even if the monetary and 
real shocks have effects that persist for sev- 
eral periods before dying away, through, for 
example, the presence of adjustment costs. 
In the long run, money is neutral in this 
model. Thus, in a model with exogenous 
technology neither monetary shocks nor 
transitory shocks to technology have a per- 
manent impact on output, thus highlighting 
the distinctions between models with en- 
dogenous and models with exogenous tech- 
nology. 


x 
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E. The Steady State 


In the steady state neither real nor mone- 
tary shocks impinge on the economy— e, = 
n, ^ £, = 0 and p, — pf for all t. The output 


process for the general model can then be ` 


written as a first-order difference equation: 
(27) 


The solution to this difference equation is 


y, 7 (1 *)y, 1+ Co. 


Yeen 7 Ly, * (co/ 7)] (1+ T) -(co/7). 


This equation describes an explosive growth 
process, for 7» 0. The growth rate of the 
natural level of output, viewed n periods 
ahead at time t -- 5, will be (1-- 7)"[y, + 
(cy /r)\In(1+ 7), which increases as n rises. 
The rising growth rate occurs because the 
growth rate of technology, as expressed by 
equation (3), depends on the level of labor 
input and labor productivity attained. Hence 
the growth rate of technology essentially de- 
pends on its level, since the amount of labor 
utilized and labor productivity both depend 
on Z. This gives rise to the greater-than-unit 
roots that these models with learning con- 
tain; as the level of technology rises, so do 
productivity and labor input, causing the 
growth rate of technology to increase. The 
rise in Z results in a rising marginal product 
of labor that bids up the real wage and 
results in a rising level of employment. This 
can be seen from the long-run time paths of 
these variables: 


z,=(1+7)z,_,+¢, 
In(MPL,) = In(a) + y, — L 
and 
Aln(MPL,) = (b, — a,)[7z,_,+¢,] 
L—-L_,=a,(72z,_,+¢,) 
A(w, — pj) = ¢,( 72,1 + 2), 


where c,=A(q + by) +(y—A)ay>0; c= 
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(1 — a) /(1 +1 — 0262) > 0; (b, — a5) > 0. 
Thus the model exhibits rising growth rates 
of output, technical knowledge, productiv- 
ity, employment, and real wages, since the 
growth rates of these variables depend on 
the growth rate of technology and thus on 
the level of technology. 

These properties of the model may hold 
for considerable time periods and can ac- 
count for the rising growth rates that Paul 
Romer (1986) reports for the past two cen- 
turies. However, these growth patterns may 
change substantially in the very long run, 
because they are based on the short-run 
labor supply function given by equation (5), 
and (5) is unlikely to hold in the very long 
run: 

As society becomes wealthier, the income 
effect of a rise in wages may increasingly 
come to dominate the substitution effect, 
resulting in little or no increase in work 
effort as wages rise. Indeed, the long-run 
supply of labor schedule is probably back- 
ward-bending and the long-run value of $», 
the wage elasticity of labor supply, probably 
negative.” However, the value of the root of 
output depends on the labor supply elastic- 
ity $5, and 7 falls as $, falls." As $,— 
-A/y, 770, and if ¢, takes the value 
—A/y, T=0 and the output process 
changes to a nonexplosive unit-root process. 
In this case the time paths of the variables 


?Daniel Hamermesh and Albert Rees conclude their 
discussion of labor supply by noting that “The long-run 
supply curve of employee hours to the economy as a 
whole...will probably be backward-sloping at the aver-. 
age real wage levels of developed countries.... The 
backward slope arises from the tendency in developed 
countries for hours actually worked per year to be 
reduced as real income rises over time while the over- 
all rate of labor-force participation for all groups in the 
population taken together remains roughly constant or 
rises slightly." [1984, p. 82] 

107 = [1-0/0 +0- a)y +10 — 0/0 4 — 
2a)6;), and dr /d¢,>0 if y » (1— æ). This condi- 
tion must hold if a rise in labor input is not, ceteris 
paribus, to have a negative or zero effect on techni- 
cal knowledge, and is assumed to hold throughout. 
This can be seen by substituting an aggregate form 
of equation (1) lagged one period into (3) to obtain 
Z,/Z, ," KZÜq9»LITA-9FA If y «(0-2 
then a(Z,/Z,_,)/0L,_, <0. 
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become . 
y, Tei ^ €; 
Z= Z, 40€, 
Aln(MPL,) = (5, —44)c, 


L, aa la 


A(w, d bi) =CyC, 


= 05€, 


and a, «0 but c_,(b, —a5,),0, > 0 if ġ, = 
— A / y. In this case rising productivity is 
sufficiently offset by a falling supply of labor 
to dampen explosive growth. 

Thus, if the wage elasticity is sufficiently 
negative, the model displays long-run fea- 
tures one might expect-to observe in a de- 
veloped economy with roughly constant 
population: declining employment, mani- 
festing itself as a decline in average hours 
worked per person employed, steadily rising 
real wages and productivity, and technologi- 
cal advancement. If the wage elasticity rises 
` above — A / y, then 7 > 0 and rising growth 
rates would manifest themselves again, 
though in conjunction with a decline in av- 
erage hours worked per person employed if 
Q- is still negative. Hence, the actual growth 
path will depend crucially on the long-run 
dynamics of labor supply." 

A further factor that can dampen the 
accelerating growth is limits to learning. 
Continuous learning can only occur if there 
aré exogenous changes to the environment 
in which agents work. In the absence of new 
stimuli to learning, growth would slowly pe- 
ter out. In terms of equation (3), y and A 
would approach zero over time: if y = A — 0, 
Z is constant and 7 = c = c, = 0, so growth 
ceases and the economy enters the station- 
ary state. 

Two pieces of evidence lend some sup- 
port to my formulation of the learning pro- 


11 Changes in population growth will also affect the 
growth path of output, and a falling labor supply caused 
by declining population could also dampen accelerat- 
ing growth. In this case, @, in equation (5) would fall 
over time, causing the constants ag, bg, Co, and c, to 
become time-varying. 
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cess and the output process that results 
from it. First, Romer (1986) presents empir- 
ical evidence for several industrial countries 
that growth rates of output have a positive 
trend and have been rising over the past 


` two centuries. Second, in their survey of 


technical progress, Charles Kennedy and 
A. P. Thirlwall (1972) observe that "since 
product types are continually changing it is 
probably sate to assume that in aggregate 
there is no limit to the learning process.” '? 


HI. Depreciation of Technology 


This section examines the properties of 
business cycle models if technology depreci- 
ates. Generally, technical and scientific 
knowledge is not regarded as subject to 
depreciation, except under abnormal cir- : 
cumstances such as times of war. However, 
one can think of certain technologies that, 
through changes in the state of nature, be- 
come obsolete before replacements are 
found for them. For example, in 1987, The 
Economist reported that "Bacteria are 
evolving resistance to antibiotics and insects 
resistance to pesticides faster than new 


. chemicals are being invented. Before long, 


at this rate, medicine will revert to the days 
before penicillin and agriculture to the days 
before DDT.” To examine the impact of 
depreciation in a model with endogenous 
technology, I allow Z to depreciate over 
time and replace (3) by 


G9 zi zeo | 1)" 


0«68«1 


where 6 is the rate of depreciation of tech- 
nology. 


sS Kennedy and Thirlwall (1972, p. 39). 
P March 21, 1987, p. 93. This prognosis was perhaps 
too pessimistic. In a more recent issue (April 16, 1988) 
The Economist suggests that, at least in agriculture, 
genetic engineering may overcome some of these prob- 
lems, implying that the decline in agricultural technol- 
ogy is not permanent. 


VOL. 80 NO. 4 


This section considers two models. The 
purely monetary model of Section II pro- 
vides an example of the effect of deprecia- 
tion in a model with endogenous technol- 
ogy. Second, a real business cycle model 
with depreciating exogenous technology is 
appraised. It emerges that such a real busi- 
ness cycle model cannot account for nonsta- 
tionarity if technology is subject to depreci- 
ation. 


A. A Monetary Model with Depreciation 


The path of output in a-pure monetary 
model is given by equation (18). Using 
equation (28) instead of equation (3) as the 
process that determines technical knowl- 
edge, one can write equation (18) as an 
ARMA process. Taking the first difference 
of equation (18) and substituting for /,_,, 
yields, after some manipulation, 


(29) l Y, z dy ae,-d;e, , (1 * 73) Yr-1 


dy = bj(y — A) (a + ao) | 
- [a(y - A) - 9] (a + bo) 
= a[L£a4(y-4) 8] -b(y-4). 


This process may be stationary. A sufficient 
condition for stationarity is that (1+ 7 — ô) 
<1, or that ô> r. Obviously, the higher ô, 
the rate of depreciation of technology, the 
more likely it is that this condition will hold 
and output will be stationary. Thus, once 
depreciation is introduced, the value of the 
root of output may be greater than unity, 
less than unity, or not significantly different 
from unity, and even a model with endoge- 
nous technology may yield a sanoman out- 
put process. 


B. A Real Pienen Cycle Model 
with Depreciation 


This subsection considers the effect of 
depreciation in a conventional business cy- 
cle model with exogenous technology. The 
model is that given by equation (22), but 
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with one modification, namely, that the 
“permanent” technology shocks, the. é’s, 
dissipate slowly over time, so that equation 
(2) is replaced by 


(30) fh2h*n 


=(1-8) frit Er 

and 6 is again the depreciation rate. Using 
(30) to solve backward for f, , and substi- 
tuting the result into the output equation 
(22) yields -a stationary moving-average pro- 
cess for output: 


(33) »,-4-tbytn t£, 


t b, L (1- 8), 


j=0 > 


Shocks to output have a declining impact 
over time. Because (1— 8) « 1, the output 
process represented by this equation is sta- 
tionary and has a less-than-unit root, unlike 
the model given by equation (22) without 
depreciation, which has a unit root. 

While it-is admittedly difficult to make a 
plausible case for the depreciation of tech- 
nology in general, one cannot rule out the 
fact that some depreciation may occur in 
particular fields of knowledge. If technology 
does depreciate (6 is significantly greater 
than zero), conventional real business cycle 
models yield stationary output processes and 
cannot account for the observed nonsta- 
tionarity of output or for persistence of 
innovations in output; only models with en- 
dogenous technology might display nonsta- 
tionarity under these circumstances. 


IV. Real Wages and Endogenous Technology 


Empirical studies have generally failed to : 
find significant countercyclical movements 
in the real wage. The most recent major 
study (Michael Keane et al., 1988) finds that 
real wages are mildly procyclical. This 
“stylized fact” is consistent with real busi- 
ness cycle models, but it appears to reject 
models with nominal wage contracts since 
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real wages move countercyclically in such 
models.!4 

This conclusion—that wage-contracting 
models are rejected by studies of real wage 
movements—is less easily drawn if technol- 
ogy is endogenous. Even if the money wage 
is fixed, real wages may not exhibit pro- 
nounced countercyclical movement, particu- 
larly if learning occurs rapidly in some mar- 
kets. 

The general model presented above was 
not constructed to explain the cyclical pat- 
tern of real wages. Nonetheless, examina- 
tion of the simple labor market model pos- 
tulated in Section I can yield some insights. 
The only change necessary is to introduce 
overtime rates and to assume that at the 
beginning of the period overtime rates are 
also set in money terms. In Figure 1, at an 
expected price level Py, a zise in employ- 
ment above L would result in overtime 
payments and raise the real and money 
wage. Hence the labor supply function is 
upward sloping, the slope depending on the 
structure of overtime pay. 

A monetary shock that results in an unex- 
pected rise in the price level reduces the 
real wage and the value of overtime pay- 
ments, effectively shifting the labor supply 
schedule down and moving the labor market 
"equilibrium" from A to B. Employment 
(measured in man hours) rises: money wages 
rise through overtime pay, but by less than 
prices, so that the real wage falls below 
(W, / P,). In a real business cycle model, an 
exogenous rise in productivity would shift 
the demand for labor function to the right 
(to L1) and increase both employment and 
real wages. Price and wage flexibility en- 


sures that the labor supply function does : 


not shift when prices change, and the new 
labor market equilibrium is at point C. 

The movement from A to B captures the 
conventional pattern of real wages and em- 
ployment in a monetary business cycle model 
(in the Lucas-Barro models the rise in the 
labor supply results from misperception of 


“This has led McCallum (1986) to suggest that 
price rather than wage stickiness must lie at the heart 
of cycles that are driven by aggregate demand shocks. 
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the real return from working), but this pat- ' 
tern may not develop if technology depends 
on economic factors. In this case, as before, 
a monetarv innovation boosts aggregate de- 
mand. Provided prices do not jump to a new 
higher level but adjust slowly (the excess 
money balances are spent gradually), then 
the labor supply function will begin to shift 
down slowly, raising employment. If the rise 
in employment results in a rapid increase in 
learning and innovation, the demand for 
labor will begin to rise. Thus, the downward 
movement of the labor supply schedule will 
set in motion an upward and rightward 
movement of the labor demand schedule. 
The resulting wage-employment pattern de- 
pends on the points of intersection of a 
downward-shifting labor supply function 
with an upward-shifting labor demand func- 
tion. Rising prices reduce the real wage, but 
this is counteracted by rising productivity 
and output that place downward pressure 
on the price level and upward pressure on 
real] wages; rising productivity retards the 
downward shift in the labor supply function. 
Clearly, a number of different wage paths 
are feasible, depending on the speed and 
magnitude of adjustment in the labor supply 
and demand functions in response to chang- 
ing prices and productivity. It is conceivable 
that after a brief decline, real wages rise to 
their previous level. If there is a large in- 
crease in the demand for labor, one might 
even observe a rise in the real wage. 
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To conclude, real wage movements that 
are mildly procyclical but lagging may 
emerge from this model. Once learning is 
introduced, the pattern of real wages over 
the business cycle becomes unclear: it need 
not be strongly countercyclical, even if nom- 
inal wages are fixed by contract. 


V. Summary and Conclusions 


Empirical work suggests that many inno- 
vations in technology take place in response 
to market conditions, particularly demand 
conditions. It was shown that, in a simple 
model of endogenous technical change, dis- 
turbances to output that originate on the 
supply side or the demand side of the econ- 
omy have markedly different effects than if 
technology is assumed to be independent of 
the economic process. 

Endogenous technical change acts as a 
propagation mechanism for shocks that have 
only transitory effects in conventional mod- 
els. A supply-side shock, such as a tempo- 
rary rise in productivity, or a demand-side 
shock, such as an unanticipated rise in ag- 
gregate demand, can induce a permanent 
upward shift in the aggregate production 
function. Changes on the supply side of the 
economy are not independent of changes on 
the demand side, and there is a long-run 
non-neutrality of money in models with en- 
dogenous technology. The models derived 
in this paper also imply that the growth rate 
of output will increase over time provided 
the supply of labor does not fall sufficiently 
to choke off rising, growth rates. 

It was shown that if technology depreci- 
ates, conventional real business cycle mod- 
els cannot account for the observed nonsta- 
tionarity of output. However, both real and 
monetary models with endogenous technol- 


I*The argument is made more formally in Stadler 
(1989). A question that is not satisfactorily addressed 
in the literature is the degree to which real wage 
changes are lagging over the business cycle. Victor 
Zarnowitz comments that “Money wages often rise less 
than prices in recoveries and more than prices in late 
expansion stages. There is a procyclical and lagging 
movement in labor costs per unit output." [1985, p. 
529] 
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ogy still yield nonstationary output pro- 
cesses provided the rate of depreciation of 
technical knowledge is not too high. Fur- 
thermore, the pattern of real wage move- 
ments is unclear if technical knowledge 
changes quite rapidly in response to changes 
in factor utilization. Even if money wages 
are fixed, real wages may display very little 
countercyclical movement. 

There has been some debate as to whether 
models that emphasize aggregate demand 
innovations, or models that rely on real 
shocks as.an impulse mechanism, provide a 
better characterization of output fluctua- 
tions. However, if technology is endoge- 
nous, real and monetary business cycle 
models exhibit very similar properties, as a 
comparison of equations (19) and (25) illus- 
trates. Both classes of models can account 
for the persistence of innovations in output, 
and both can account for the rise in the 
growth rates of real output observed by 
Romer. This similarity makes it difficult, at 
the present time, to reject one model in 
favor of the other in terms of providing a 
better account of the output process. 
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A Theory of Managed Trade 


By KYLE BAGWELL AND ROBERT W. STAIGER* 


This paper proposes a theory that predicts low levels of protection during periods 
of “normal” trade volume coupled with episodes of “special” protection when 
trade volumes surge. This dynamic pattern of protection emerges from a model in 
which countries choose levels of protection in a repeated game facing volatile 
trade swings. High trade volume leads to a greater incentive to defect unilaterally 
from cooperative tariff levels. Therefore, as the volume of trade expands, the level 
of protection must rise in a cooperative equilibrium to mitigate the rising trade 
volume and hold the incentive to defect in check. (JEL 411, 422) 


Two major trends have dominated the 
postwar history of trade policy in indus- 
trailized countries. One is the dramatic mul- 
tilateral reduction in tariffs negotiated un- 
der the General Agreement on Tariffs and 
Trade (GATT).! The other is the move to- 
ward “special” protection that has occurred 
as the industrialized countries of the world 
have become more integrated and as volatil- 
ity in trade flows has become a more impor- 
tant source of domestic disruption. The rise 
in special forms of protection is epitomized 
by the growing use of Voluntary Export 
Restraints (VERs), Orderly Market Ar- 
rangements (OMAs), and tariffs that are 
tailor-made to suit the needs of particular 
sectors.” These policy tools are typically uti- 


*Kyle Bagwell, Department of Economics, North- 
western University, Evanston, IL 60201; Robert W. 
Staiger, Department of Economics, Stanford Univer- 
sity, Stanford, CA 94305; and National Bureau of Eco- 
nomic Research. We thank K. C. Fung, Marie Thursby, 
and participants in the International Seminars at 
Berkeley, Stanford, UCLA, the Institute for Interna- 
tional Economic Studies in Stockholm, and the NBER 
Summer Institute for very helpful comments. We are 
especially indebted to two anonymous referees for ex- 
tremely helpful suggestions. All remaining errors are 
ours. 

tn the United States, for example, the average 
tariff level on dutiable imports (the ratio of duties 
collected to dutiable imports) fell from 53.5 percent in 
1933 to 5.2 percent in 1982 (U.S. Trade Representa- 
tive, 1984, p. 187). 

?The increasing relative importance of special pro- 
tection as an instrument of international trade policy 
has been widely noted. See, for example, S. A. B. Page 
(1979) and C. Fred Bergsten and William Cline (1983) 
for attempts to quantify this trend. 
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lized by countries to limit the rate of expan- 
sion of imports or exports from that which 
would occur absent intervention. The term 
“managed trade” is often invoked to char- 
acterize the current international trading 
environment, since it consists of a relatively 
low “baseline” or “normal” level of protec- 
tion combined with the use of special pro- 


tection to dampen underlying changes in 


trade flows. 

The low baseline level of protection sus- 
tained by countries suggests that a static 
noncooperative Nash equilibrium view is in- 
adequate to explain existing levels of pro- 
tection. One alternative is to view the exist- 
ing trading environment as the result of 
explicit agreements among countries. This 
approach to explaining levels of protection 
has been taken by Wolfgang Mayer (1981) 
and Raymond Riezman (1982). However, 
such explicit agreements require the exis- 
tence of a workable enforcement mecha- 
nism, and at the international level it is 
unclear what that mechanism might be. A 
second alternative is to consider only self- 
enforcing agreements or tacit cooperation 
among countries. As Avinash Dixit (1987) 
and Richard Jensen and Marie Thursby 
(1984) have shown, the (credible) threat of 
future punishment can sustain a more lib- 
eral trading environment than that pre- 
dicted under the static Nash equilibrium. 
These models can help explain how coun- 
tries are able to sustain a relatively liberal 
trading environment in “normal” periods. 
However, they remain silent on the issue of 
“special” protection, since they take each 
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period to be the same as every other. In this 
regard, W. Max Corden (1974, pp. 175-76) 
has argued that countries rarely initiate pro- 
tection for the purpose of capturing terms- 
of-trade gains, presumably because of the 
fear of future retaliation by their trading 
partners, but that countries do employ pro- 
tection for its terms-of-trade effects in peri- 
ods when their terms-of-trade would other- 
wise decline. Corden argues that retaliation 
by trading partners is less likely during such 
periods. This suggests that episodes of “spe- 
cial" protection might usefully be viewed as 
part of a tacit international agreement in a 
changing environment. 

We attempt to formalize this view by con- 
sidering the way in which sustainable levels 
of protection in tacit cooperative equilibria 
are affected by changes in the underlying 
trade volume. Since potentially exploitable 
terms-of-trade effects will embody greater 
potential national welfare gains the greater 
is the underlying volume of trade, periods 
of high trade volume are likely to corre- 
spond to periods of great incentive to exer- 
cise one's power over the terms-of-trade. If 
the trade volume is large enough, the imme- 
diate gains from protection may outweigh 
the losses from punishment, and free trade 
will be unsustainable. However, this does 
not imply that international cooperation 
need break down. Countries can coopera- 
tively utilize protection during periods of 
exceptionally high trade volume to mitigate 
the incentive of any country to unilaterally 
defect, and in so doing can avoid reversion 
to the noncooperative Nash equilibrium. 
Thus, surges in the underlying trade volume 
lead to periods of "special" protection as 
countries attempt to maintain some level of 
international cooperation. In this sense, the 
model we develop below depicts managed 
trade as the outcome of tacit cooperation 
among countries in the presence of volatile 
trade swings. 

We adopt a very simple partial equilib- 
rium framework within which to make these 
points. Section I lays out the basic model 
under the assumption of free trade and 
calculates the underlying free trade volume 
as a function of the parameters of the model. 
Section 1I solves for the static Nash equilib- 
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ria in the quota and tariff games. Our re- 
sults here are similar to those developed in 
Dixit (1987). These equilibria constitute the 
credible (subzame perfect) punishments in 
the dynamic game of the following section, 
the threat of which will be used to support 
tacit cooperation. The dynamic model for 
the quota and tariff games is analyzed in 
Section III, where it is shown that equilib- 
rium trade policy becomes more restrictive 
during periods of high (free) trade volume. 
This result is reminiscent of a related point 
made by Julio Rotemberg and Garth 
Saloner (19862) in the context of a repeated 
oligopoly model with demand shocks.) Our 
modeling approach is clearly inspired by 
their work. Section IV adds a second sector 
and considers the model’s implications for 
the relationship between bilateral trade im- 
balances ard protection. Section V dis- 
cusses the generality of our results and con- 
siders several extensions. Section VI offers 
some conclusions. 


I. Free Trade 


We begin with the characterization of free 
trade in a very simple partial equilibrium ' 
model of trade in a single sector between 
two countries. Let the world (two-country) 
output in the sector be fixed at 2. At the 
beginning of any period, the distribution of 
world output between the “domestic” (no *) 
and the “foreign” (*) country is determined 
by a commonly known distribution function 
F(e) that generates foreign output e € [0, 1], 
with domestic output then given by 2-— e. 
On the demand side, the domestic and for- 
eign countries are assumed to have identical 
linear demands C — a — BP and C*-a«— 


?In concluding, Rotemberg and Saloner (1986a) 
conjecture that their framework could yield a predic- 
tion of trade wars occurring in states of depressed 
demand. Our paper addresses a different issue, but in 
a similar spirit. Also, Riezman (1987) introduces ran- 
dom terms-of-trade shocks into the tariff model of 
Dixit (1987) and notes that shocks to the terms-of-trade 
that increase the current gain from defection will in- 
crease the likelihood of Nash reversion. His concern, 
however, is with the effect of unobservability of shocks 
and tariffs on the ability of countries to sustain low 
cooperative tariffs. 
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BP*, respectively, where C(C*) is the con- 
sumption level of the domestic (foreign) 
country and P(P*) is the domestic (foreign) 
country price. Competitive firms supply the 
product in each country. For simplicity, we 
assume that production costs are zero and 
that a > 2. 

Free trade will ensure that a single price 
PS prevails in both markets so that P = P* 
= P^. The equilibrium condition that world 
supply equals world demand, 2 = C(P/)+ 
C*( P^), determines the free trade price Pf 
as Pf —(a —1)/B. Thus, consumption lev- 
els under free trade are given by C(P^) 
C*(Pf) — 1. Finally, the free trade volume 
Vf is given by V/22— e — C(PP)  C*(P^) 
—e-]1-e. In periods when e=1, both 
countries have equal supply and there will 
be no trade between them. When e « 1, the 
domestic country exports the quantity 1— e. 
Hence, the domestic (foreign) country is the 
exporter (importer), and free trade volume 
rises as e falls away from 1. This completes 
the characterization of trade volume under 
conditions of free trade. 


DA Static Model of Protection 


In this section we characterize the set of 
static Nash equilibria for the simple model 
of the previous section when countries 
choose either trade taxes or quotas. These 
equilibria will serve as credible (subgame 
perfect) punishments in the dynamic games 
considered in the next section, the threat of 
which can support tacit cooperation in a 
repeated setting. 

The two countries are assumed to observe 
the current realization of e, and thus the 
trade volume that would prevail in the pe- 
riod under free trade, V/ —1— e, and then 
to choose simultaneously their protective 
policies for the period. The domestic coun- 


^Our results would not be significantly altered if 
countries had common and imperfect information about 
V/ when choosing policies. If, however, each count 
had some private information about the current V/, 
then the analysis would be considerably more complex. 
For an analysis of the role of private information in 
finitely repeated tariff games, see Jensen and Thrusby 
(1989). 
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try is thus choosing export taxes or quotas 
while the foreign country chooses corre- 
sponding import policies. Each country's 
objective is to maximize its sum of producer 
surplus, consumer surplus, and rents from 
protection. 

Consider first the determination of the 
static Nash equilibrium when countries 
choose trade quantities directly in the form 
of import and export quotas. Provided that 
quota licenses are either auctioned off by 
the governments of each country or simply 
given to their respective firms, the country 
whose quota binds—the country with the 
smaller quota—will capture all the quota 
rents. This means that as long as there is 
trade, each country can always do better by 
tightening its quota beyond that set by its 
trading partner, which leads to the well- 
known property that the unique static Nash 
equilibrium in the quota game is autarky 
(see, for example, Edward Tower, 1975). 
Thus, regardless of the realization of e and 
the underlying free trade volume V’, the 
static Nash equilibrium in the quota game 
ensures that no trade will take place. 

We turn now to the determination of the 
static Nash equilibrium level of protection 
when the domestic and foreign countries 
choose specific export and import taxes 
1(V^) and 7*(V^), respectively, as functions 
of the observed free trade volume.’ To be- 
gin, the actual volume of trade following the 
realization of the free trade volume Vf and 
the selection of trade taxes 7(V^) and 
7"(V^) is easily characterized. If trade oc- 
curs, then effective prices to producers in 
the domestic (exporting) country must be 
equal across countries and world supply and 
demand must also be equal. Since (non- 
negative) taxes cannot reverse the free di- 


5We discuss in Section V an extension of our results 
to two sectors in which only import-restricting policies 
can be used. 

*Countries are not concerned with risk sharing in 
this partial equilibrium setting, since national income, 
and thus the marginal utility of national income, is 
unaffected by shocks in this single sector. See Section 
V for a discussion of the case where sectoral income 
smoothing motivates trade policy intervention. 

"Our results are essentially unchanged if countries 
set ad valorem tariffs. 
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rection of trade, we have P*— P=r+7* 
and 2=C(P)+ C*(P*). Trade will occur 
when trade taxes are not prchibitive; in our 
model, it is easy to show that trade occurs 
provided 


(1) 2V//B8»T-7*. 


Now, assuming that (1) holds, we have 
P(r, 7) = (e - D/B —(r + 7*)/2 and 
P*(r,7*) » (a -D/B +r + 7/2, which 
gives prices as functions of trade taxes for 
the domestic and foreign country.? 

Letting W(V/,r,*) and W*(V4,7,7*) 
represent domestic and foreign country wel- 
fare, respectively, given by the sum of the 
country's consumer surplus, producer sur- 
plus, and tariff revenue, we have when (1) 
holds 


(2) WO, r,7*) = f°" c(P)aP 
P(r,*) 


+ [POP pn «vilaP 
0 
t 7X(Vf, P(7,7*)), 


(3) WS, r,r) = f" ^ PaP 
P*(7,7") 


+ [POM Flaps 
ie 


+ 7*M(VS,P*(r,7*)), 


where we have written each country’s out- 
put in terms of the underlying free trade 
volume Vf, and where X(VF, P(z,7*)) and 
M(VF, P*(7,7*)) are domestic export supply 


5Note that as long as tariffs are not prohibitive so 
that (1) holds, prices will only be affected indirectly by 
the realization of e, through and 7*. The absence of 
a direct impact of e on prices is due to the perfect 
negative correlation across countries of the supply 
shocks we consider. While a useful simplifying assump- 
tion, we argue in Section V that our results will be 
preserved in much more general models. 
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and foreign import demand, respectively, 
written as functions of Vf and tariff-dis- 
torted prices. - 

The remaining possibility is that (1) fails. 
Now trade taxes prohibit trade. This possi- 
bility corresponds to autarky, with domes- 
tic and foreign prices given, respectively, 
by P(V!)=(a—-1-V‘)/B and P*(V/)- 
(a —1--V^)/B and welfare 


(4) WV 2.0%) = f"""c(P)aP 
POVI) 
+ [+v ap, 
0 


(S) W*(V5r,r*) = f" * Pap 
P*(Vf) 


+ ("^n - vjap*. 
0 


With the payoff functions now defined by 
(2), (3), (4), and (5), a Nash equilibrium for 
the static tariff game can be defined as a 
pair of tariff functions, r,(V‘) and rV), 
such that for every V/e[0, 1], z,(V^) maxi- 
mizes W(V/,r, rë (VI) over r and 7A(VS) 
maximizes W*(V^,r (V), 7*) over 7*. 

To solve for Nash equilibria, we first char- 
acterize best-response correspondences, 
T&(V/,7*) and rE(V/,7), defined, respec- 
tively, as the maximizers of W(V/,r,7*) and 
W*(Vf,c,7*). If (1) holds, dW(Vf,v,7*)/ 
dr = VÍ/2-— (3Br + Br*)/4, and 
W(V, r, 7*) is strictly concave in r. Hence, 


(6 T74(V/,r*) -2V//38 —7*/3, 


if 7* « 2V//g. 
If instead 7*2 2V//g, then by (1) any 7 
generates autarky. The domestic country 
welfare is then independent of its tariff, and 
SO 


(7) v4(V/,7*) —[0,2), if 7* 2 2V/ 7B. 


Similar calculations for the foreign country 


5 
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FIGURE 1 


give the exactly symmetric best-response 
correspondence 7(V/,7). 

There are thus two disjoint sets of Nash 
equilibria. The interior equilibrium, found 
by solving (6) and its symmetric counterpart, 
has r,(V/) 2 V//28 =7(V‘). The other 
equilibrium set corresponds to autarky. Any 
(7,7*) such that 72 2V//B and 7*2 
2V//B forms a no-trade Nash equilibrium 
with ry(V^) = 7 and r#(V) = 7* for all Vf. 
Figure 1 illustrates the sets of Nash equilib- 
ria for the static tariff game. 

To summarize, the static tariff game be- 
tween countries generates two sets of equi- 
libria: an interior Nash equilibrium and a 
set of autarky Nash equilibria. The static 
quota game has autarky as its unique Nash 
equilibrium. ; 

Finally, we note that the equilibrium pay- 
off configurations in the static tariff añd 
quota games can be unambiguously ranked. 


In all (tariff and quota) autarky equilibria, 
payoffs are given by (4) and (5). Setting + 
and 7* in (2) and (3) first equal to zero and 
then equal to the Nash interior tariffs, it is 
straightforward to verify that welfare is 
highest for both countries under free trade 
and higher in the interior Nash equilibrium 
than in the autarky Nash equilibria if and 
only if Vf > 0? 


?Harry Johnson (1953—54) notes that one country 
may prefer the (internal) Nash tariff equilirium to free 
trade if its import demand is sufficiently elastic relative 
to that of its trade partner. John Keenan and Riezman 
(1988) have linked this possibility to differences in 
country size. While the symmetry in our model, which 
results in common interior Nash tariffs, does not allow 
this possibility to arise, the main complication it would 
introduce to our analysis is to alter the focus from 
symmetric to asymmetric tariff equilibria. See also the 
discussion in Section V. 
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III. A Dynamic Model of Protection 


We now extend the model to allow for 
repeated interaction. In particular, we ex- 
plore the sense in which a dynamic environ- 
ment enables countries to lower protection 
from the levels that would prevail in a static 
setting and characterize the relation be- 
tween the achieved protection and trade 
volume. f 

The dynamic game upon which we focus 
is simply the static game studied above in- 
finitely repeated. Thus, at the start of any 
period, a value for e (and thereby V^) is 
realized and observed by all. Current period 
protection policies are then set, and current 
welfare is determined. At the beginning of 
the next period, all past choices are ob- 
served and a new value for e (and thus V) 
is determined. We assume that e is drawn 
from the same distribution independently 
every period. 

We examine symmetric (subgame perfect) 
Nash equilibria, in which the countries co- 
operate with low, common protection levels 
and credibly threaten to forever revert to a 
static Nash equilibrium if cooperation is 
violated. In the tariff game, common coop- 
erative tariff levels imply that both countries 
share symmetrically in the ccoperative tariff 
rents. Analogously, in the quota game, we 
assume that quota rents are shared symmet- 
rically in the cooperative equilibrium.!? 

As discussed above, the most preferred 
symmetric trade policy is free trade, and 
lower symmetric levels of protection are al- 
ways preferred jointly to higher symmetric 
levels of protection. For some values of vt, 
we will see that the threat of reversion is 
sufficient to generate free trade. However, 
for other values of V^, free trade cannot be 


10Tt is natural to focus on common protection levels 
in this simple model, where countries are assumed 
symmetric. Moreover, one can show that symmetric 
rent sharing supports the highest degree of coopera- 
tion in this model. A more general model is discussed 
in Section V, where our basic conclusions hold but 
asymmetries play a real role. See also Robert Feenstra 
and Tracy Lewis (1987), who find in a different context 
that sharing the rents from protection allows countries 


- to avoid trade wars. 
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maintained and the cooperative level of 
protection entails positive symmetric tariffs. 

We consider first the tariff game. The 
cooperative trade tax function, v, = T V^, 
must provide each country with no incentive 
to defect. That is, for every Vf, the ex- 
pected discounted welfare to each country 
under the strategy 7.(V‘) must be no less 
than the welfare achieved by the country 
when defecting and thereafter receiving the 
expected discounted welfare associated with 
a static Nash equilibrium. Clearly, a country 
choosing to defect does best by selecting a 
tariff on its reaction curve. Thus, from (6), if 
countries are cooperating and allowing 
trade, the optimal tariff with which to defect 
is 


(8  c5(Vf, T.) = 2V/73B — 7, /3 
= r*X(V^, T). 


We now fix Vf and a cooperative tariff 
level 7, and characterize the static incentive 
to defect. Let 


(9) OVE TVS, e), Te) 
=W(VI, toV", Te), Te) 
SWV Tarte) 
(10) Q*(V n, r5 (V/,r,.)) 
SW (Vinari) 


EE WAVE or) 
represent the respective static gains from 
defection for the domestic and the foreign 
country. 

Figure 2 illustrates the static incentive to 
defect from the cooperative tariff 7,2 0. 
The foreign import demand curve is given 
by the downward sloping line with slope 
— 1/B. The domestic export supply curve is 
given by the upward sloping line with slope 
1/8. The cooperative tariff 7, results in 
prices P*(7,,7,) and P(*,, 7,) in the foreign 
and the domestic country, respectively. The 
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P,P* 


P'(1, 15) = P* CHT) 
P*(T,,t) 


P(T,,T.) 





BAGWELL AND STAIGER: THEORY OF MANAGED TRADE 785 


FIGURE 2 


foreign gain from trade under 7, is the sum 
of the surplus from trade, given by the area 
of the triangle djc, and the foreign share of 
cooperative tariff revenues, given by the area 
of the rectangle jcba. The domestic gain 
from trade under 7, is analogously given by 
the sum of the areas of the triangle oef and 
the rectangle oeba. 

Now consider a defection from 7,. We 
know from (8) that both countries will de- 
fect to the same rp. Hence, whichever 
country defects, prices will be P*(7,,7))= 
P*(rp,7,) and P(r,, 75) - P(rp, 7.) in the 
foreign and domestic countries, respec- 
tively. The foreign static gain from defec- 
tion is then given by the net increase in its 
collection of tariff revenues, represented by 
the difference between the rectangles onml 
and kbci, minus the efficiency loss in 
its surplus from trade, represented by the 
triangle hic. Analogously, the domestic 
exporter's static gain from defection is 
given by gjih — kbel — mle. As is evident 


from Figure 2, the equivalence 
Q(V4,r5(V,c.),.) =Q (VI, TTV, TA) 


holds for all 7, > 0. All results concerning 
the static incentive to defect can therefore 
be expressed in the domestic country nota- 
tion. 

Using the envelope theorem, we find that 


dQ(V', rp (Vf, Teh 7.) 


(11) avi 
= [r5(V4,7.) - 7.] 72, 
f f 
(12) dQ(v ,Tp(V ,1,). 7.) 
dr, 
B y? 
= qe Tp Vf, T.) x 
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Using (8), we see that Q(V4,-,(V4,7,.),7,) is 
strictly increasing in V/ and strictly decreas- 
ing in 7, if and only if 


(13) 


Provided that the cooperative tariff is below 
the static interior Nash tariff, the incentive 
to defect from a fixed 7, is larger the larger 
is Vf and the smaller is 7,. 

These conditions are simple to interpret. 
As the underlying free trade volume in- 
creases, the incentive to defect gets larger. 
This occurs because the terms-of-trade gains 
from defection are applied tc a larger trade 
volume, that is, because more tariff revenue 
is collected from one's trading partner un- 
der defection when the underlying trade 
volume is high. The incentive to defect can 
be mitigated by increasing the cooperative 
tariff, which acts to reduce the volume of 
trade. Thus, when the trade volume surges, 
one might suspect that a high 7, would be 
required to avoid defection. This is indeed 
what we will find. 

Having characterized the static incentive 
to defect, our next step is to examine the 
expected future loss suffered by a country 
that defects. Letting E be the expectations 
operator with expectations taken over V^ 
and ó be the discount factor, we represent 
the respective present discounted values of 
the expected future gain from not defecting 
today for the domestic and the foreign 
country as 


c, «VI /2B. 


09. i [EMV s QD), (70) 
- EW(VS, tu (Vf), v. (V/))] 
-o(z()), 

Q5 Ew (VV), T) 


1-5 
- EW*(V!, cy (Vf), ty(V4))| 
= w*(1,(-)). 


Since e (and thus V^) is i.i.d. across periods, 
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œ and w* are independent of the current 
value of Vf as well as the current value of 
v (V^). The function 7,(-) will affect œ and 
w*, however, since the function's distribu- 
tional characteristics influence the corre- 
sponding expected values. Observe that w 
and o* will be strictly positive when à» 0 
and z,(V^) « r (V°) for all V, in which 
case the threat of future punishment is 
meaningful. 

' We focus on the case where punishment 
involves infinite reversion to the interior 
Nash tariff equilibrium of the static game."! 
Using (2) and (3), we then have 


(16) e(1C)) = o* (C) 


8. [oj (EVI) 
~ (1-8) 8B 


-Elez + (Evn P], 


where of is the variance of V^ and o is 
the variance of r (V^). Note that the 'ex- 
pected future gain from cooperating is : 
higher when c/f and EV! are higher, hold- 
ing fixed 7,(V“)." This reflects the fact that 
the gains associated with cooperation (low 
protection) are increasing and convex in the 
underlying free trade volume. 


"The case of reversion to autarky is similar, as 
detailed in Bagwell and Staiger (1988). The autarky 
threat actually generates the most cooperation possi- 
ble, since it is the harshest punishment that countries 
will endure. (This point is made at a general level by 
Dilip Abreu, 1988). Yet, its very severity makes it 
somewhat less plausible than the threat to revert to the 
interior equilibrium, As Dixit (1987) notes, the possibil- 
ity of autarky reversion is also significant for the tariff 
game since the existence of two static Nash equilibria 
makes possible cooperation even in finite horizon set- 
tings. (See Jean-Pierre Benoit and Vijay Krisna, 1985, 
and James Friedman, 1985, for more general discus- 
sions of this issue.) 

The assumption of linear demand generates a 
welfare function that is quadratic ín v. Since the Nash 
tariff is linear in V/, only the mean and variance of V/ 
appear in (16). The higher-order moments might be 
important with nonlinear demand. 
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We have now characterized both the im- 
mediate gain from defection and the ex- 
pected future loss. Both expressions are 
identical across countries, which enables us 
to focus on the domestic country hence- 
forth. Now for credible cooperation to oc- 
cur, the cooperative trade tax function, 
t(V/), must be such that at every V, no 
country has incentive to defect, or 


(17) AVS toV, c (V5), (/)) 


x o(v,(-)). 


This is our fundamental *no defection" 
condition, which implicitly defines a cooper- 
ative trade tax function. 

There will in general be many functions 
that satisfy (17). To characterize the “most 
cooperative" trade tax function, we hold o 
fixed at a constant level and solve for the 
lowest, nonnegative r, satisfying (17).? This 
process generates a trade tax function, with 
independent variables V and c, and deter- 
mines the necessary properties that the most 
cooperative trade tax function must satisfy. 

To begin, fix w » 0. For V/ —0, (8) and 
(9) establish that 0.(0, 7)(0,0), 0) = 0, so that 
(17) is satisfied by 7, — 0. Holding +, fixed 
at zero and increasing V^, we know from 
(11) that Q(V4,7,(V4,0), 0) increases mono- 
tonically. If œ is not too large, which will 
always be the case if ô is not too large, then 
there exists a critical value of V^, V^, such 
that O(VS,7,(V4,0), 0) = w. 

Solving this explicitly gives 


(18) VI = J6Bo . 


Hence, free trade is sustainable for V/ € 
[0, V4]. 

For more extreme values of V/, where 
Vf € (V7,1], (17) will be violated at 7, = 0. 


BA focus on the most cooperative equilibrium seems 
natural here, since countries are free to communicate 
about which self-enforcing equilibrium they will settle 
on, and the most cooperative equilibrium is the only 
symmetric equilibrium that is not Pareto dominated. 
Indeed, the GATT may be viewed as a forum within 
which the most cooperative (self-enforcing) trading ar- 
rangements are codified. 
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From (12), 7, must then rise above zero to 
reestablish (17). Explicit calculation yields 
the following representation of the most 
cooperative tax rule: 


(19) 7,(V/,o) 


0, if V! e[0,V^] 
-(yt-yf 


———— ii VF EVE A], 
28 | [77,1] 


The corresponding cooperative trade vol- 
ume is given by 


V/ if V! efo, V] 
(20) V*-4 Vf - yf —" 
—— — it V/ e[V7,1] 


The next figure summarizes (19) and Q0). 
Figure 3 plots 7, and V* as a function of 
Vf, The threat to revert to the static interior 
Nash equilibrium supports free trade over a 
range of moderate trade volume. Intuitively, 
if the underlying free trade volume is low, 
then the (current) incentive to defect with a 
tariff is low, even if the natural flow of trade 
is unrestricted (7, — 0). Once the trade vol- 
ume becomes extreme, the trade tax func- 
tion increases with the magnitude of the 
volume. Here, the incentive to defect is 
large because the natural flow of trade is 
high, and so the volume of trade must be 
mitigated somewhat (7, > 0, V* < V^) in or- 
der to prevent defection. 

The above analysis characterizes the nec- 
essary features of the optimal trade tax 
function and gives us an expression T= 
TV, œ). The analysis was conducted under 
the assumption of an exogenously given w. 
In fact, as (16) illustrates, œ depends on the 
7. (-) function, œ = w(7,(-)). To establish ex- 
istence of the optimal trade tax function, we 
must ensure that these equations are consis- 
tent, so that the w with which we began is 
also the w value that (V^, c) generates. 
Substituting the first equation into the sec- 
ond and using (16), (18), and (19), we can 
write the resulting equations as o(w)= a, 
since w(r,(-)) is independent of V^. The 
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FIGURE 3 


most cooperative trade tax function can then 
be represented as vr, — (V^), when the 
largest @ such that à €(0,1/6B) and c) 
= @ is substituted into v(V7, œ). 

We must now prove that such a fixed 
point does exist. Observe first that a fixed 
point does occur at à = 0, corresponding to 
the continual play of the static interior equi- 
librium. This follows since 7(V/,0)— 
vA (V^), by (19). To explore the possibility of 
a positive root, we explicitly calculate 
Er (V^) from (18) and (19) and use (16) to 
get 


(21) &(w) = at +(EVS) 


mal 
881-8) 
BEIC v). 


if œ €[0,1/68], where F(V^) is the distri- 
bution function for V^. It is now straightfor- 


ward to verify that with primes denoting 
derivatives, o(@ = 0)= 0, &'(w =0)=%, 
@'(w =1/68)=0, and à'(») «0 for we 
[0,1/68]. Hence, a necessary and sufficient 
condition for a unique fixed point ô € 


(0,1/68) is H(1/6B) « 1/68, or 


(22) ê< ôy- 


3| o? + (Ev^)] +4 7 


This will clearly hold if 8 and/or [o7/ + 
(EV‘)*] is sufficiently small. Thus, under 
this condition, the threat of interior Nash 
reversion generates a unique, most coopera- 
tive trade tax rate, with the properties given 
in Figure 3. 

If instead (22) fails, then à(1/68) 
1/6B. Since +(V‘, v) =0 for all o > 1/68, 
this case corresponds to free trade for every 
Vf. Taking these results together, we have 
now established a unique, most cooperative 
trade tax function, which can be expressed 
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solely in terms of V/ and other exogenous 
parameters, such as 6, o/, and EV’. 

The trade tax function is easily under- 
stood. When 8 and [o -- (EV^?] are small 
and (22) holds, the threat of interior Nash 
reversion is unimpressive. Intuitively, since 
6 is small, future losses from defection 


are not weighted heavily, while a small , 


[of +(EV/)2] implies that the reduced 
trade volume induced by the reversion is 
not expected to be large or variable and 
does not therefore represent a great loss in 
welfare. This case corresponds to Figure 3 
with œw defined via the fixed point condition 
in terms of exogenous variables. If on the 
other hand ô and [o7 - (EV^)?] are large 
and violate (22), then the reversion threat is 
acute; and so free trade is sustainable for all 
Vi. 

Note that (19) and (21) together imply 
that surges in trade volume will tend to lead 
to greater increases in protection the more 


"unusual" they are for the sector under. 


consideration. That is, the level of protec- 
tion sustainable in the cooperative equilib- 
rium of the dynamic tariff game will depend 
on the realization of V/ and its mean EV! 
on the one hand, and on the variance of V^, 
op, on the other. If free trade is sustain- 
able when Vf = EV, then all else equal, a 
given increase in V^ above EV/ will be 
associated with a higher cooperative tariff 
the more "unusual" is the trade volume 
surge (the smaller is o7/). 

To summarize, we have demonstrated that 
a high natural trade volume increases the 
incentive to defect. ‘Protection is: then 
needed to mitigate the volume of trade, 
thus sustaining the cooperative equilibrium. 
Higher values of ô and [o +(EV‘)?] act 
to make punishment more severe, and 
therefore make cooperation easier facing 
any given free.trade volume V/.4 


14The threat to revert to a static Nash equilibrium is 
not in this model renegotiation-proof, as defined by 
Joseph Farrell and Eric Maskin (1987). Our basic con- 
clusion as to the: relation between trade volume and 
protection is consistent with the potential for renegoti- 
ation, however. To construct punishment schemes that 
will not be renegotiated, specify asymmetric tariffs (off 
the equilibrium path) so that the country that did not 
cheat enjoys the punishment phase. 
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The basic results of our analysis also carry 
through if countries explicitly choose trade 
quantities (quotas) rather than prices 
(taxes).^ In particular, suppose countries 
set import and export quotas, and then ei- 
ther give the chosen quantity of quota li- 
censes to their firms or auction them off. As 
noted in Section HI, the unique static Nash 
equilibrium of this game is autarky, which 
then characterizes the credible punishment 
for defection.^ While as in the tariff game 
the current incentive to defect from the 
tacit cooperative quota is increasing in un- 
derlying trade volume, it is greater for a 
given trade volume in the quota game than 
in the tariff game, since defection in the 
quota game secures all the rents of protec- 
tion for the defecting country." Calcula- 
tions similar to those above yield the follow- 
ing representation of the most cooperative 
quota rule q*(V7, w): 


(23) a (Vf, v) 
VF for Vf e[0,V^] 


Vi y(V/y-4Bo 


2 2 
for Vf € (V/,1]. 


Figure 4 summarizes (23). Here the coop- 
erative quota level q (Vf, w) is plotted 
against the underlying free trade volume V. 
For low to moderate levels of V^, free trade 
is sustainable and is reflected in a move- 
ment along the 45° line. When V^ passes 
the threshold value of V/- 68», how- 
ever, free trade is no longer sustainable as a 
cooperative: equilibrium. Moreover, any 


DRotemberg and Saloner (1986b) develop a model 
to analyze the effect of quotas on the ability of firms to 
collude. We differ in focusing on tacit cooperation 
between governments when firms are competitive. 

Since there is no interior.static equilibrium for the 
quota game, we focus on the autarky threat, which is 
the threat that generates the most cooperation. See 
also fn. 11. : 

In terms of Figure 2, defection in the quota game 
captures the additional revenue rectangle aklo (ajik) 
for the importer (exporter), increasing the static gains 
from defection above that in the tariff game. 
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quota that restricts trade by less than the 
quota to which countries would optimally 
defect from free trade will only increase the 
static incentive to defect: since such a quota 
could not bind under defection, it would 
simply reduce the current welfare under 
cooperation for both countries but have no 
impact on current welfare under defection. 
For this reason, Figure 4 shows a disconti- 
nuity at V, with higher free trade volumes 
leading to a discrete tightening of the trade 
restricting cooperative quota. At this lower 
cooperative quota, the optimal defection 
entails slightly tightening the quota so as to 
secure all cooperative quota rents. By 
choosing a sufficiently low cooperative 
quota, the countries reduce the size of co- 
operative quota rents and thus the incentive 
to defect.! 


'81 is interesting to compare the levels of coopera- 
tion attainable in the tariffs and quota games. The 


IV. Protection and the Trade Balance 


In this section we explore how the re- 
lationship between trade volume and pro- 
tection analyzed in the previous section 
translates into a relationship between trade 
balance and protection. While the model 
developed above relates levels of protection 
most directly to trade volume, there is a 





autarky threat always provides the most cooperation, 
for a given policy instrument. Incentives to cheat are 
highest, however, when quotas are used, since all rents 
can be captured by defection. Thus, the threat of 
autarky reversion in the tariff game generally supports 
the most liberal trading environment. Whether quotas 
are preferred to tariffs with the interior reversion threat 
depends on the impatience of countries: the more the 
future is valued, the more likely it will be that tariffs 
with the threat of interior Nash reversion are inferior 
to quotas in supporting liberal trade. This comparison 
is further analyzed in Bagwell and Staiger (1988). 
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large informal empirical literature on trade 
imbalance as a determinant of protection, 
and it may be useful to explore the implica- 
tions of our model in this regard. 

The basic insight developed in the previ- 
ous sections is that the incentive to defect 
unilaterally from cooperative tariff levels is 
highest when trade volume is highest: there- 
fore, as the volume of trade expands, the 
level of protection rises in a cooperative 
equilibrium to mitigate the rising trade vol- 
ume and hold the incentive to defect in 
check. When applied to a single sector as in 
the previous section, this implies that peri- 
ods of greater than average trade volume 
will be associated with higher than average 
protection. We now show that when applied 
in the aggregate, this implies that the rela- 
tionship between aggregate trade imbalance 
and the level of protection will depend on 
the sectoral makeup of the imbalance. A 
widening trade imbalance will be associated 
with greater levels of protection to the ex- 
tent that the deficit (surplus) country’s im- 
ports (exports) increase. However, a widen- 
ing trade imbalance will be associated with 
lower levels of protection to the extent that 
the deficit (surplus) country’s exports (im- 
ports) fall. 

To show this, we consider the addition of 
a second sector to the model of the previous 
section and explore the relationship be- 
tween the bilateral trade imbalance associ- 
ated with trade between the home and for- 
eign countries and their levels of protection. 
For simplicity, we suppose that the home 
and the foreign country trade only in these 
two products, and that there is a large rest 
of the world with which both countries also 
trade, so that the bilateral trade in these 
two goods is small relative to each country’s 
total world trade. Thus, we stay within the 
partial equilibrium frameword of the previ- 
ous sections and focus on the relationship 
between bilateral trade imbalance and pro- 
tection.? | 

To keep things simple, suppose that the 
second product is identical to the first in 


See Bergsten (1982) on the importance of 
Japan-U.S. bilateral trade imbalance in determining 
levels of protection between the two countries. 
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every way except that while the first good is 
exported by the home country, the second 
good is exported by the foreign country. In 
particular, suppose foreign (home) output 
of good 1 is given by e [0,1] (2— e) and 
that the home (foreign) output of good 2 is 
given by e €[0,1] (2— e). As before, e- is 
determined independently in each period by 
the commonly known distribution F(e). Un- 
der this setup, free trade volumes will be 
identical in the two sectors and the results 
of the previous section imply that the two 
goods will share identical levels of protec- 


'tion in the cooperative equilibrium for every 


realization of e (and thus V^). Therefore, 


71 — 72 =r, for all Vf, and 7, is increasing 


in V/. But the bilateral trade balance will be 
given by 


Q4 TB-[2-e-C.]-IC, - e] 
-2-(C,4 €) 
-2-(C,* Cf)-0, 


where the second-to-last equality follows 
from the symmetry of the setup which en- 
sures that C, = Cf. 

What we have shown is that observations 
on the trade balance are not generally suf- 
ficient to determine the path of protection: 
in the extreme case illustrated here, the 
trade balance is always zero, independent of 
the value of V^, even though the coopera- 
tive level of protection will change with V, 
as was the case in the previous section. 
Hence, to predict the relationship between 
trade imbalance and protection, our model 
suggests that information on the sectoral 
makeup of the trade imbalance will be 
needed.?? l 


2A separate issue that arises with the introduction 
of more than one sector is the notion of multimarket 
contact developed by B. Douglas Bernheim and 
Michael Whinston (1987). The central idea is that 
multimarket contact allows the pooling of incentive 
constraints over markets: if slack exists in the incentive 
constraints of some markets, pooling may augment the 
degree of cooperation sustainable in other markets. In 
the special two-sector case considered above, there is 
no gain from pooling incentive constraints since the 
sectors are identical in all relevant ways. More gener- 
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V. Extensions 


We discuss in this section several further 
extensions that can be introduced to the 
model. We begin by exploring the generality 
of the relationship between underlying free 
trade volume and the incentive for coun- 
tries to defect that characterizes the mocel 
of the previous sections. The property that 
at least one country's incentive to defect 
from free trade rises during periods when 
the underlying free trade volame is high is 
crucial in generating the positive correlation 
between cooperative protection levels and 
trade volume depicted above. We now ex- 
amine the generality of this relationship and 
provide support for a presumption in its 
favor. 

Expressions (25) and (26) depict, respec- 
tively, the domestic (exporting) and foreign 
(importing) country's static incentive to de- 
fect from free trade for general export sup- 
ply (X(k, P) and import demand 
(M(k*, P*)) functions: 


(25) Q(k,k*,75,0) 
=[P*(k,k*, 7,0) - Pf] 
XX(k, P(k,k*,v5,0)) 
pf 


= [ X(k, P) 
P(k,k*,75,0) 


— X(k, P(k, k*,75,0))]dP. 
(26) QO*(k,k*,0,75) 
- [P/ — P(k,k*,0,75)] 
M(k*, P*(k, k*,0,75)) 


2o (P'Gok*,0, p) ^» 
ie [M(k*, P*) 


— M(k*, P*(k,k*,0,7)) | aP*. 





ally, however, the ability of governments to pool incen- 
tive constraints across sectors is likely to undermine a 
strict sector-by-sector relationship between trade vol- 
ume and protection. Instead, a surgz in overall bilat- 
eral trade volume would lead to an overall rise in 
bilateral protection. 


SEPTEMBER 1990 


The first term in expressions (25) and (26) is 
the tariff revenue collected from one’s trad- 
ing partner. The second term in each ex- 
pression is the efficiency loss in the surplus 
from trade associated with defection. The 
parameters k and k* represent general pos- 
itive shift parameters in the export supply 
and import demand functions, respective- 
ly. Thus, by definition ôX(k, P)/dk » 0 
and dM(k*, P*)/ak* » 0 for all P and P* 
and, provided àX(k, P)/8P » 0 and 
aM(k*, P*)/8P* <0, then dV// dk > 0, and 
dV! / dk* >. 

We focus on the implications of a shift in 
the export supply or import demand func- 
tion for the incentive to defect in the coun- 
try where the shock originates; that is, we 
consider the signs of dQ(-)/dk and 
dQ*(-)/dk*. We wish to establish a pre- 
sumption that these two derivatives are pos- 
itive, thereby ensuring that at least one 
country's incentive to defect from free trade 
rises whenever the underlying free trade 
volume rises. Using (25) and (26), direct 
calculation establishes the following: 


5 dQ(-) oig 206 7D pf 
—— > ————— à o€——À— € 
een) : ak nit ni, 
OX(k,P 
> f” IKUE ) ap, 
P(k,k*,tp,0) ôk 
(28) dQ*(-) 
dk* 





ieee) Pf 
ACE. ni * nj, 


aaa e) ae 
pf ok* : 


where qf and nf, are, respectively, the price 
elasticities of export supply and import de- 
mand (taken positively) evaluated at free 
trade. The right-hand side of (27) gives the 
additional efficiency loss from rp suffered 
by the defecting home country associated 
with the outward shift of its export supply 
function. Likewise, the right-hand side of 
(28) gives the additional efficiency loss from 
Tp Suffered by the defecting foreign country 
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associated with the outward shift of its im- 
port demand function. The left-hand side of 
these two expressions captures the impact 
of the terms-of-trade changes associated 
with the shock on each country's incentive 
to defect: the less elastic are the export 
supply and import demand functions at free 
trade, the greater will be the (free trade) 
terms-of-trade loss for the country within 
which the shock originates, and the greater 
will be the corresponding gain from defec- 
tion. 

From (27) and (28) it is clear that a shock 
to the export supply or import demand 
function that takes the form of an increase 
in k or k*, respectively, will increase the 
incentive to defect from free trade for the 
country within which the shock occures pro- 
vided that m/£---/ are sufficiently small, 
that is, provided that the elasticities of ex- 
port supply and import demand evaluated 
at free trade are sufficiently small. But in 
the context of a defection, this is likely to be 
the case, since the relevant elasticities are 
very short run in nature; that is, they reflect 
the time it takes for one's trading partner to 
detect a defection and respond. Thus, with 
a negligible (immediate) response of export 
supplies and import demands to a small 
price change from P^, conditions (27) and 
(28) are likely to be satisfied, and at least 
one country's incentive to defect from free 
trade will rise whenever the underlying free 
trade volume rises. This supports the pre- 
sumption in favor of an increasing relation- 
ship between free trade volume and the 
incentives of at least one country to defect 
from free trade, and suggests that the flavor 
(though not the transparency) of our results 
would be preserved in much more general 
settings.?! 


2lWe have also assumed throughout that e is iid. 
through time. Rotemberg and Saloner (1986a) have 
made the analogous assumption for their model, but 
this approach has been criticized by John Haltiwanger 
and Joseph Harrington (1987). As applied to our model, 
the criticism is that if a high V^ today makes more 
likely a high V/ tomorrow, then the cost of defection 
today rises for both countries. The predictions of the 
iid. model are reversed, however, only if 6 is suffi- 
ciently large. Since the small 5 case seems especially 
appropriate for elected governments, the possibility of 
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We turn next to the interpretation of the 
export tax. While it is true that import taxes 
are more commonly utilized than are export 
taxes, the latter are nonetheless observed, 
especially in the primary product markets of 
developing countries.” Still, it is interesting 
to ask whether our basic results would be 
preserved in a world in which the imposi- 
tion of export taxes were politically infeasi- 
ble. In this regard, it is possible to amend 
the two-sector model considered in Section 
IV with an ad hoc requirement that coun- 
tries are unable to impose export taxes, 
perhaps because of political pressures. In 
this amended model, there is a unique static 
Nash equilibrium in which each country im- 
poses the import tax rg(V/,0) = 2V//38 on 
its import good. Cooperation in an 
infinite-horizon setting is then made possi- 
ble with the threat to forever revert to the 
static equilibrium if a defection from a low 
tariff is ever observed. Our basic result 
readily extends to this model. During peri- 
ods of high-volume trade, each country will 
have a large incentive to deviate to the tariff 
2Vf/3B on its import good. This incentive 
is reduced only if the cooperative tariff is 
allowed to rise somewhat, so that each 
country is appeased in its import market. 
Thus, even if countries have no direct con- 
trol over the volume of trade in their export 
sector, higher trade volume will continue to 
correspond to higher protection. 

Finally, we have chosen to focus on gov- 
ernments that pursue protection for- its 
terms-of-trade effects, and thus have con- 
sidered a model in which full international 
cooperation would entail the maintenance 
of free trade in all periods. However, the 
basic insights we have developed are consis- 
tent with other government objectives. For 
example, suppose instead that import pro- 
tection is used by governments to mitigate 
sudden drops in the real income of import- 
competing producers. Such an objective is 





correlated shocks is likely to be consistent with our 
conclusions for an interesting range of parameters. 

^ See, for example, World Development Report 
1986, especially chap. 4, for a discussion of the export 
tax policies of developing countries. 
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in the spirit of Corden’s (1974) Conservative 
Social Welfare Function. If trade volume 
into the importing country surges, either 
because of a “good” supply realization 
abroad or a “bad” supply realization at 
home, the importing country may respond 
with “special” protection even under full 
international cooperation. However, the 
fully cooperative response typically will 
trade off the desire to maintain the real 
income of import-competing producers at 
home not only with the interests of domes- 
tic consumers, but with the interests of the 
export sector abroad as well, and thus re- 
sults in less complete maintenance of im- 
port-competing producer incomes than 
would be individually optimal. Thus, the 
workings of such a model would be qualita- 
tively similar to the model we have studied 
here: unusual surges in trade volume would 
tend to be associated with unusually large 
static gains from defection ta a high tariff 
for the government of the importing coun- 
try, and an increase in the equilibrium level 
of protection (above its fully cooperative 
level but still below the noncooperative 
choice) would be required to avoid a com- 
plete breakdown in cooperation, that is, a 
tariff war.? 


VI. Conclusions 


We have attempted to develop a theory 
of managed trade that correlates periods of 
unusually high trade volume with increased 
protection. While the model we have cho- 
sen is special in a number of ways, the 
insights it generates appear to be much 
more general. 

In particular, the notion that periods of 
unusually high trade volume present coun- 
tries with an unusually strong incentive to 
defect from cooperative trading arrange- 
ments seems to be quite general and forms 


This particular interpretation is also suggestive of 
the way in which international agreements, even if 
constrained to be self-enforcing in nature, may help to 
limit the policy discretion of a government vis-a-vis its 
own private sector, thereby alleviating time-consistency 
problems of the kind studied in Staiger and Guico 
Tabellini (1987). 
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the heart of our analysis. Given this, it fol- 
lows naturally that countries will attempt to 
manage the volume of trade with protective 
instruments that serve to dampen fluctua- 
tions in trade volume. Trade management 
can then be understood as an attempt by 
countries to maintain the self-enforcing na- 
ture of existing international cooperation. 
Finally, in exploring the sustainable level 
of tacit cooperation among countries in a 
volatile environment, we have made no for- 
mal distinction between special protection 
devices that are provided for explicitly in 
the GATT and those that are not. The 
“safeguards” provisions of the GATT, 
whereby countries are given the right to 
raise protection in the event of unforseen 
developments, may to some extent repre- 
sent an explicit institutional manifestation 
of our ideas. Our analysis suggests a role 
for safeguard provisions when trade volume 
is unexpectedly high as a means of avoiding 
a reversion to noncooperative interaction 
among countries. In this light, the recent 


. proliferation of safeguard "substitutes," for 


example, VERs and OMAs, may reflect the 
general inadequacy of the existing safe- 
guards provisions to maintain the credibility 
of the rest of the GATT system. 


"Consistent with this interpretation is the observa- 
tion of Kenneth Dam (1970): “One may conclude that 
the GATT escape clause is a useful safety valve for 
protectionist pressures and does not undercut in any 
serious way the advantages of the GATT tariff negoti- 
ating system. Insofar as the excape clause is a political 
‘prerequisite’ to the membership in the GATT of cer- 
tain contracting parties—most notably the U.S.—the 
argument in its favor is even stronger" (pp. 106—107). 
For a discussion of GATT safeguards, see also J. David 
Richardson (1988). 
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Comparative Advantage and Long-Run Growth 


By GENE M. GROSSMAN AND ELHANAN HELPMAN* 


We construct a dynamic, two-country model of trade and growth in which 
endogenous technological progress results from the profit-maximizing behavior of 
entrepreneurs. We study the role that the external trading environment and that 
trade and industrial policies play in the determination of long-run growth rates. 
Cross-country differences in efficiency at R&D versus manufacturing (i.e., com- 
parative advantage) bear importantly on the growth effects of economic structure 
and commerical policies. (JEL 411, 111, 621) 


What role do the external trading envi- 
ronment and commercial policy play in the 
determination of long-run economic perfor- 
mance? This central question of interna- 
tional economics has received surprisingly 
little attention in the theoretical literature 
over the years. 

Previous research on trade and growth 
has adopted the neoclassical framework to 
focus on factor accumulation in the open 
economy. (See the surveys by Ronald Find- 
lay, 1984, and Alasdair Smith, 1984.) This 
research largely neglects the effects of trade 
structure on rates of growth, however, ad- 
dressing instead the reverse causation from 
growth and accumulation to the pattern of 
trade.’ The direction that the research fol- 
lowed almost surely can be ascribed to the 
well-known property of the standard neo- 
classical growth model with ciminishing re- 
turns to capital that (endogenous) growth in 
per capita income dissipates in the long run. 


*Grossman: Woodrow Wilson School, Princeton 
University, Princeton, NJ 08544. Helpman: Depart- 
ment of Economics, Tel Aviv University, Ramat Aviv, 
Israel. We are grateful to Avinash Dixit, Carl Shapiro, 
and Lars Svensson for their comments on an earlier 
draft, and to the National Science Foundation and the 
Alfred P. Sloan Foundation for financial support. 
Grossman also thanks the World Bank and Helpman 
thanks the International Monetary Fund for providing 
support and a stimulating environment during part of 
our work on this project. These organizations are not 
responsible for the views expressed herein. 

1An exception is Max Corden (1971), who studies 
how the opening up of trade affects the speed of 
transition to the steady state in a two-factor neoclassi- 
cal growth model with fixed savings propensities. 
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For this reason, the familiar models that 
incorporate investment only in capital 
equipment seem ill-suited for analysis of 
long-run growth. 

The available evidence collected since the 
seminal work of Robert Solow (1957) also 
leads one to look beyond capital accumula- 
tion for an explanation of growth. Exercises 
in growth accounting for a variety of coun- 
tries generally find that increases in the 
capital to labor ratio account for consider- 
ably less than half of the last century’s 
growth in per capita incomes. Although 
econometric efforts to explain the residual 
have been somewhat disappointing (see, for 
example, Zvi Griliches, 1979), professional 
opinion and common sense continue to im- 
pute much of this residual to improvements 
in technology? We share the view, ex- 
pressed by Paul Romer (1986, 1990), that a 
full understanding of growth in the long run 
requires appreciation of the economic de- 
terminants cf the accumulation of knowl- 
edge. 

In this paper we draw on the pioneering 
work by Romer to construct a model that 
highlights the roles of scale economies and 
technological progress in the growth pro- 
cess. As in Romer’s work, our model im- 


?See Angus Maddison (1987) for a careful, recent 
exercise in growth accounting. ; 

3The benefits of education and experience undoubt- 
edly contribute part of the explanation for the growth 
residual. See, for example, Robert Lucas (1988) and 
Gary Becker and Kevin Murphy (1990) for growth 
models that highlight the role of human capital accu- 
mulation as a source of growth. : 
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plies an endogenous rate of long-run growth 
in per capita income, and we study its eco- 
nomic determinants. Our primary contribu- 
tion lies in casting the growth process in a 
two-country setting. We provide, for the first 
time, a rigorous analysis linking long-run 
growth rates to trade policies and other 
international economic conditions. More- 
over, we show that recognition of cross- 
country differences in economic structure 
impinge upon conclusions about the long- 
run effects of domestic shocks and policies. 

Our model incorporates the essential in- 
sights from Romer (1990), although we in- 
troduce some differences in detail. The 
building blocks are an R&D sector that 
produces designs or blueprints for new 
products using primary resources and previ- 
ously accumulated knowledge, an interme- 
diate-goods sector consisting of oligopolistic 
producers of differentiated products, and a 
consumer-goods sector in each country that 
produces a country-specific final output us- 
ing labor and intermediate inputs. As in 
Wilfred Ethier (1982), total factor produc- 
tivity in final production increases when the 
number of available varieties of differenti- 
ated inputs grows. Thus, resources devoted 
to R&D contribute over time to productiv- 
ity in the production of final goods, as well 
as to the stock of scientific and engineering 
knowledge. 

The new elements in our analysis stem 
from the assumed presence of cross-country 
differences in the effectiveness with which 
primary resources can perform different ac- 
tivities, that is, comparative advantage. For 
simplicity we specify a one-primary-factor 
model, and allow the productivity of this 
factor in the three activities to vary interna- 
tionally. Similar results could be derived 
from a multifactor model with interindustry 
differences in factor intensities (see Gross- 
man and Helpman, 1989d, and Grossman, 
1989). In any event, we find that many com- 
parative dynamic results hinge on a compar- 
ison across countries of parameters reflect- 
ing efficiency in R&D relative to efficiency 
in manufacturing the goods that make use 
of the knowledge generated by R&D. The 
effects of policy in a single country, of accu- 
mulation of primary resources in a single 
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country, and of a shift in world tastes to- 
ward the final output of one of the coun- 
tries all depend upon the identity of the 
country in which the change originates in 
relation to the international pattern of com- 
parative advantage. 

We describe the economic setting in Sec- 
tion I below. Then, in Section II, we derive 
the dynamic equilibrium of the world econ- 
omy and calculate two reduced-form equa- 
tions that determine the steady-state growth 
rate. In Section III, we investigate the struc- 
tural determinants of long-run growth. 
There, the implications for growth of varia- 
tions in consumer preferences, primary- 
input coefficients in one or both countries, 
and stocks of available primary resources 
are considered. Section IV contains policy 
analysis. We study barriers and induce- 
ments to trade in consumer goods and sub- 
sidies to research and development. The 
analysis is extended in Section V to incorpo- 
rate lags in the dissemination of knowledge 
and asymmetries in the speed of diffusion 
within and between countries. We use the 
extended model to reconsider the effects of 
trade policies on the steady-state rate of 
growth. Finally, Section VI provides a brief 
summary of our findings. 


I. The Model 


We study a world economy comprising 
two countries, Each country engages in three 
productive activities: the production of a 
final good, the production of a continuum 
of varieties of differentiated middle prod- 
ucts (i.e., intermediate inputs), and research 
and development (R&D). A single primary 
factor is used in production, and is taken to 
be in fixed and constant supply in each 
country. Although we refer to this factor as 
“labor,” we have in mind an aggregate of 
irreproducible resources that for any given 
level of technical know-how limits aggregate 
output. f 

At a point in time, output of final goods 
in country i is given by 


s B/a 
Y= BAL? f z(o)" do| , 
0 


0<a,B <1, 
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where Ly, represents employment in the 
final-goods sector, x@) denotes the input 
of middle product w, A, is a country-specitic 
productivity parameter, and n is (the mea- 
sure of) the number of varieties of middle 
products available at that time.* This pro- 
duction function exhibits constant returns to 
scale for given n, but an increase in the 
measure of varieties of middle products 
raises total factor productivity. This specifi- 
cation, which we borrow from Ethier (1982), 
captures the notion that an increasing de- 
gree of specialization generates technical 
efficiency gains. The economy's potential for 
augmenting the degree of specialization by 
developing new middle products implies the 
existence of dynamic scale economies at the 
industry level that are external to the indi- 
vidual final-good-producing firms. 
Competition in the final-goods sectors en- 
sures marginal-cost pricing. Hence, by ap- 
propriate choice of the constant B, pro- 
ducer prices satisfy 
l B/Q0— 2) 


Q) py= (2) [oso ‘de 





where w; is the wage in country i and 
Px(w) is the price of variety w. Final-good 
producers worldwide pay the same prices 
for (freely traded) middle products. 

At every moment in time, the existing 
producers of middle products engage in 
oligopolistic price competition. The pro- 
ducer of a variety o in country ; chooses 
Px(@) to maximize profits, l 


m(w)= 


[ Py(@) -= Wia; xi] 
Px(w) * 
x r— —— —— 


" z BŽ; Pyi¥;, 
f pxo) do 


‘Here, and henceforth, we omit time arguments 
when no confusion is caused by doing so. 


SEPTEMBER 1990 


where a, x; is the unit labor requirement 
for production of intermediates in country 
i. This expression for profits comprises the 
product of profits per unit (in square brack- 
ets) and derived demand for variety o, 
where the latter incorporates the assump- 
tion that neither the prices of competing 
products nor the value of final production 
varies with py(w). The first-order condition 
for a profit maximum implies the usual 
fixed-markup oricing rule, 


(2) apy(w) = wid, y; 


It is clear from (2) that varieties originat- 
ing from the same country bear the same 
price. Letting py; represent the price of a 
variety produced in country i and n, be the 
number of intermediate inputs produced 
there, equations (1) and (2) imply 


w; V7? NC 
(3) pu (| (Enp) ; 
J 


(4) QP yi = Wil, xj. 


With these prices, profits per firm can be 
expressed as 


(5) T;—(1—- a)py; X; / ni. 


where X; is aggregate output of intermedi- 


ates in country i (n; times per-firm SUID 
and is given by 


Ni Px; 
() X= 


4"jPxj 


olen} 


As in our 1989c paper (see also Kenneth 
Judd, 1985), resources devoted to research 
generate “blueprints” that expand the mea- 
sure of differentiated products. Research 
outlays are made by private, profit-maxi- 
mizing entrepreneurs, who appropriate 
some of the benefits from a new technologi- 
cal innovation in the form of a stream of 
oligopoly profits. We assume that innova- 
tors receive indefinite patent protection, but 
that blueprints are not tradable, so that all 
profits must be derived from production in 
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the country in which a middle product has 
been developed. Then, with free entry by 
entrepreneurs, if resources are devoted to 
R&D in country i at time f, the present 
value of future operating profits from pro- 
ducing there—discounted to time t —must 
equal the current cost of R&D. We denote 
R&D costs by c, (1) and write the zero-profit 
condition as : 


f e m7 Pa) de = e), 
t 


where R(t) is the cumulative interest factor 
from time 0 to time t (R(0) — 1). Differen- 
tiating this condition with respect to t, we 
find 


TjtÓ. o. 
(7) m 


ni 


Equation (7) expresses a standard no- 
arbitrage condition. Recognizing that c,,,(t) 
represents the value of an input-producing 
firm in country i at time t, (7) equates the 
instantaneous rate of return on shares in 
such a firm (the sum of dividends and capi- 
tal gains) to the rate of interest. 

We follow Romer (1990) in assuming that 
R&D generates a second output, which 
takes the form of a contribution to the stock 
of disembodied knowledge. Knowledge here 
includes all general scientific information, 
as well as some forms of engineering data 
with more widespread applicability, gener- 
ated in the course of developing marketable 
products. Knowledge contributes to the pro- 
ductivity of further research efforts by re- 
ducing the amount of labor needed for an 
inventor to develop a new product. Due to 
the more general and non-patentable na- 
ture of this product of the R&D effort, 
appropriation of the resulting returns by the 
creator seems problematic. We assume to 
begin with that general knowledge dissemi- 
nates immediately and costlessly throughout 
the world. This approximates a situation in 
which information spreads through techni- 
cal journals, professional organizations, and 
interpersonal commercial contacts, and 
where literature, scientists, and business- 
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people move freely across international bor- 
ders (see Luigi Pasinetti, 1981, ch. 11). We 
relax this assumption by introducing lags in 
the dissemination of knowledge in Sec- 
tion V. 

With these knowledge spillovers in mind, 
we specify our R&D technology as follows. 
If L,; units of labor engage in research in 
country i, they generate a flow of new prod- 
ucts 5; given by i 


(8) ni z LiK/ Arnis 


where K is the current stock of knowledge 
and a,,,; is a country-specific productivity 
parameter. We take the stock of knowledge 
to be proportional to cumulative experience 
in R&D; that is, there are no diminishing 
returns to research in adding to scientific 
understanding. By choosing units for K so 
that the factor of proportionality is unity, 
we have K =n and 


(9) K- XL,K/aj,- 
i 


Since knowledge has been assumed to be a 
free input to each individual entrepreneur, 
the cost of product development in country 
i can be written as 


(10) 


Cni = Wi y sj /n. 


We turn now to the demand side of the 
model. Consumers worldwide share identi- 
cal, homothetic preferences. They view the 
final goods produced in the two countries as 
imperfect substitutes. We represent prefer- 
ences by a time-separable intertemporal 
utility function 


Qi) Ue f e 
t 


xlog u[ yi( 7), v; (7)] dr, 


where p is the subjective discount rate and 
yr) is consumption of final goods from 
country i in period r. The instantaneous 
sub-utility function u(-) is nondecreasing, 
strictly quasi-concave, and positively linearly 
homogeneous. 
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A typical consumer maximizes (11) sub- 
ject to an intertemporal budget constraint. 
With u(-) linearly homogeneous, this opti- 
mization problem can be solved in two 
stages. First, the consumer maximizes static 
utility for a given level of expenditure at 
time r, E(7). The solution to this subprob- 
lem generates an indirect utility function, 
UL pyr), py(MIE(7), where py, is the 
price of Y;. In the absence of barriers io 
trade in final goods, these prices are com- 
mon to consumers in the two countries: The 
second-stage problem involves choosing the 
time pattern of expenditures to maximize 


(2). v fere "og vlps 0) 0.0] 


+log E(r)} dr 


subject to 
(13) [oe RO FOE( 7) 
t 


< f e IRO-ROw(1) Ldr+Z(t), 
t 


where w(t) is the consumer’s wage rate at 
time t, L is his labor supply, and Z(t) is the 
value of his time ¢ asset holdings. The inter- 
est factor in (13) is common to all individu- 
als as a result of trade on the integrated 
world capital market, but the wage rate 
varies by country. 

From the first-order conditions to this 
problem, we find that the optimal path for 
expenditure obeys 


É . 
(14) eR p. 


Savings are used to accumulate either own- 
ership claims in input-producing firms or 
riskless bonds issued by these same firms. 
Arbitrage ensures that the rates of return 
on these two assets are equal, and in equi- 
librium consumers are indifferent as to the 
composition of their portfolios. 
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H. Equilibrium Dynamics 


During the course of the development of 
our model in the previous section, we pro- 
vided some of the equilibrium conditions. 
For example, we derived pricing equations 
for goods and a no-arbitrage condition re- 
lating equilibrium asset returns. In this sec- 
tion we complete the list of equilibrium 
requirements by adding conditions that stip- 
ulate market clearing in factor and final- 
goods markets. We then derive and discuss 
a reduced-form system that describes equi- 
librium dynamics. 

Static equilibrium in the markets for the 
two final goods implies 


(15) Dy;Y; = SE, 


where s({py,,Py2) is the share of world 
spending allocated to Y, and E is world 
spending on consumer goods. The share 
function is, of course, homogeneous of de- 
gree zero. We establish below that relative 
commodity prices are constant in the vicin- 
ity of a steady state with active R&D sectors 
in both countries. For this reason, we take 
5; to be constant in our subsequent analysis, 
and omit its functional dependence on rela- 
tive prices. 

The labor-market clearing conditions 
equate labor supply and labor demand in 
each country. Using (3) and Shephard's 
lemma, we see that final-goods producers 
demand (1—- B)py,Y; / w; workers. The de- 
mand for labor by middle-products manu- 
facturers is a, y;X;, while (8) and the fact 
that K =n imply demand for labor by prod- 
uct developers of (a, „i / n)n;. Hence, 


(16) (a,,; /n)h; t a, x; Xi 
t (1- B)py;Y; / w; 7 Lp 


where L; is the labor force available in 
country i. 

Since we neglect here the monetary de- 
terminants of the price level, we may choose 
freely a time pattern for one nominal vari- 
able. It proves convenient to specify the 
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(17a) py, = (a,x, jd) s 

We show in Appendix A of our 1989a work- 
ing paper that, with this normalization, a 
necessary condition for convergence to a 
steady state with positive R&D in both 
countries (i.e., nonspecialization) is 


(17b) Py. =a, yx2 ara) t 
Together, (17a) and (17b) imply that relative 
prices of middle products are constant along 
the convergent path, which further implies 
with (4) the constancy of relative wages, and 
with (3) the constancy of relative prices of 
final goods. This last fact justifies our treat- 
ment of expenditure shares as fixed. 

Let g(t) denote the rate of growth of the 
number of products and the stock of knowl- 
edge; that is, g =ñ/n= K/K. Then from 
(17) and (4) we see that prices of intermedi- 
ates and wages grow at rate g, while from 
(10), product development costs are con- 
stant. Equations (5)-(7), (10), (15), and (17) 
imply 











18 Am mbi Be 
(5 m Lnb; n 
and 

: 1 BE 

19 R= 

(19) e-l Lnb’ 


where b; =(a,,; / a, x;)^. The coefficients b; 
will serve as our measures of comparative 
advantage. Country 1 enjoys comparative 
advantage in conducting R&D if and only if 
b, <b. 

Since wages grow at the same rate as n, it 
proves convenient to define e = E/n. Let- 
ting o,=n,/n be the share of products 
manufactured in country i and noting that 

- Yn, /n, (16), (15), 17), (4), and (18) 
imply 





GROSSMAN AND HELPMAN: COMPARATIVE ADVANTAGE GROWTH 801 


where we have defined H = Y;L;/a,,, the 
total effective labor force, o= Yo;jb, a 
weighted average of the comparative advan- 
tage parameters with product shares as 
weights, and s = Ys, /b;. Observe that the 
parameter o, which provides a useful sum- 
mary of the static intersectoral resource al- 
location, grows (shrinks) over time if and 
only if the growth rate of the number of 
differentiated middle products in the coun- 
try with comparative disadvantage in R&D 
exceeds (falls short of) that of the other 
country. 

We are now prepared to derive two equa- 
tions that describe the dynamic evolution of 
the world economy. From the definition of 
e, we have é/e = E/E — g, or, substituting 
(14), (19), and (20), 


eye 
e ag 





se— H — p. 


Hence, the rate of increase of spending per 
middle product is larger the greater is 
spending per product and the smaller is the ~ 
share ef the country with comparative dis- 
advantage in R&D in the total number of 
varieties. 

Now, from the definition of the product 
shares 2 their rates of change are given by 

6; / 0; - f; / n; — ^ / n. Using (16) together 
with (17, (4), (18), and (20), we obtain 


1- B 





S; 
22 d-h;-— 
( ) 9j i b, 


€ 





where h;=L;/a_,,; is effective labor in 
country i, and Lh; = H. Since the evolution 
of the two product shares are related by 
X40, 0, we can replace (22) by a single 
differential equation in e. Making use of 
the fact that o = 27;8;b;, we find 


Sic 1— 
: Fe~o(H- "s. 
a a 


where h = Y,hjb;. 


(23 ¢=h- 
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aH/s (1-8) 
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h/H«1/s 


FIGURE 1 


Equations (21) and (23) constitute an au- 
tonomous system of differential equations in 
e and c. The solution to this system, to- 
gether with (13), (20), and the definition of 
c, provide a complete description of the 
evolution of spending and the number of 
products in each country. From these, the 
paths for outputs, employments and final- 
goods prices are easily derived. Thus, we 
shall use this two-equation system to ana- 
lyze equilibrium dynamics. 


54 restriction that we have not explicitly taken into 
account to this point is that product development must 
be nonnegative in each country. This condition cer- 
tainly is satisfied in the neighborhood of a steady-state 
equilibrium with positive growth, but it need not hold 
all along the convergent path to such an equilibrium. 


In Figures 1 and 2 we depict the station- 
ary points for e. We draw the é = 0 locus as 
increasing and concave (see (21)). To under- 
stand the positive slope of this curve, ob- 
serve from (19) that the interest rate can be 
expressed as 


R= Be/o(e —1). 


Thus, an increase in spending per product 
increases the interest rate and (from (20) 
reduces the zate of growth of n (the former 
because the profitability of R&D rises with 


(24) 





Thus, strictly speaking, the equilibrium dynamics that 
we describe below apply for sure only in the vicinity of 
a steady state. 
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ah/(1-8) 
aH/s(1-8) 
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h/H»1/s 





Ficure 2 


derived demand, the latter because more 
spending means less savings and hence less 
investment). Since an increase in the inter- 
est rate raises the rate of growth of nominal 
spending, and the rate of growth of e is just 
the difference between the rates of growth 
of E and n, it follows that an increase in e 
raises the growth rate of e. To compensate 
for this acceleration in spending per prod- 
uct, if e is to be stationary, o must rise. An 
increase in o lowers the interest rate and 
raises the rate of growth of n, thereby re- 
ducing the rate of growth of e. 

Next, we distinguish two subcases de- 
pending on the relative sizes of A/H and 
1/s.$ It can be shown that h/H>1/s if 


SA borderline case arises when b 17 53; that is, 
when comparative advantage is absent. Then h/H = 
1/5, and g = b always. Convergence to the steady-state 
level of e is immediate in this case; see Grossman and 
Helpman (19892). 


and only if (b, — b) (haba / s, — hib, /s;) 
>0. Therefore, the case h/H>1/s, de- 
picted in Figure 2, applies, for example, if 
the shares of the two countries' final out- 
puts are in proportion to their relative ef- 
fective labor forces; that is, h, / s, = h; / S2- 
But a bias in size relative to budget share of 
final output can reverse the inequality and 
hence the relationship between A/H and 
1/s. This gives us the case h/H «1/s, 
shown in Figure 1, with which we begin the 
discussion. 

In general, both 1/s and h/H must lie 
between b,,, and 5,,,. The & =0 curve in 
Figure 1 is everywhere downward sloping, 
crosses the horizontal axis at 5 / H, and is 
discontinuous at o — 1/5. The slope of the 
curve is understood as follows. For ce = 0, 
we must have 6,-— O0, which requires that 
the resources available for R&D in each 
country be just sufficient to preserve the 
country's share in the world's number of 
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varieties. Consider country 1 and suppose 
for concreteness that this country has com- 
parative advantage in R&D. Then an in- 
crease in o lowers g, thereby reducing the 
resources needed for production of middle 
products. The fall in e, also reduces the 
amount of R&D country 1 must perform to 
preserve its share in the number of prod- 
ucts. Ceteris paribus, c, would tend to rise. 
An increase in e, on the other hand, diverts 
resources away from R&D to production of 
middle and final products in country 1. But 
it also causes the world’s rate of product 
growth to fall, thereby diminishing the 
amount of R&D country 1 must undertake 
to maintain its share of middle products. 
The relative magnitudes of these two effects 
depend upon country 1’s relative size, and 
on the share of its final product in aggregate 
spending. In the case under consideration, 
the second effect dominates, and so the 
o = 0 curve slopes downward. 

In this case, there exists a unique steady 
state shown as point 1 in the figure. For 
initial values of o not too different from 
that at point 1, a unique trajectory (saddle- 
path) converges to the steady state. This 
trajectory, labeled SS, fulfills all equilibrium 
requirements and satisfies the intertemporal 
budget constraint with equality. Along this 
trajectory (in the vicinity of the steady state), 
the interest rate and profit rate are declin- 
ing (see (24)) and nominal expenditure E is 
rising. If the country with comparative ad- 
vantage initially has a share of products that 
is smaller (larger) than its steady-state share, 
expenditure rises more slowly (rapidly) than 
the number of products. 

The case depicted in Figure 2 arises when 
h/H>1/s. Then the ó = 0 schedule slopes 
upward. If the curve intersects the é=0 
locus in the positive orthant at all, it must 
intersect it twice, as at points 1 and 2.’ The 


7The geometry supports this claim, once we recog- 
nize that the ¢ = 0 curve asymptotes to the horizontal 
line at aH /s(1— B), whereas the é=0 curve asymp- 
totes to the horizontal line at a( H+ p)/s(1— B). The 
algebra provides confirmation, as simple manipulation 
reveals that the steady-state growth rate solves a 
quadratic equation. 
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lower point (point 1) represents the steady 
state with the higher rate of growth (growth 
rates increase as we move down along the 
é=0 schedule, as we demonstrate below) 
and indeed the growth rate corresponding 
to point 2 may be negative. More impor- 
tantly, as we show in Appendix B of our 
1989a working paper, the equilibrium at 
point 1 exhibits saddle-path stability, 
whereas that at point 2 is locally unstable.® 
To the right of point 1, the saddle-path 
leading to that point remains trapped in the 
area bounded by the é=0 locus and the 
line segment joining points 1 and 2, and is 
everywhere upward sloping. Thus, the quali- 
tative properties of the dynamic trajectory 
that leads to a stable, positive-growth equi- 
librium in Figure 2 mimic those of the sta- 
ble saddle-path in Figure 1. 

For the remainder of this paper we shall 
restrict our discussion to stable steady-state 
equilibria with positive rates of growth. That 
is, we focus our attention on equilibria such 
as those at the points labeled 1 in Figures 1 
and 2. In the steady state there occurs in- 
traindustry trade in middle products and 
interindustry trade in consumer goods, with 
the long-run pattern of trade determined by 
comparative advantage, productivities in the 
two final-goods industries, and consumer 
preferences. 


III. Determinants of Long-Run Growth 


Our model generates an endogenous rate 
of long-run growth. We now are prepared 
to explore how economic structure and eco- 
nomic policy affect this growth rate. In this 
section, we derive the implications of sec- 


*We strongly suspect, however, that whenever there 
exist two positive-zrowth, steady-state equilibria in the 
admissible range, there also exists a third (saddle-path 
stable) steady-state equilibrium with zero growth. We 
have established the existence of such an equilibrium 
for some parameter values, but so far have been unable 
to construct a general existence proof. Since the equi- 
librium at point 1 in Figure 2 can only be reached if 
the initial value of c is less than that at point 2, we 
suspect that initial values of e in excess of that at 
point 2 (and perhaps only these) imply convergence to 
a steady state with zero growth. 
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toral productivity levels, country sizes, and 
demand composition for the steady-state 
growth rate. The influence of trade policies 
and of subsidies to R&D are treated in the 
next section. 

We derive the long-run values of e and o 
by setting é and ¢ to zero in (21) and (23). 
The steady-state magnitudes, e and c, solve 





(25) — + 





(26) (1-&s)+5H=h. 


a 


Whenever 1/s > h/H, these equations 
provide at most one solution for (é,¢) con- 
sistent with g > 0. When 1/s < h/ H, there 
may be two such solutions, in which case we 
select the stable equilibrium, that is, the 
one with the smaller values for € and o. 
Stability implies, in this latter case, that the 
o = Q curve intersects the é= 0 curve from 
below (see Appendix B of our 1989a work- 
ing paper). We make use of this condition, 
namely, 


Bz aH —(1— B)se 


CD rr^  SH-h 


fo 1/s<hA/H, 


in signing the comparative-dynamics deriva- 
tives that follow. 

The growth rate of the number of vari- 
eties in the steady-state equilibrium can be 
derived from the solution to (25) and (26), 
together with (20). From this, we can easily 
calculate the growth rate of output. In the 
steady state, nominal expenditure grows at 
rate g, while (3) implies that py, grows at 
rate [1— 8 /(1— &)]g. From these facts and 
(15), we deduce that final output grows at 
rate Bg /(1— e). 

It is worth noting at this point that the 
steady-state equations (25) and (26), as well 
as the equation for g, do not rely on our 
assumption of perfect capital mobility. In 
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the absence of capital mobility, the steady 
state would be the same as long as con- 
sumers worldwide share identical prefer- 
ences (and therefore common subjective 
discount rates)? 

It is instructive to begin the discussion 
with the case in which neither country ex- 
hibits comparative advantage in conducting 
R&D; that is, b,— b,— b. This case has 
h=bH and s=1/b. Then (25) and (26) 
provide a unique solution for @ and c, 
which upon substitution into (20) yields the 
long-run growth rate 


(28) 


This equilibrium growth rate shares much in 
common with that derived by Romer (1990) 
for a closed economy. In particular, the 
growth rate rises with effective labor H and 
declines with the subjective discount rate. 
Our measure of effective labor adjusts raw 
labor for productivity in R&D (recall that 
H=¥,L;/4a,,;), so greater effectiveness in 
research in either country, as well as a larger 
world labor force, necessarily means faster 
growth. Long-run growth does not, how- 
ever, depend upon coefficients that deter- 
mine absolute productivity in the intermedi- 
ate or final goods sectors (such as A; or 
axi). Nor do properties of the instanta- 
neous utility function u(-), including the 
product composition of final demand, play 
any role in the determination of g. As we 
shall see presently, all these features (ex- 
cept for the absence of an effect of A; on 
8) are special to a world without any com- 
parative advantage. 

Consider next the case with 1/5 A/H. 
The curves €=0 and o =0 in Figure 3 
describe the initial situation, with a unique 
initial steady state at point 1. Now suppose 


"The cases of perfect and imperfect capital mobility 
do differ in their implications for the steady-state share 
of each country in aggregate spending E. However, as 
should be clear from (25) and (26), the cross-country 
composition of E does not matter for the issues taken 
up in the present section. 
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h/H<I/s 


a(H+p)/s(I-B) 


ah/(I-B) 





Ficure 3 


that preferences change so that s increases. 
This corresponds to a shift in tastes in favor 
of the final good produced by the country 
with comparative advantage in performing 
R&D. From (25), we see that the é=0 
curve shifts down, say to é’=0 in the figure. 
Equation (26) implies that the ¢ = 0 sched- 
ule shifts out (in the positive orthant) to 
o'=(0. The new steady state occurs at a 
point such as 2. But observe that all points 
on é’= 0 to the right of its intersection with 
ray OR are characterized by slower steady- 
state growth than at point 1. This claim 
follows from (20) and (25), whence 


Be 


(29) P-(G-ps 


^ 


Since the intersection of 9g'- 0 and é’=0 
necessarily lies to the right of the intersec- 
tion of the latter curve with OR, we have 
established that an increase in s reduces 
steady-state growth. 

When tastes shift unexpectedly toward 
the final good of the country with compara- 
tive advantage in R&D, resources there 
must be reallocated to satisfy the relatively 
higher consumer demand. A process begins 
whereby labor. there shifts out of R&D and 
the manufacture of middle products. Prod- 
ucts accumulate more slowly in this country 
than in the other, and over time its share of 
middle products falls (i.e., o rises). Output 
per middle product changes by the same 
proportion in both countries (see (18)). So, 
in the new steady state, the country with 
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comparative disadvantage in R&D is re- 
sponsible for a relatively larger share of the 
world’s innovation, with adverse conse- 
quences for the common steady-state growth 
rate. Of course, the opposite conclusion ap- 
plies when s falls. Moreover, the same re- 
sults obtain at stable equilibrium points 
when 1/s « h/H (see our 1989a working 
paper). We have thus proven the following: 


PROPOSITION 1: Stronger relative demand 
for the final good of the country with compar- 
ative advantage in R&D lowers the long-run 
share of that country in the number of middle 
products and slows long-run growth of the 
world economy. In the absence of compara- 
tive advantage in R&D, the long-run growth 
rate is independent of the relative demand for 
final goods. 


Next we consider the dependence of 
growth on the sizes of the effective labor 
forces. Effective labor may grow without 
affecting cross-country comparative advan- 
tage either because the stock of irrepro- 
ducible resources expands, or because the 
productivity of labor in all uses (or in R&D 
and intermediate-good production) rises 
equiproportionately. In the first experiment, 
suppose that both countries experience 
equiproportionate, once-and-for-all in- 
creases in the sizes of their effective labor 
forces. We have already seen that this 
change would augment world growth in the 
absence of comparative advantage. In our 
1989a working paper we prove that the same 
result carries over to a world with inter- 
country differences in relative productivities 
at R&D and manufacturing. We establish 
the following there: 


PROPOSITION 2: An equiproportionate, 
once-and-for-all increase in the effective labor 
forces of both countries accelerates’ long-run 
growth. | 


Greater resources generate faster growth in 
our model, as in Romer (1990), essentially 
because dynamic scale economies character- 
ize long-run production. 

We investigate next the effects of an in- 
crease in the effective labor force of a single 
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country. Conceptually, it proves convenient 
to decompose this change into two ele- 
ments. First, we increase h and H by the 
same percentage amount. This percentage 
is chosen to equal the product of the share 
of the expanding country in the world’s ef- 
fective stock of labor times the percentage 
increase in effective labor force that the 
expanding country actually experiences. This 
accounts for the total percentage change in 
H when H; changes. Then we must. adjust A 
with H fixed to arrive at the appropriate 
change in A. 

As an intermediate step, let us consider 
the effects of an increase in h alone. This 
corresponds to an increase in the effective 
labor of the country with comparative disad- 
vantage in R&D, and a decrease in the 
effective labor of the other, so that the sum 
remains constant. This imaginary reshuffling 
of the world’s resources shifts the ¢ =0 
schedule upward when 1/s>h/H, and 
downward otherwise. In either case, the é — 
0 curve is unaffected and the new steady- 
state point lies on this curve to the right of 
the original point. Noting (29), this proves 
the following: 


LEMMA 1: A reallocation of resources be- 
tween countries that maintains a constant 
world stock of effective labor raises the long- 
run growth rate and increases the long-run 
product share of the relatively R&D efficient 
country if and only if the share of this country 
in effective labor increases. 


When the effective labor force of only 
country 1 (say) increases, h rises by propor- 
tionately more or less than H, according to 
whether country 1 has comparative disad- 
vantage or advantage in R&D. If country 1 
has comparative advantage in R&D, then 
both the uniform increase in H and A and 
the adjustment (lowering) of h, that to- 
gether comprise the effect of an increase in 
Hi, serve to accelerate world growth. But if 
country 1 has comparative disadvantage in 
R&D, the two effects work in opposite di- 
rections. The increase in resources, by 
Proposition 2, speeds growth; but the real- 
location of given resources, by Lemma 1, 
slows growth. The net effect is ambiguous, 


COP ee PS. 
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as the numerical examples that we present 
in Appendix D of our 1989a working paper 
serve to demonstrate. We have established 
the following: 


PROPOSITION 3: The long-run growth rate 
is higher the larger is the effective labor force 
of the country with comparative advantage in 
R&D. A larger effective labor force in the 
country with comparative disadvantage in 
R&D may be associated with faster or slower 
growth, depending upon the exient of produc- 
tivity differences. In the absence of compara- 
tive advantage, long-run growth is faster the 
larger is the effective labor force of either 
country. 


These results emphasize the novel fea- 
tures of growth in a world with distinct 
countries and intercountry differences in 
relative productivities. They also suggest 
that findings reported by Paul Krugman 
(1990) may be somewhat special. A country 
need not enjoy faster growth by joining the 
integrated world economy, if the country 
enjoys substantial comparative advantage in 
R&D. Moreover, growth in resources or 
improvements in the productivity of existing 
resources do not guarantee faster long-run 
growth in a world equilibrium with free 
trade. If resources expand or become more 
efficient in the country with comparative 
disadvantage in R&D, then the resulting 
intersectoral reallocation of resources 
worldwide might slow innovation and growth 
everywhere. 


IV. Economic Policy 


In this section we discuss the effects of 
tariffs, export subsidies, and R&D subsidies 
on long-run growth. In order to do so, it is 
necessary for us to introduce the relevant 
policy parameters into the equations that 
describe instantaneous and steady-state 
equilibrium. To avoid repetition cf the de- 
tailed arguments presented in Section I, we 
present here only the necessary modifica- 
tions of the model and then explain their 
implications for the steady-state conditions. 
We restrict attention to small taxes and 
subsidies; this restriction facilitates exposi- 


rape x 
a ar ien 
at E, 


RI 
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tion, as the channels through which eco- 
nomic policies affect.long-run growth can be 
seen more clearly. We confine our analysis 
of trade policies to those that impede or 
encourage trade in final goods. 

The introduction of taxes and subsidies to 
the model necessitates consideration of the 
government's budget. As usual, we assume 
that the government collects and redis- 
tributes net revenue by lump-sum taxes and 
subsidies. In a static framework, this speci- 
fication suffices to determine completely the 
government’s budgetary policy. But in a dy- 
namic framework the budget need not bal- 
ance period by period, so budgetary policy 
in general must specify the intertemporal 
pattern of lump-sum collections and trans- 
fers. However, with perfectly foresighted 
and infinitely lived agents, our model ex- 
hibits the Barro-Ricardo neutrality prop- 
erty. Hence, we need not concern ourselves 
with the intertemporal structure of budget 
deficits so long as the present value of the 
government’s net cash flow equals zero. 

The presence of the aforementioned poli- 
cies modifies the decision problem for con- 
sumers in country 1 in two ways. First, we 
replace the price of good i in (12) by T; Pyp 
where 7,=1. With this formulation, py; 
remains the producer price of final good i, 
T,» 1 represents a tariff in country 1 on 
imports of consumer goods, and T, <1 rep- 
resents a subsidy by country 2 on exports of 
final output. Second, we add the present 
value of net taxes to the right-hand side of 
(13) as a lump-sum addition to consumer 
wealth. The amount of this collection or 
redistribution will differ across countries ac- 
cording to their policies. 

These modifications do not affect (14), 
which continues to describe the optimal in- 
tertemporal pattern of expenditures for 
consumers worldwide as a function of the 
pattern of equilibrium interest rates. In a 
steady state with é= 0, (14) reduces to 


(30) R-g-p. 


The effects of a country 2 import tariff and a 
country 1 export subsidy can be derived symmetrically, 
so we neglect these policies here and leave the maxi- 
mand for consumers in country 2 as before. 
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Notice that (30). implies that in any steady 
state in which countries grow at the same 
rate, long-run equalization of interest rates 
obtains. This property of our model holds 
irrespective of the presence or absence of 
international capital mobility and the pres- 
ence or absence of tariffs or export subsi- 
dies on final goods and subsidies to research 
and development. 

Turning to the production side, our poli- 
cies do not alter equations (3)-(8) describ- 
ing pricing and output relationships in the 
intermediate and final-goods sectors and the 
technology for knowledge creation. How- 
ever, R&D subsidies do change the private 
cost of R&D. We replace (10) by 


(10) 


where 5, » 1 represents subsidization of re- 
search costs in country i. It proves conve- 
nient to redefine our numeraire to normal- 
ize for the effect of the R&D subsidy on the 
price of intermediate inputs in country 1. 
Our new normalization dictates a modified 
equation for the price of intermediates pro- 
duced in country 2 as well. Together, these 
relationships, which replace (17a) and (17b), 
can be written as 


(17) Dx 


As for the market-clearing conditions, the 
factor-markets equation (16) is not affected, 
but we must replace (15) by 


Cri = Win / NSi» 


1 
; 7 (S41 x; / Ani) seep 


S4 Ei 
T. T s; E,, 


I 


5) 


pns 


where E, denotes aggregate spending by 


consumers in country i, and the shares of 
spending devoted to good i by residents 
of country 1 and country 2 are s= 
S(pypPy?T) and sj-s(py,py;), Te- 
spectively. Although import tariffs and ex- 
port subsidies on final goods do not affect 
steady-state producer prices of final output 
in our model! the direct response of 


l'This statement can be verified using equations (3), 
(4), and (17). 
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spending shares in country 1 to changes in 
trade policy must now be taken into ac- 
count. Moreover, R&D subsidies, if intro- 
duced at different rates in the two coun- 
tries, will affect the steady-state value of 
Py: /Pyz, and may therefore influence the 
long-run spending shares in both countries. 

This completes the necessary modifica- 
tions of the equilibrium relationships. We 
can now use the extended model to derive 
the equations describing steady-state equi- 
librium in.the presence of policy interven- 
tion. In a steady state, employment in the 
R&D sector is given by a, „ñ; / n = ainil; 
Making use of (4), (5), (f$, "o, ao, and 
(17) (which together imply ċ„=0 in a 
steady state), we find employment in the 
manufacture of middle products equal to 
a,,;0K8 + pXe —1)/S; Substitution of 
these terms into (16) yields the steady-state 
labor market-clearing condition, 


—1)(ē + 
(31) 76+ DEO 
S; 
1-8 
O 


where q;= py;Y;/n. Next, from (4)-(7), 
(30), and (17’), we obtain 





(32) (e-1)(&+p) 





re 
~BLa= 


Naturally, we also require 


(33) LG =1. 


Finally, (15’) implies 


- Sni 
(34) ; di7-p tni 


i 


It is straightforward, now, to verify that 
(31)-(34) imply (25) and (26) when T; = S; = 
1 for i— 1,2 (with ë = Y;2;). This provides a 
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consistency check on the extended model 
with policy instruments. 
We consider trade policies first. From 
(34), the ratio d, / q; satisfies 
(35) A d Sufit $1262 
d; Saë / T + $5, 





Now, for given expenditure levels 2;, equa- 
tions (31)-(33) and (35)—which constitute a 
system of five equations— provide a solution 
for (2,0,,05, d, J). In this system, the trade 
policy parameters appear only in (35). 
Therefore, the long-run effects of trade pol- 
icy depend only on their effects on 4, / q,, 
taking into account the induced adjustment 
in the spending levels e, and e,. Moreover, 
for small trade policies (i.e, with an initial 
value of T,- 1), the spending shares are 
equal across countries (5; = $;,), so the ef- 
fect on d, / d; of changes in the cross-coun- 
try composition of aggregate spending 
"washes out." 

Further inspection of (35) reveals that an 
increase in 7, starting from free trade with 
T; —1 Qie., a small import tariff i in country 
1) unambiguously raises 4, /4,.^ A tariff 
shifts demand by residents of country 1 to- 
ward home consumer products, and since 
relative producer prices do not change in 
the long run, steady-state reletive quantities 
must adjust. The effect of this change on 
the steady state is qualitatively the same as 
for an exogenous increase in world prefer- 
ence for final good 1, such as we studied in 
the previous section when we varied s;. 


The easiest way to see this is to write the right- 
hand-side of (35) as 


(Brio Py2u$:1G vo T22y2)8 + (i Py Py2)@2]/ 


[62(2y i T2 Py2)8i + $2( Pri Py2)é |, 


where #,{-) is minus the partial derivative of v(-) from 
(1) with respect to its ith argument divided by v(-). 
Then an increase in T, with py, /D y? constant clearly 
raises demand for final good 1 in country 1 (the first 
component of the bracketed term in the numerator 
increases) and lowers the demand there for final good 
2 (the first component of the bracketed ierm in the 
denominator falls). 
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Similarly, a small export subsidy in country 
2 (a reduction in T, to a value slightly below 
one) biases country 1 demand in favor of 
foreign final output. So we may apply di- 
rectly our results from Proposition 1 to state 
the following: 


PROPOSITION 4: A small import tariff or 
export subsidy on final goods reduces a coun- 
try's steady-state share in middle products and 
R&D. It increases the rate of long-run growth 
in the world economy if and only if the 
policy-active country has comparative disad- 
vantage in R&D. 


Commercial policies do affect long-run 
growth rates. They do so by shifting re- 
sources in the policy-active country out of 
the growth-generating activity (R&D) and 
into production in the favored sector. At 
the same time, a resource shift of the oppo- 
site kind takes place abroad in the dynamic 
general equilibrium. The net effect on world 
growth hinges on the identity of the country 
that favors its consumer-good industry. If 
import protection or export promotion is 
undertaken by the country that is relatively 
less efficient in conducting R&D, then 
growth accelerates; otherwise, growth decel- 
erates. 

Next, we investigate the effects of small 
subsidies to R&D, introduced from an ini- 
tial position of laissez faire. For these policy 
experiments, T,=1 before and after the 
policy change, so the expenditure levels é, 
cancel from (35). Suppose, first, that both 
countries apply subsidies at equal ad val- 
orem rates; that is, $, = S, = S. In this case, 
relative prices of final output do not change 
across steady states. Therefore, the spend- 
ing.shares 5;; do not change. In Appendix C 
of our 1989a working paper, we totally dif- 
ferentiate (31)-(33) and (35) with respect to 
S to prove the following: 


PROPOSITION 5: A small R&D subsidy 
by both countries at a common rate increases 
the rate of long-run growth in the world econ- 
omy. 


This proposition is not surprising, and cor- 
responds to a similar result for the closed 
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economy derived by Romer (1990). Since 
R&D represents the only source of gains in 
per capita income in our model, stimulation 
of this activity promotes growth. 

What is more interesting, perhaps, is the 
effect of a small R&D subsidy in a single 
country. As for bilateral subsidies, a unilat- 
eral subsidy promotes growth by bringing 
more resources into product development 
in the policy-active country. But now, rela- 
tive final-good prices change, so the spend- 
ing shares in (35) must be allowed to vary 
unless the utility function has a Cobb- 
Douglas form. Depending on whether the 
elasticity of substitution between final prod- 
ucts exceeds or falls short of one, this in- 
duced change in the pattern of spending 
can be conducive to or detrimental to 
growth. Moreover, an R&D subsidy in a 
single country will alter the relative shares 
of the two countries in product develop- 
ment. If the subsidy is introduced by the 
country that is relatively less efficient at 
performing R&D, this effect too can im- 
pede growth. In Appendix D of our 1989a 
working paper we show, by means of a 
numerical example using a Cobb-Douglas 
utility function, that an R&D subsidy intro- 
duced by the country with comparative dis- 
advantage in R&D might (but need not) 
reduce the world’s growth rate. We also 
prove in Appendix C that, for the case of 
constant spending shares, an R&D subsidy 
must encourage growth if it is undertaken 
by the country with comparative advantage 
in R&D. Thus we have the following: 


PROPOSITION 6: The provision of a sub- 
sidy to R&D in one country increases long-run 
growth if spending shares on the two final 
goods are constant and the policy is under- 
taken by the country with comparative advan- 
tage in R&D. Otherwise, the long-run growth 
rate may rise or fall. 


Our results here on the long-run growth 
effects of government policy have no imme- 
diate normative implications, both because 
we perform only steady-state comparisons 
and because the initial level of utility may 
differ along alternative growth paths. But in 
our 1989b paper we conduct a complete 
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welfare analysis for a small country with 
endogenous growth generated by technolog- 
ical progress as here. We find that the 
market determined rate of growth is subop- 
timally low due to the presence of the.non- 
appropriable spillovers in the process of 
knowledge generation. An R&D subsidy 
that speeds growth improves welfare for the 
small country until some optimal growth 
rate is achieved. Further increases in the 
subsidy rate reduce welfare even as they 
accelerate growth. We show also that trade 
policy need not raise welfare, even if it 
successfully speeds growth. The explanation 
for this lies in the presence of a second 
distortion in our economy, one that stems 
from the pricing of middle products above 
marginal cost. The mark-up pricing prac- 
ticed by intermediate producers gives rise to 
suboptimal use of middle products in the 
production of final goods. Trade policy that 
accelerates growth but reduces the output 
of middle products in the general equilib- 
rium can be detrimental to welfare. Con- 
versely, commercial policy that augments 
the output of middle products can improve 
welfare even if the growth implications of 
such policy are adverse. 

These results do not carry over immedi- 
ately to the current environment, though 
the considerations we found to be relevant 
in our 1989b paper are certainly applicable 
here as well. Normative analysis in the 
present two-country setting is complicated 
by the terms-of-trade effects that arise when 
policy alters the relative price of the final 
goods, changes the price of intermediates (if 


sectoral trade in middle products is not 


balanced), or varies the interest rate. Be- 
cause of these various terms-of-trade effects 
we have been unable thus far to find simple 
conditions under which growth-enhancing 
policy by one government raises that coun- 
try's welfare. 


V. Lags in the Diffusion of Knowledge 


We have assumed all along that research 
and development creates as a by-product an 
addition to the stock of knowledge that fa- 
cilitates subsequent R&D. Moreover, we 
supposed that the knowledge so created be- 
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comes available immediately to scientists 
and engineers worldwide. We now relax the 
latter assumption, in recognition of the fact 
that privately created knowledge, even if 
nonappropriable, may enter the public do- 
main via an uneven and time-consuming 
process. Also, since legal and cultural barri- 
ers may inhibit the free movement of peo- 
ple and ideas across national borders, we 
shall allow here for the possibility that in- 
formation generated in one country dissem- 
inates more rapidly to researchers in the 
same country than it does to researchers in 
the trade partner country. After extending 
the model we will reconsider the eifects of 
trade policies on the steady-state rate of 
growth. . 

In place of our earlier assumption that 
world knowledge accumulates exactly at the 
rate of product innovation (equation (9)), 
we suppose now that R&D expenditures 
contribute to country-specific stocks of 
knowledge according to 


(9) Kr) =A, fo eM-Pn fr) 


+A; f ero) dr, 


where K;(t) is the stock of knowledge capi- 
tal at time t in country i; With this specifi- 
cation, the contribution of a particular R&D 
project to general knowledge is spread over 
time. At the moment after completion of 
the project, none of its findings have perco- 
lated through the scientific and professional 
community. After an infinite amount of time 
has passed, the R&D project makes, as be- 
fore, a unit contribution to knowledge. Af- 
ter finite time, the contribution lies between 
these extremes of zero and one, as given by 
the exponential lag structure in (9t). The 
parameters A, and A, (with a, > Àp) distin- 
guish within-country and cross-country rates 
of diffusion. 

The introduction of lags in the diffusion 
of knowledge alters two of the fundamental 
equations of the model. First, (8) becomes 


(8t) A= LaKi/ 054; 
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Second, we have in place of (10), 








(107) Caz 7 Wd, / Kj. 

In a steady state with A =Á, =g, we 

have n,(r)=nAte®"~, so that 

(36) Kj(1)- an ms Lyon 
= G+ —— 5 |n(t 
i A +g ah ) 


| 2 ni(0,,05, g)n(t). 


So in the steady state, knowledge in each 
country is proportional once again to the 
total number of middle products. But the 
factor of proportionality has become coun- 
try-specific and endogenous. This means 
that the steady-state labor-input coefficient 
for R&D in country i, a,,;/p;, also is 
endogenous; that is, relative productivity in 
R&D depends now not only on relative 
natural abihties in performing this activity, 
but also on relative cumulative experience 
in research, as summarized by the o;'s. This 
consideration leads us to draw a distinction 
henceforth between natural and acquired 
comparative advantage in R&D. 

From (36) we see that when A, = Af > 
(ie., when diffusion lags are very short), 
HQ,7 43-1, and the extended model re- 
verts to the earlier formulation. For A, = A, 
finite, 4, = >, so that the ratio of the natu- 
ral-plus-acquired productivity parameters 
for each country is the same as for the 
natural productivity parameters alone. In 
this case, the pattern of comparative advan- 
tage cannot be. reversed by endogenous 
learning, and all results from before con- 
tinue to apply. We concentrate here on 
cases in which the rates of diffusion are 
unequal but the difference between them is 


small.? 


PA large difference between the within-country and 
across-country rates of diffusion may imply that, in the 
steady-state equilibrium, all R&D is carried out by one 
country. Such specialization, which is common in mod- 
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We derive the long-run effects of trade 
policy in the extended model using equa- 
tions (31)- (33) and (35), but with S, = S, =1 
(no R&D subsidies), with b; replaced by 
b,;/u7 (natural plus acquired comparative 


advantage in place of just natural compara- - 


tive advantage), and with A; replaced by 
h;u; (natural plus acquired effective labor in 
place of natural effective labor). For clarity 
of exposition, we shall also assume for the 
remainder of this section that the spending 
shares s; are constant. This assumption is 
valid when u(-) takes a Cobb-Douglas form. 

The new elements that diffusion lags in- 
troduce to the analysis of policy stem from 
the effects of relative size and demand-side 
bias. Before considering these new aspects, 
let us suppose that labor forces are equal 
and demand for the two final goods is sym- 
metric. By totally differentiating the system 
of steady-state equations (see Appendix E 
of our 1989a working paper), we establish 
the following: 


PROPOSITION 7: Suppose L,=L,, s,— 
$5, Gp y1 7 Gp y2, and Àp — A, > 0, but small. 
Then a tariff on imports of final goods in 
country i raises the long-run growth rate if a 
only if Ay ni > à, ,j. 


In this case, the effects of acquired com- 
parative advantage necessarily reinforce 
those of natural comparative advantage. The 
country that is relatively more productive in 
creating new blueprints will attain, in the 
steady-state equilibrium prior to the intro- 
duction of policy, a majority share of the 
world’s middle products. By its greater con- 
centration in R&D, it will gain more experi- 
ence in research and attain a higher 
steady-state stock of knowledge. Thus, the 
effects of learning will augment its initial 
comparative advantage in R&D. When pol- 
icy is introduced in one country or the other, 





els with a national component to increasing returns to 
scale, necessarily occurs here if static preferences are 
Cobb-Douglas and A, = 0 (i.e., all spillovers are inter- 
nal). Then the equations that we have developed to 
describe the steady-state equilibrium (which presume 
non-specialization in each country) would not be valid. 
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the implications of the dynamic resource 
reallocation for the global efficiency of R&D 
will be all the more significant. 

Now suppose that the two countries differ 
initially only in (effective) size, as measured 
by A;. Recall that with equal rates of diffu- 
sion, a small tariff in either country does not 
affect the long-run rate of growth. Now, 
however, we find the following: 


PROPOSITION 8: Suppose b, = b}, s, s; 
and À,— Ag 2 0, but small. Then a small 
tariff on imports of final goods raises the 
long-run. growth rate if and only if the policy is 
introduced by the country with the relatively 
smaller effective labor force. 


Here, the larger country will come to 
acquire comparative advantage in R&D, 
though it starts with none. The reason is as 
follows. With differential rates of diffusion, 
knowledge takes on the characteristics of a 
local public good. The larger country will 
have more (effective) scientists to benefit 
from this nonexcludable good as its share in 
world R&D exceeds one half. So it acquires 
over time a relatively larger knowledge base 
and hence a relatively more productive corps 
of researchers. Trade policy that serves to 
divert resources away from the R&D sector 
in the larger country once comparative ad- 
vantage has been established must be detri- 
mental to growth. 

The effects of demand-size bias are simi- 
lar. The country whose good is in relatively 
greater demand must devote relatively more 
of its resources to final-goods production. 
Thus, its R&D sector initially will be 
smaller. This country develops over time a 
comparative disadvantage in R&D, as its 
learning lags that in its trade partner coun- 
try. Protection in this country will improve 
world efficiency of R&D and thereby speed 
growth. 

Once we allow for lags in the diffusion of 
scientific knowledge and differential speeds 
of diffusion within versus between coun- 
tries, we find a richer set of possibilities for 
the long-run effects of trade policy. Com- 
parative advantage continues to play a criti- 
cal role in determining whether policy in 
one country will speed or decelerate growth. 
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But comparative advantage now must be 
interpreted with care, because it reflects not 
only natural ability but also the (endoge- 
nous) benefits from cumulative experience.'^ 


Since steady-state productivity in R&D > 


varies positively with the size of the R&D 
sector, all determinants of the equilibrium 
allocation of resources to this sector come 
to be important in the analysis of policy. 


VI. Conclusions 


In this paper, we have analyzed a dy- 
namic, two-country model of trade and 
growth in which long-run productivity gains 
stem from the profit-maximizing behavior of 
entrepreneurs. We have studied the deter- 
minants of R&D, where research bears fruit 
in the form of designs for new intermediate 
products and in making further research 
less costly. New intermediate products per- 
mit greater specialization in the process of 
manufacturing consumer goods, thereby en- 
hancing productivity in final production. In 
order to highlight the role of endogenous 
technological improvements as a source of 
growth we have abstracted entirely. from 


factor accumulation. But Romer (1990) has | 


shown that capital accumulation can be in- 
troduced into a model such as the one we 
have studied without affecting the analysis 
in any significant way. 

The interesting features of our analysis 
arise because of the assumed presence of 
cross-country differences in efficiency at R&D 
and manufacturing. Considerations of com- 
parative advantage in research versus manu- 
facturing of intermediate gcods bear im- 
portantly on the implications of economic 
structure and economic policy for long-run 
patterns of specialization and long-run rates 
of growth. We find, for example, that growth 
in world resources or improvements in R&D 
efficiency need not speed the rate cf steady- 


Endogenous comparative advantage also plays a . 


central role in Krugman's (1987) analysis of commod- 
ity-specific learning-by-doing. There, as here, produc- 
tivity increases with cumulative experience. But each 
good is produced in only one country in Krugman's 
model, so long-run comparative advantage is fully de- 
termined by the initial pattern of specialization. 
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state growth, if those changes occur pre- 
dominantly in the country with. comparative 
disadvantage in R&D. 


Concerning policy, we find for the first 


time a link between trade intervention and 
long-run growth. Any (small) trade policy 
that switches spending toward the consumer 
good produced by the country with compar- 
ative advantage in R&D will cause long-run 
growth rates to decline. Subsidies to R&D 
will accelerate growth when applied at equal 
rates in both countries, but need not do 
so if introduced only in the country with 
comparative disadvantage in R&D. When 
knowledge spillovers occur with a time lag 
and diffusion is faster within the country of 
origin than across national borders, compar- 
ative advantage becomes endogenous. Once 
we recognize that comparative advantage 
can be acquired as well as natural, we find a 
role for country size and demand-size bias 
in determining the long-run effects of pol- 
icy. 

Our emphasis on comparative advantage 
in research and development highlights only 
one channel through which trade structure 
and commercial policy might affect long-run 


growth. In other contexts, the trade envi- . 


ronment might influence the rate of accu- 
mulation of human capital or the rate at 
which a technologically lagging (less devel- 
oped) country adopts for local use the exist- 
ing off-the-shelf techniques of production. 
Investigation of the links between trade 
regime and these other sources of growth 
seems to us a worthy topic for future re- 
search. 
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The Fundamental Determinants of the Terms of Trade 
Reconsidered: Long-Run and Long-Period Equilibrium 


By WitLiAM Dariry, Jg.* 


Ronald Findlay’s analysis of the long-run equilibrium (uniform international 
growth rates) determinants of the terms of trade, in the context of North-South 
models of trade and growth, is reconsidered when the North is a Keynesian and a 
Kaleckian economy. Also examined are the determinants of the terms of trade in 
long-period equilibrium (unifcrm profit rates), so that the consequences of capital 
mobility are accounted for. Two surprising results are noted: first, the long-run 
terms of trade when the North is a Kaleckian economy will be independent of the 
North’s markup and, second, in the long-period it is theoretically possible for the 
North to raise its markup but experience a deterioration in its own terms of trade. 


` (JEL 110, 411, 441) 


In two important papers, Ronald Findlay 
(1980, 1981) undertook an extensive exami- 
nation of the “fundamental determinants” 
of the terms of trade —fundamental, in the 
sense, that these are real factors, in dy- 
namic equilibrium, governing the relative 
price at which commodities exchange in in- 
terregional trade. While Findlay (1980, pp. 
293-95; 1981, pp. 442-44) also explored 
short-run factors influencing the terms of 
trade when they function to clear trade bal- 
ances, the more intriguing aspects of his 
papers concern the factors dictating the 
terms of trade from the long view. Findlay 
sought to go beyond identification of the 
short-run direction of change of the terms 
of trade, looking instead for the central 
value toward which they must ultimately 
gravitate. 

To identify the long-run value of the terms 
of trade, Findlay followed the structuralist 
tradition of investigating their determinants 
in the context of a North-South trade model. 
Here the North is the more affluent, indus- 
trial region, while the South is the poorer, 
predominantly agrarian region. The North 


* Professor of Economics, University of North Car- 
olina at Chapel Hill. I am grateful to André Burgstaller, 
Patrick Conway, Amitava Dutt, Bobbie L. Horn, Marc 
Jarsulic, and especially several anonymous referees for 
exceptionally useful comments and criticisms. 
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exports manufactures and the South exports 
primary products, establishing an asymme- 
try between the regions via the international 
pattern of specialization. 

One can presume that the pattern arises 
from old-fashioned comparative advantage. 
However, the dynamic effects of such a pat- 
tern of specialization need not be salutary 
for Southern growth rates or for the South- 
ern terms of trade. Prebisch (1959) and 
Singer (1950) both theorized that the South 
typically would experience slower growth 
and deteriorating terms of trade, given such 
an international pattern of specialization. 
Moreover, for Prebisch and Singer, declin- 
ing terms of trade served as a virtual index 
of divergent growth between South and 
North. While they argued for a secular de- 
cline in primary product exporters' terms of 
trade, the theoretical backdrop to their 
proposition was essentially short run in 
character, rooted in the exercise of 
monopoly power by Northern producers as 
the terms of trade adjusted to bring about 
trade balance. They did not advance a the- 
ory that was long run, in an analytical sense, 
to correspond to their proposition about the 
secular movement in the terms of trade.! 


‘or course, there has been a vast amount of contro- 
versy about the empirical validity of the Prebisch-Singer 
hypothesis. Findlay (1981) conveniently has summa- 
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Findlay’s endeavor to characterize the 
long-run terms of trade suggests that they 
have an eventual “resting point.” The short- 
run or secular trends might have an ulti- 
mate position of equilibrium, an equilibrium 
whose characteristics can inform us about 
the "fundamental determinants" of the 
terms of trade. Short-run factors that have 
the terms of trade equilibrating the trade 
balance are more ephemeral, in part be- 
cause, from a Classical perspective, the in- 
teraction of supply and demand only offers 
a value consistent with market clearing (a 
“temporary” equilibrium) but not a long-run 
or long-period position (or dynamic equilib- 
rium). 

In what follows, fundamental or dynamic 
equilibrium analyses of the terms of trade 
are examined in two parts: first, an analysis 
and extension of the Findlay-styled explana- 
tion of the long-run terms of trade and, 
second, an alternative long-period explana- 
tion of the terms of trade. The distinction 
between the long run (or uniform growth 
rate condition) and the long period (or uni- 
form profitability condition) will be ex- 
plained in greater detail below. 


I. The Terms of Trade in the Long Run 


The Findlay (1980, pp. 291-93; 1981, pp. 
439—54) explanation for the long-run terms 
of trade is derived from his version of a 
North-South trade model. Findlay postu- 
lates that the North, producing a manufac- 
turing export that serves as an. investment 
good for both regions, is growing neoclassi- 
cally in smooth Solow fashion. The South, 
in contrast, is a Lewis unlimited labor econ- 
omy producing a primary product. Aggre- 
gate production functions are character- 


rized the panoply of objections raised on empirical 


grounds in one place. Sarkar (1986) provides a recent 
attempt to defend the Prebisch-Singer hypothesis on 
the basis of the evidence. Concern here is not with the 
empirical case for or against Prebisch and Singer's 
proposition but is devoted exclusively to the question 
of whether or not there is potential theoretical support 
for their hypothesis from the perspective of North- 
South trade that takes the long view. 
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istically neoclassical in both regions, with 
well-defined marginal products in both la- 
bor and capital. 

Findlay also makes Kalecki’s extreme 
(1971) assumption in the South: all savings 
are from profits; all wages are consumed. 
The propensity to save out of profits in the 
South is positive (and large); the propensity 
to save out of wages in the South is zero. 
The Southern profit rate, p,, is defined as 


(1) p, O7'(k,) 
| 62PJP,, 


where 8 is the terms of trade, the rate at 
which manufactures exchange per unit of 
primary products; mz'(k,) is the marginal 
product of capital where the function m is 
the intensive form of the South's aggregate 
production function; and k, is the capital- 
employment ratio in the South. 

If c is the savings rate out of profits in 
the South, under the extreme Kaleckian as- 
sumption, the South's growth rate, g,, will 
be 


(2) 8, 7 007'(k,). 


The condition of unlimited labor in the 
South fixes the real wage in terms of pri- 
mary products at W. Profit maximization by 
Southern employers yields the capital-out- 
put ratio, k*, in the South from the follow- 
ing equation that sets the South's real wage 
equal to its marginal physical product of 
labor: 


(3) wks) — km (kf) =. 


The Northern growth rate, gy, given Solow 
steady-state growth must be at the exoge- 
nous natural rate, ny: 


(4) En 7 Dy. 


Now Findlay (1980, p. 293; and 1981, pp. 
441-42) identifies the long run as a condi- 
tion where the two region's growth rates 
become the same, or, for "steady-state equi- 
librium in the world economy,” g, = gy. This 
constrains the South to save (and accumu- 
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late) at the North’s natural rate of growth. 
Uniformity of growth rates yields Findlay’s 
expression for the long-run terms of trade: 


om'(k*) 

This is a very compact and interesting 
result. Findlay's equation for the terms of 
trade under balanced growth worldwide has 
it that the terms of trade are governed by 
the natural rate of growth in the North, the 
savings rate out of profit income in the 
South, technical conditions in the South's 
export sector, and, implicitly, the real wage 
in the South. As Findlay (1981, p. 442) 
observes, the provocative feature of this re- 
sult is that the terms of trade "[are] com- 
pletely independent of [the] production 
function and [the] propensity to save of the 
North and of the demand for imports in the 
two regions." The terms of trade function as 
the relative price that adjusts to bring about 
balanced growth internationally. 

Findlay's result can be considered from a 
different angle that will prove useful in the 
ensuing discussion. For macroeconomic 
equilibrium in the world economy, the fcl- 
lowing condition must be met: 


(6) (gi - eS)u * (81—83)(1-u) 20 


us Ky./K. 


Condition (6) says that a weighted sum of 
the difference between each region's invest- 
ment and savings rates must equal zero. 
Weights are given by each region's share in 
the available capital supplied by the North. 
Condition (6) is equivalent to the statement 
that, in a global sense, investment must 
equal savings.? 


?World macroeconomic equilibrium requires that 
the international sum of investment and sum of savings 
must cancel, or Uy — Sp) +U, — $,) 0, where I 
N, S) is investment in each region and S,i= N, S) is 
savings in each region. The global macroeconomic 
equilibrium condition can be transformed into growth 
rates by multiplying each bracketed term as follows: 
Uy - Sy) Ky /Ky +U, - SK, /K,= 0. Since L/K; 
=g] and S,/K,;= 983, the preceding expression can 
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There are two circumstances under which 
(6) will hold. First, the investment and sav- 
ings rates could equalize separately in the 
North and the South. Second, the two re- 
gions could run offsetting “surpluses” and 
“deficits,” with the surplus region transfer- 
ring capital to the region experiencing the 
deficit. In the first case, it is reasonable to 
treat the capital or manufactured good as 
internationally immobile after installation in 
either region; capital is treated as fixed. In 
the second case, the effects of capital mobil- 
ity must be taken into account; capital can 
be treated as circulating. 

From the standpoint of condition (6), 
Findlay adopts the first type of circum: 
stance, assuming for the North that g} = gs 
and for the South that g/ = g?. The North’s 
saving rate, given its character as a Solow 
economy, is given by sa(k,), where 
s is the propensity to save, the function 
a(ky) is the average product of capital, and 
ky is the capital intensity in North's pro- 
duction. The North's investment rate, im- 
plicitly, is given by its natural rate of growth, 
ny. The South, sharing the North's growth 
rate in the long run, must also invest at rate 
ny, While the South's saving rate, as noted 
above, is dictated by 67’(k*). It is equal- 
ization of investment and savings rates in 
the South that provides a basis for deriva- 
tion of Findlay's result for the long-run 
terms of trade, 0* = ny / aa (k*). 

Findlay's assumption of full employment 
growth in the North eliminates Northern 
savings behavior and technology from being 
determinants of the long-run terms of trade. 
However, with unemployed labor in both 
regions, matters can change sharply, al- 
though demand elasticities will remain irrel- 
evant. Note also in what follows, similar to 
Findlay, Laursen-Meltzer (1950) effects are 
ignored—so that savings rates in both re- 
gions do not depend upon the terms of 





be written as gf. — g§) Ky t (g] — g5)K, — 0. Defining 
û= Ky, /K, the North's share in the global capital 
stock, means that we can obtain the expression in 
equation (6): (gh — gnu t (Gg! — g5)4-—u)=0. This 
equilibrium condition undergirds much of Taylor’s 
(1983) work on North-South trade and growth. 
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trade. In fact, savings rates are assumed 
fixed and immutable in both North and 
South for posterity. 

After Sen (1963) and Taylor (1981; 1983, 
p. 181), the North can be given a more 
Keynesian cast by dropping the assumption 
of full employment /natural rate growth and 
introducing, instead, an independent invest- 
ment function. Northern investment can be 
treated as an increasing function of the 
North’s rate of profit, p,: 


(7) sj-igtiüp,-—igtif'(ky). 


The parameters i, and i, are positive 
constants, and f'(k,) is the marginal prod- 
uct of capital in the North. Setting invest- 
ment equal to savings in the North yields 
the equilibrium value for the North's capital 
intensity, kğ *: 


(8) sl-2$--igti f'(kä*)=sa(kğġ*). 


In Figure 1 the determination of k$* is 
displayed visually. To obtain an economi- 
cally meaningful solution for k$*, the sav- 
ings schedule must intersect the investment 
schedule from above.? 

Long-run equilibrium necessitates that the 
South grow at the same pace as the North. 
Therefore, either ij +i, f(k$*) or sa(kġ*) 
has to be set equal to the South's invest- 
ment rate, and then, in turn, the South's 
saving rate must adapt to the North's growth 
rate via adjustment in the terms of trade. 
Using sa(k#*) for the North's growth rate, 
the long-run terms of trade now will be- 
come 


sa(kx*) 

(9) MW 
on'( ks) 

Note that the equilibrium value of the 

North’s capital intensity, k5$* will not, in 

general, be the same as the capital intensity 


rf, for example, the intensive form of the North's 
production function were Cobb-Douglas or f(ky) = kx, 
the equilibrium value of the capital labor ratio would 
be k#* =Lig /G — ia) / Ol To ensure that &;** is 
a positive number, s>i,a, or the savings schedule 
must be steeper than the investment schedule. 
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FIGURE 1 


that prevailed under Solow full-employment 
growth, k$, and sa(k$*) will not, in general 
be the same as the natural growth rate, ny. 
Both savings propensities now matter —o 
out of profits in the South and s out of total 
income in the North—in establishing the 
long-run value of 6. Technical conditions in 
both regions also matter now. Again im- 
plicit in the determination of k* is the 
Lewis South's fixed real product wage. Cor- 
respondingly, determination of k#* in the 
Keynesian North means that implicitly the 
long-run terms of trade also depend upon 
the parameters of the investment function. 

Keep in mind that only profits are saved 
in the South, while some of both categories 
of income are saved in the North. Even this 
latter asymmetry need not be maintained if 
we generalize the North's savings function 
in Kaldor (1955) fashion. Suppose there are 
different propensities to save out of profits 
and wages in the North, s, out of profits 
and s, out of wages. For the time being 
retain Findlay's assumptions that the rate of 
profit can be identified with the marginal 
product of capital. The North's savings rate 
will be 


(10) gy — (s, 7 s) fk) + sy a( ky). 


If we maintain the assumptions that savings 
and investment separately must equalize in 
each region, that North and South grow at 
the same rate, and that the Keynesian North 
has an independent investment function of 
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the form ig i, f (ky), the long-run terms 
of trade will be 


$,—s,) P (kS) + s,a(k* 
(1) pra ez SO PU scan) 
on'(k*) 
In the special case where s,, = 0, again the 
extreme case of a Kalecki (1971) savings 
function, (11) simplifies to 


Los Pr) 


Now the factors influencing the terms of 
trade in both regions appear to be symmet- 
rical—the invariant savings rates out of 
profits and production functions or, some 
might say, tastes (for foregone consump- 
tion) and technology. The remaining asym- 
metry involves the special role of the North's 
investment function, which must become, 
de facto, the South's investment function 
because steady-state growth in the world 
economy means the South’s growth rate 
conforms to the North's.* 


JI. Markup Pricing in the North and the 
Long-Run Terms of Trade 


Thus far, the results examined have been 
derived under the assumption of neoclassi- 
cal production functions and marginal prod- 
uct factor pricing in both regions. And, thus 
far, the results derived are consistent with 
any economically meaningful distribution of 
international capital between North and 
South; that is, u can take any value between 
zero and one. How might the long-run terms 
of trade be characterized under alternative 
assumptions about technology and distribu- 
tion theory? | 

To push back toward the types of asym- 
metries that enliven North-South models, 
treat the North as following the dictates of 


“There is still a further difference that lurks just 
below the surface. Northern output is a basic good, 
while Southern output is nonbasic in a Sraffa (1950) 
sense. I am grateful to André Burgstaller for pointing 
this out to me. 


SEPTEMBER 1990 


a markup pricing rule in Kaldor-Kalecki- 
Weintraub fashion (see Reynolds, 1987, for 
a useful exposition). This also corresponds 
to Prebisch (1959) and Singer’s (1950) idea 
that the manufactured goods producer exer- 
cises monopoly power in pricing. It also is 
similar to Taylor (1981), who confined 
markup pricing to the North. 

With a markup, my, over unit costs, the 
North’s manufactured goods’ price will be 


(13) Py =(1+ my) [(wyLy/K) 


+ (PyKy/K)]. 


The markup is assumed to be stable for 
institutional reasons discussed in Reynolds 
(1987, pp. 37-62)? 

In (13) P, is the price of the North's 
good, the capital good; w, is the nominal 
wage; Ly is the level of employment; Y, is 
the real quantity of the North's output; and 
Ky is the North's capital stock. 

If the North’s profit rate is defined as the 
ratio of the surplus above expenditures to 
the value of the capital stock, using (13) 
markup pricing implies that the Northern 
rate of profit will be 


my(wyLy + Py Ky) 








14 = 
( ) Pn Py Ky 
My 1 
= — =M 
sells ida 
o my 
u=Ky/K_ My e LL. 
N 


The term My in the numerator on the far 
right-hand side of (14) is the ratio of the 
markup over one plus the markup; the term 


>The markup pricing mechanism often is introduced 
in analyses where firms are presumed to have excess 
capacity. Here, however, it is assumed that although 
there is generally less than full employment of labor, 
the capital stock is utilized fully. Peter Reynolds (1987, 
p. 54) suggests that whereas markup pricing could give 
way to supply and demand determination of prices 
once full capacity is reached, “this is the exception 
rather than the rule" in Kaleckian analysis. 
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u in the denominator is, again, the North’s 
share in the total available capital. 

Now, if North’s investment increases with 
profitability, the investment function will 
take the following. form: | 


(15) Bh =i +i py =ig ip (My/u). 


Assume, further, that savings in the North 
are exclusively drawn from profits so that 


(16) EN 755p = Sy (My /u). 


When investment and savings rates equal- 
ize in the North, they no longer determine 
the North's capital intensity. Instead, they 
determine the North's share in global capi- 
tal. Equality between the right-hand sides of 
(15) and (16) yields the following equilib- 
rium value for the Northern proportion of 
international capital: 





Note, again, for an economically meaningful 
value of u the savings schedule must be 
steeper than the investment schedule. 

Now if the South is constrained to grow 
at the North's growth rate to achieve long- 
run equilibrium, then the North's growth 
rate, s, CMy / u*), will serve as the South's 
investment rate. To have g/=g> means 
that 


(18) s, (My /u*) e 00m (7), 


where, again, the terms of trade adjust to 
achieve the equality. 

Substitution of (17) into (18) throws up an 
intriguing result for the long-run terms of 
trade: 

Si ; 
19)  e*- pe 
0D "7 Gs ipe (e) 





Now the terms of trade depend on the 
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parameters of the savings and investment 
functions in the North as well as the savings 
propensity, technical conditions, and the 
real wage in the South. The numerator is 
the product of the North's propensity to 
save out of profits and the autonomous 
component of the North's investment func- 
tion. What is striking is the independence of 
the long-run terms of trade from the North's 
markup. Of course, the size of the markup 
does affect directly the share of capital lo- 
cated in the North. A higher markup, 
amounting to a regressive redistribution of 
income against labor in the North, would 
mean a higher equilibrium value for u. 

Up to this point, condition (6) has been 
met by assuming that investment matches 
savings separately for each region and that 
the South shares the North's growth rate. 
This need not be the case, however. Indeed, 
it will not be the case if the South also has 
an independent investment function of a 
similar type as the North's. 

Investment in the South can be specified 
as an increasing function of South prof- 
itability in parallel fashion with the North's 
investment function: 


(20) gi - hy c hip, — ho  h9m' (k?). 


Since k* is determined by profit maximiza- 
tion and the presence of a fixed real wage in 
the unlimited labor South, it is only by 
sheer accident that the parameters of the 
South's investment and savings schedules 
will align to equalize g! and g?. Conse- 
quently global macroeconomic equilibrium 
will hold when the following condition ob- 
tains: 


(21) [iou P= s,)(My)] 
+ [hg + (h — 0) Om'(k#)](1—u) =0, 


where investment and savings are not sepa- 
rately equal in each region. — 

The difficulty now is that the system is 
overdetermined. Condition (21) offers a sin- 
gle equation, but there are two unknowns, @ 
and u. Moreover, condition (21) implies a 
transfer of capital from one region to the 
other, so that even after installation, the 
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available capital is mobile. It is natural to 
assert that the available capital shifts in 
response to profit rate differentials between 
the regions. It also is natural to consider the 
North to be the “surplus” capital or excess 
savings region, so that the term [igu t (i, — 
$5) My] is negative, while the South is the 
"deficit" capital region or excess investment 
region, so that the term [ho + (A, — 
o)0«'(k*)] is positive. This suggests an al- 
ternative characterization of dynamic equi- 
librium developed in the next and final sec- 
tion of the paper. 


III. The Terms of Trade in the Long Period 


: Findlay’s long run is one of balanced 
steady-state growth. There is, however, at 
least one alternative procedure for charac- 
terizing equilibrium growth fram a long view. 
This is the long-period concept from Classi- 
cal political economy (Ricardo) where equi- 
librium prevails when rates of profit become 
the same in all activities. In place of uni- 


Implicitly, the model of Arthur Lewis’s (1969, pp. 
17-22) Wicksell lectures — the same as the model of 
trade between the temperate zone and the tropics near 
the end of his (1954, pp. 182-84) justifiably famous 
paper on growth with surplus labor—adopts a uniform 
profit rate condition to characterize the equilibrium 
terms of trade. The international profit rate simply 
equalizes at zero since he assumes the perfectly com- 
petitive zero profit condition. His qualitative results 
would not alter if he assumed positive profit rates as 
long as, once again, they are uniform across all sectors 
(see Darity, 1989). . 

Stable relative productivities be-ween North and 
South agriculture and North and South export com- 
modities lead to a determinant equi:ibrium expression 
for the terms of trade. If productivities are changing 
but the pattern of specialization is stable, then Pre- 
bisch-Singer results can follow. Lewis (1969; also see 
Findlay, 1981, p. 433) asserts that empirically produc- 
tivity has risen fastest in developed country agriculture, 
next fastest in developed country manufacturing, slower 
in developing country primary product making, with 
virtually no technical progress in ceveloping country 
agriculture. 

David Evans (1987) says that the empirical case 
does not unambiguously favor Lewis's claims of a sec- 
toral bias in technical change in favor of temperate 
zone (Northern) agriculture. Evans also considers de- 
mand-side effects, asserting that a key role must be 
assigned to Engel curve effects in the determination of 
the terms of trade. However, Bardhan (1982) has sought 
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formity of growth rates between North and 
South, or the condition gy = g,, substitute 
uniformity of profit rates, or py = p,. 

These conditions are not generally com- 
patible. If both regions possess independent 
investment functions that are not 1dentical 
and propensities to save that are not identi- 
cal, uniformity of profit rates will not corre- 
spond to uniformity of growth rates.” A 
global economy in long-period equilibrium 
can display persistent divergence in growth 
rates between North and South—a dynamic 
equilibrium with uneven development. 





to demonstrate that the basic Lewis results remain 
intact even when demand considerations are brought 
on the scene in a full general equilibrium treatment. 
As Lewis (1969, p. 22) himself observed, "the terms of 
trade are tied rigidly by productivities only for so long 
as it pays both countries to produce both food and one 
other commodity. As soon as productivities move out- 
Side these limits, the terms of trade cease to be tied 
rigidly." 

André Burgstaller (1987) recently has provided a 
clever speculative history (and forecast?) of how the 
terms of trade might unfold under various scenarios as 
the pattern of specialization alters between “England” 
and "India." In Burgstaller's long run—the fifth and 
final stage of his Classical model of trade and develop- 
ment—India becomes the industrial center and Eng- 
land an agrarian nation, reversing their initial roles. Of 
course, the terms of trade continuously will move 
against the "South," but here the South formerly was 
the North! 

In general in a Findlay long run, rates cf profit will 
not equalize. For example, in Findlay's case when 
0* — ny / ar (k*), the rate of profit in the South sim- 
plifies to the ratio n y/o. This will only be equal to the 
North's rate of profit, f’(k%) by a conscious and nonar- 
bitrary selection of values for ny and c. 

Or, consider that since u = Ky/K js the, North's 
share in world capital, 4—(1— uXKy — K,). The 
North's capital share will become stationary (or 2 — 0) 
under two circumstances. Either u=1, implying that 
the capital good is used exclusively in the North and 
South production disappears altogether, or Ky = K,, 
implying that the Findlay long-run condition of bal- 
anced international growth must hold. When each re- 
gion has independent investment functions dictating 
their rates of growth, that is, Ky = ig +ijpy and K,= 
ho + hips, growth rates only will coincide when profit 
rates diverge—-unless the investment functions happen 
to be identical for North and South. Therefore, in this 
final section an alternative specification for the 2 equa- 
tion is advanced, consistent with the notion that capital 
continues to be moved between regions until profit 
rates equalize. 
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The shift rule for movements of available 
capital will be based upon a profitability gap 
between the regions. The variable u has 
been defined as the ratio K,/K, the 
North's share of world capital. If capital is 
fully mobile, this capital share will vary in 
response to the profit rate differential be- 
tween the regions. 

Therefore, the percentage rate of change 
of u, represented by the careted variable 
below, responds positively when py > p, and 
negatively when py « p,. The international 
distribution of capital ceases to change when 
profit rates become uniform, so that u takes 
on a fixed value. Therefore, the dynamic 
equation capturing the presence of capital 
mobility is 


(22) 


5Burgstaller and Saveedra-Rivano (1984)—also dis- 
cussed in Findlay (1984, pp. 226—-29)— modify the 
Findlay North-South model and introduce capital mo- 
bility. They manage to construct a model where simul- 
taneously long-run and long-period conditions obtain. 
For their version of a world with a Solow North and a 
Lewis South, the terms of trade must satisfy both 
Findlay’s long-run condition of balanced international 
growth and also a long-period condition that the terms 
of trade must equal the ratio of the North's marginal 
physical product of capital to the South's marginal 
physical product of capital. This implies, in turn, that 
the following equality must hold: n/o7m'(k,) = 
f(ky)/m'(k,), which simply means that balanced 
growth must obtain in the North between its labor 
force and its capital stock, a condition Solow growth 
ensures in the first place. 

On the other hand, in Burgstaller (1985) the equilib- 
rium terms of trade are determined solely on the basis 
of the long-period criterion. Capital is treated as a 
wages fund rather than a produced means of produc- 
tion. The mobility of advances to labor across regions 
in response to profit rate differentials assures eventual 
equalization of returns. Under long-period conditions 
Burgstaller derives the following result for the terms of 
trade: 8* = (w,qu)/(wy q,), where each w; is the wage 
rate in the corresponding region and each q; is the 
output-labor ratio in each region. As w; goes up or q; 
goes down, production costs in each region rise. 
Burgstaller directly obtains a strictly Classical result; 
the terms of trade improve as costs of production rise, 
and they deteriorate as costs of production decline. 
With capital as a produced means of production the 
Classical result appears only indirectly; that is, as the 
markup rises in a particular region, labor's share in 
income falls, hence relative labor costs fall, and the 
terms of trade decline for that region. 
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The parameter q is a positive constant that 
can be interpreted as a reaction-response 
coefficient. The larger its value, the more 
rapid the convergence between profit rates; 
in the limit, when q —o the convergence 
would be instantaneous. 

First, consider the setting with indepen- 
dent investment functions in both regions 
under joint marginal product factor pricing. 
The key is how to determine the capital 
intensity in the North, ky, although both 
the assumption of natural rate growth and 
the assumption that g4 = EN; made above, 
have been dropped. Growth in the North's 


.capital stock is governed by the rate of 


investment there, g^. The percentage rate 
of increase of North's employment can be 
considered a negative function of the real 
wage in the North. Therefore, the specifi- 
cation of the percentage growth rate of the 
North's capital intensity is 


(23) ku - Ky - E mi, py 
x(wy / Py)]. 


The parameters x, and x, are positive con- 
stants in what can be interpreted as a dy- 
namic labor demand function. The ratio 
wy / Py is the real wage in the North. 

Under marginal product factor pricing the 
rate of profit, py, will be the marginal prod- 
uct of capital in the North, f'(k,), and the 
real wage, wy, /Py, will be the marginal 
product of labor in the North. An equilib- 
rium value of the capital intensity, k$, can 
be derived from the stationary condition for 
equation (23): 


Ss 


(24) ky e dig i P(k&) — Xo 
+xi[ f(kR)— kSP(k)] =0. 


Figure 2 displays the equilibrium level of 
the North’s capital intensity at the intersec- 
tion of the K n and Ly schedules. 

The equilibrium value of the South’s cap- 
ital intensity still is determined by the pres- 
ence of the fixed real wage there. With the 
equilibrium value of the North’s capital in- 
tensity in hand, it is straightforward to ob- 
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A ^ 
Ky, Uu 





FicunE 2 


tain the long-period terms of trade by find- 
ing the stationary value for equation (22), 
û = 0: 

Q3) — = (ky) /m'(k?). 

When condition (25) is attained there is no 
incentive to transfer capital between the 
regions. The long-period terms of trade, in 
this case, are simply the ratio of the respec- 
tive North and South marginal preducts of 
capital. 

The macroeconomic equilibrium condi- 
tion, (21), now can be used to solve for u, 
yielding the equilibrium international distri- 
bution of capital. The long-period value of 
the share of capital ultimately located in the 
North depends upon all the parameters of 
the investment and savings functions in both 
regions and, by using (25), it depends upon 
the technical conditions in the North: 


ho th - e)f'(k&) 
([ho (hi-o) f (ka) 
= [io " (i, Z sy) f (kn) |} 


The prior assumption that the North is the 
capital surplus region and the South is the 
capital deficit region ensures that u* is posi- 
tive in sign and takes a value between zero 
and unity. 

Next consider the case where the North 
practices markup pricing, exercising some 
degree of monopoly power in trade, while 


* o 


(26) 
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the South continues to be a marginal prod- 
uct factor pricing region. This is, again, the 
analogue to the Prebisch-Singer case that 
emerges from this type of analysis. Under 
asymmetric international pricing of this sort, 
when @ from (22) is set equal to zero, the 
following relationship between 6 and u 
emerges: 


(27) u= My /O'(k*) 


My 8 [my /(1* my)]. 


Now the macroequilibrium condition, 


(21); provides the needed second equation 


in the same two variables. In principle, the 
combination of equations (27) and (21) of- 
fers the route to a long-period solution for 
the value of 0 and u under asymmetric 
output pricing between North and South. 
However, beceuse of the structure of (21) it 
is not possible to derive a closed form solu- 
tion for either 0 or u. The strategy pursued 
here is to use (27) to substitute for u in (21) 
and then totzlly differentiate the expression 
that remains to investigate the partial 
derivative 08 /0My. Does a rise in the 
markup in the North, as Prebisch and Singer 
suggest, improve the terms of trade for the 
North? The relevant expression for assess- 
ing this question is 


(Go — Ag) [1/6*$ (k*)] 
pi PR +(i,-s,)+(e—h,)} 
9My Y {Cio - hy) My / v (k*)] 

*(o - h)[My/(&)]) 


Because the North is the capital surplus 
region, while the South is the capita! deficit 
region, it is reasonable to treat both of the 
terms i,—s, and (o — hj) as negative. If 
the rate of autonomous investment in the 
North is smaller than the rate of au- 
tonomous investment in the South, that is, if 
ig < ho, then a paradoxical result occurs. An 
increase in the degree of monopoly in the 
North leads to a deterioration in the North's 
terms of trade. 

However, if the rate of autonomous in- 
vestment in the North exceeds the rate 
in the South and the product (i, — 


(28) 


H 


1 
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hy 1/67'(k*)] is sufficiently large to make 
the numerator positive while the denomina- 
tor remains negative, it is possible to have a 
circumstance where an increase in the 
North's markup—corresponding to a re- 
gressive redistribution of income—leads to 
an improvement in the North’s terms of 
trade.? This is more likely to be the case if 
the initial long-period value of 0* is smaller 
than unity. The Prebisch-Singer result can 
emerge in the long period, although it is 
absent in the long run, when the terms of 
trade under asymmetric international pric- 
ing are independent of the North's markup. 

Finally, consider the symmetric case 
where producers in both regions possess 
some degree of monopoly power and utilize 
a markup pricing strategy. In this instance, 
while the rate of profit for the North is 
given by (My /u), the South’s rate of pront 
will be 


Q9) p, = M,a/6 
M,=[m,/(1+m,)] 
a 5 K,/y,. 


The term o represents the output-capital 
ratio in the South. To simplify the exposi- 
tion impute a fixed coefficients technology 
to the South, so that a is a constant. 

When the North's share of the global 
capital stock stabilizes, so that ô = 0, now it 
implies the tolong relationship between 
u and 6: 


(30) u — (My / M;)(8/a). 


?The Prebisch-Singer case is the same as the results 
obtained by Amitava Dutt (1988) and Patrick Conway 
and myself (1988) in models of the determination of 
the terms of trade in the short run, that is, when the 
terms of trade serve to bring about balance of trade 
equilibrium. However, this case is the opposite of what 
l obtained in Darity (1989), where I found that the 
paradoxical outcome of a rising markup in the North 
leading to a deterioration in the terms of trade for the 
North in both the long run and long period was the 
general result! This earlier unambiguous finding ap- 
pears to be attributable to the lack .of the explicit 
dynamic structure used in this paper that combines the 
û = 0 equation with the world macroeconomic equilib- 
rium condition. 
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Since again (21) cannot be used to solve 
directly for a closed form solution for 6, we 
pursue a similar strategy to that employed 
in the asymmetric pricing case. After substi- 
tuting 4 from (30) into equation (21), the 
equation is differentiated totally and the 
partial derivatives 06 / 9M, and 06 /0M, are 
derived: 


ag* 





(31) 


{ (in /My)(0*/@) + (i — sp) 
7 [hg + (A - e)of T7 M; (0*/2)]) 
(7 ion ( My /M,) + (a — hy)aM,(1—u*) 
t [ho + (hy — e)o? |[(My / M;)/a]) 








à* 
(32) T 





(C io( My /M?) (0*/a) 
N +- o)(a/0)]0-u)} 
(7 ioo ( My /M,)+ (a —h,)aM,(1- u*) ` 
* [5o 07 7) 02 ][(Mn/M,)/a]} 


Inspection of equations (31) and (32) in- 
dicates that both the Prebisch-Singer out- 
come—a rise in its markup improves a re- 
gion's terms of trade—and the paradoxical 
outcome—a rise in its markup causes a 
region's terms of trade to deteriorate—are 
theoretically possible under a wide range of 
parameterizations. This is especially true 
given the presence of the parameter o, the 
output-capital ratio in the South. 

There are several lingering difficulties. 
Are centers of gravity for the terms of trade 
empirically as well as analytically of inter- 
est? What sets of trading partners actually 
fit the various asymmetries assigned to North 
and South in this discussion? How does 
Lewis's observation about changing relative 
productivities and patterns of specialization 
influence the theoretical outcomes for the 
terms of trade? How do Marshall’s (1920, p. 
601) worrisome “complex actions and rela- 
tions of credit" (and finance) impinge upon 
the conclusions reached here; is either the 
long run or long period truly independent 
of monetary considerations? These ques- 
tions are not Solow's (1956, pp. 93-94) 
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“cobwebs” to be brushed lightly aside; these 
are serious issues to be pursued eagerly in 
further research that extends the search for 
a fundamental explanation of the terms of 
trade between developed and developing 
countries. 
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The Case of the Vanishing Revenues: 
Auction Quotas with Monopoly 


By Kara KrisHNA* 


This paper examines the effects of auctioning quota licenses when monopoly 
power exists. Here the sales of licenses will never raise any revenue if domestic 
and foreign markets are segmented. More surprisingly, the inability to raise 
revenue is shown to persist even when arbitrage across markets is possible as long 
as the quota is not too far from the free trade import level. This suggests that 
existing revenue estimates from auctioning quota licenses, which are based on the 
assumption of competition, are upwardly biased. It also makes it likely that 
quotas implemented by auctioning licenses, even when set optimally, have adverse 


welfare effects. (JEL. 422) 


In this paper I examine the case for auc- 
tion quotas when there is a foreign mo- 
nopolist. A companion paper deals with 
oligopolistic markets.! 

One of the most common criticisms of 
voluntary export restrictions (VERs) and of 
the way quotas are currently allocated is 
that they allow foreigners to reap the rents 
associated with the quantitative constraint. 
It has been suggested that auctioning im- 
port quotas would be a remedy for this. 

A Congressional Budget Office (CBO) 
memorandum estimates quota rents possi- 
ble in 1987 for a group of industries to be 
3.7 billion dollars.^ It compares this to the 
C. Fred Bergsten, Ann Kimberly, Elliott 
Schott, and Wendy Takacs (1987) estimate 
made for the Institute for International 
Economics (IIE) of 5.15 billion. More re- 
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man, M. Ben Zvi, Lawrence Summers, David Tarr, 
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revise the paper, though the usual disclaimer applies. 

'See Krishna (19882). 

? Memorandum of February 27, 1987, from Stephen 
Parker on revenue estimates for auctioning existing 
import quotas (publicly circulated). 
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cently, David Tarr (1989), using a general 
equilibrium model, estimated these rents in 
textiles, steel, and autos to be even higher 
—about 14 billion dollars in 1984. Takacs 
(1987) points out that proposals to auction 
quotas have become increasingly frequent? 
She states: "Commissioners Ablondi and 
Leonard of the U.S. International Trade 
Commission (ITC) recommended auction- 
ing sugar quota licenses in 1977. The ITC 
recommended auctioning footwear quotas 
in 1985. Studies by Gary Hufbauer and 
Howard F. Rosen (1985) and Robert Z. 
Lawrence and Robert E. Litan (1985) sug- 
gested auctioning quotas and earmarkin 
the funds for trade adjustment assistance." 

Despite the importance of the issues in- 
volved, the intuition behind such statements 
and the procedure used in the estimation 
are based on models of perfect competition. 
In such models, the level of the quota deter- 
mines the domestic price, and the differ- 
ence between the domestic price and the 
world price determines the price of a li- 
cense when auctioned. If the country is 
small, then the world price is given. If the 
country is large, then the world price does 
change with a quota. How it changes is 


3The interested reader should consult Bergsten 
et al. (1987) and Takacs (1987) for a historical and 
institutional perspective on work in this area. 

“See Takacs (1987), fn. 7. 
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determined by supply and demand condi- 
tions in the world market. 

However, when markets are imperfectly 
competitive, as they are thought to be in the 
market for autos, this analysis is misleading.’ 
In such environments, prices are chosen by 
producers so that there is no supply curve, 
and the response of producers to the con- 
straint must be taken into account when 
determining the price of a license when it is 
auctioned off. For example, if the response 
of profit-maximizing producers is to adjust 
their prices so that there is no benefit to be 
derived from owning a license to import, its 
auction price must be zero! 

Therefore, the question that needs to be 
addressed concerns the behavior of produc- 
ers in response to quantitative constraints, 
and the impact of this on the price of a 
license. There are two policy questions that 
need to be addressed. First, should existing 
licenses be auctioned off? Second, if quotas 
are set at their optimal level, can they be 
welfare-improving over free trade? There 
has been relatively little work in this area. 
The work on the effects of quantitative re- 
strictions in imperfectly competitive mar- 
kets is linked to the above question,® but to 
date, little work on what this might suggest 
about the price of a license seems to exist. 

In this paper, I develop a series of models 
of monopoly that address this issue. The 
models show that the way in which licenses 
are sold, the demand conditions, and the 
market structure all influence the resulting 
price of a license. The results indicate that 
there is reason to expect that the price of a 
license may be much lower than that indi- 
cated by applying models of perfect compe- 
tition. Thus, estimates of potential revenues 
such as those of the CBO and Bergsten 
et al. may be far too large. Moreover, if no 
revenues are to be raised from auctioning 
quotas unless they are very restrictive, the 


"Auction quota revenues in autos for 1987 are esti- 
mated at about $2.2 billion by Bergsten et al. (1989). 
The quota revenues fluctuate quite considerably over 
the years as demand fluctuates which alters the restric- 
tiveness of the quota. 

See Krishna (1989b) for a survey of this work. 
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profit-shifting effect of such quotas, even 
when auctioned off, is unlikely to outweigh 
the loss in consumer surplus of such poli- 
cies. For this reason, they are likely to have 
adverse welfare consequences even when 
set optimally. 

. I- do not argue that in the real world 
license sales will raise no revenues. In the 
presence of uncertain demand they will, as 
licenses have an option value in this case. I 
merely point out that there is reason to 
expect revenues to be lower than those esti- 
mated under the assumption of perfect 
competition. I make my arguments in the 
simplest model, one without uncertainty. 
The uncertainty case is discussed in Krishna 
(1988b). 

If there is a single foreign supplier of the 
product and markets are segmented, the 
price of a license is clearly zero. It is opti- 
mal for the monopolist to raise his price in 
response to a quota or VER so that the 
price of a license becomes zero. This model 
with segmented markets is mentioned in 
Hirofumi Shibata (1968), and it is developed 
diagrammatically in Takacs (1987) and in 
Paul Krugman and Elhanan Helpman 
(1989), who also look at foreign oligopoly. 

However, one would expect that the pres- 
ence of other markets and the possibility of 
arbitrage between them would make it opti- 
mal for the foreign monopolist to limit his 
price increase in response to a quota, 
thereby creating a positive price for the 
license. Thus, one might expect nonzero 
prices for licenses when markets are not 
segmented even with a foreign monopoly. 
Somewhat surprisingly, this is not necessar- 
ily so. Quotas set close to the free trade 
level always have a zero price. The only 
effect is an increase in the world price! This 
is the subject of Section I. 

A simple example is developed in Section 
II in order to show how restrictive the quota 
has to be for the license price to become 
positive. The way that this varies with the 
relative size of the markets and demand 
elasticities is considered in addition to the 
welfare consequences of such policies. Sec- 
tion III briefly discusses the effect of an 
alternative timing structure on the results 
and argues that the spirit of the results 


830 ' | THE AMERICAN ECONOMIC REVIEW 


remains valid. Section IV contains some 
concluding remarks and.directions for fu- 
ture research. 


I. Foreign Monopoly with Costless Arbitrage 


In this section it is assumed that there is a 
foreign monopolist who cannot price dis- 
criminate between his markets. Let Q(P) 
` and q( p) denote the demand functions fac- 
ing the foreign firm in the home market and 
in the other market(s) respectively. Let 
Clq + Q] denote its cost function.? 

Let R(P) be profits from sales in the 
home market. Assume that R(P) is concave 
in P and is maximized at P*. Similarly, let 
r( p) be the profits from sales in other mar- 
ket(s), and let r(p) be concave and maxi- 
mized at p*. It is easy to see that 


€ 


p*- 





e 
E and p "lei 


where & and e are the respective demand 
elasticities, so that the monopolist in the 
absence of arbitrage would choose to charge 
a higher price in the market with less elastic 


demand. Because of arbitrage, however, the . 


monopolist will choose one price that will 
be between the two prices he would have 
set in the absence of arbitrage possibilities. 
The optimal price for him to set maximizes 
mq (P) — R(P)+ rCP), and is given by 


£ 
M 
di &£—1 e 
where & = ĝe +(1— 0)e and 09 — Q/(q +Q). 
This is the free trade price. Thus, the 
monopolist chooses price as if he were faced 
with one market where the elasticity of de- 
mand is a share-weighted combination of 
the elasticities of the two markets. The 
question, then, is how a quota affects the 





"There may be domestic competitive suppliers in 
which case the monopolist’s demand in what follows 
should be interpreted as the residual demand curve. 

Here marginal costs are assumed constant. Similar 
results obtain when marginal costs are not assumed 
constant. 
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price charged by the monopolist when the | 
quota licenses are auctioned off. 

At this point it is important to be clear 
about exactly what constitutes a license, how 
licenses are sold, and what the timing of 
moves is. With market segmentation, a li- 
cense is defined to be a piece of paper that 
entitles its possessor to buy one unit of the 
product in question at the price charged in 
his market. If arbitrage is possible, then the 
possessor buys at the lower of the prices 
charged by the seller in the home and the 
world market. However, it is a dominated 
strategy for the monopolist to attempt to 
charge different prices in his different mar- 
kets as sales will only be made at the lower 
of the two prices. For this reason, the 
monopolist can be restricted, to choosing 
only one price. 

Licenses are sold in a competitive market 
either to competitive domestic retailers with 
zero marginal costs of retailing or directly to 
consumers. In Sections I and II, I assume 
that the timing of moves is as follows. First, 
the government sets the quota. Then the 
monopolist sets his price. Finally, the mar- 
ket for licenses clears. This timing is consis- 
tent with the idea that the market for li- 
censes clears more frequently than the 
monopolist sets prices, and that the govern- 
ment sets the quota even less frequently 
than the monopolist sets prices. Section III 
studies the case where the monopolist can. 
adjust prices faster than the rate at which 
the market for licenses clears. 

The model is then solved backwards as 
usual. First consider the market for licenses. 
If the price charged by the monopolist is P 
and the price of a license is L, then the 
demand for licenses must equal the demand 
for the good at price P + L, Q(P + L). The 
supply of licenses is V, the level of the 
quota. The equilibrium price of a license 
is given by L(P,V). LC) is defined by the 
market for licenses clearing, so that Q(P + 
L)=V. Notice that if Q(P)«V, then 
L(P,V) «0 as defined thus far. However, 
since a quota is not binding if such a high 
price is charged, L(P,V) is defined to be 
zero in this case. Let P"(V) be defined by 
Q(PV(V)) «V so that L(P,V)> 0 and the 
quota is binding if P < P’(V). By the defi- 
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q+Q 


Ficure 1 


nition of L(-), it is apparent that if P < 
P'(V), then demand at home equals V, 
although V is less than Q(P). 

Now consider the total demand facing the 
monopolist with a quota. As demand in the 
home market has. shrunk for P < P'(V), 
the total demand curve has a kink in it at 


P'(V). This is depicted in Figure 1 for the - 


quota level V. 

In the absence of any quota, demand is 
given by AD, and marginal revenue by AF. 
The monopolist sets price equal to P", and 
sells QCP™”)+ q(PM). For convenience, 
Figure 1 is drawn so that Q(P) and q(P) 
are identical and linear. Hence the marginal 
revenue corresponding to total demand co- 
incides with Q(P). A quota at the free trade 
level, Q(P^), makes the demand facing the 
monopolist into ABE. This creates a kink in 
the demand curve at P", Marginal revenue 


is given by AGHR. Therefore, it remains: 


optimal for the monopolist to price at P. 

Now consider the effect of reducing the 
quota from Q(P™) to V. V is the largest 
quota such that the intersection of marginal 
revenue and marginal cost occurs on the 
steeper but not vertical segment of the 
marginal revenue curve. This raises the price 
at which the quota binds to P’(V) from 


P^, and the kink in demand occurs at 
P"(V). The demand curve with a quota at V 
is given by AIJ and the corresponding 
marginal revenue curve by AKLM. Notice 
that if V is close to Q™, the profit maximiz- 
ing point must occur at the intersection of 
the vertical part of the marginal revenue 
curve and marginal cost, C.? Therefore, the 
monopolist will find it optimal to charge 
P"(V) so that the price of a license is zero! 
Only if V is so small that the intersection of 
the marginal revenue curve (with a quota) 
and the marginal cost curve occurs on the 
steeper but not vertical segment of the 
marginal revenue curve will the price of a 
license be positive. This can only occur if V 
is substantially below Q™. In Figure 1, any 
V lower than V, the quota level depicted, 
gives L(-) » 0. 


°This is because when V = Q^, the intersection of 
HR, the marginal revenue curve corresponding to BE, 
part of which is given by HR, with marginal cost would 
occur at a lower output level than the output level at 
which the kink in demand occurs. As the quota falls, 
the kink in the demand curve moves back twice as fast 
as the intersection of the marginal revenue curve with 
marginal cost. Finally, at the quota level of V depicted, 
the two coincide. 
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FIGURE 2 


Figure 2 illustrates the effects of varying a 
quota. The domestic demand curve is de- 
picted by 44. BB depicts the profit-maxi- 
mizing price charged in both markets by the 
monopolist assuming that home demand is 
fixed at V independently of that price. As 
r(P) is concave, BB is upward sloping. Note 
that when V, which along with Q is on the 
horizontal axis, equals Q™, the price along 
BB must exceed P", Also, AA and BB 
intersect at output level V by the definition 
of V given previously. 

The price charged when the quota is V' is 
defined by P"(V), which is the dark line in 
Figure 2. If the quota is set above Q™, it is 
not binding, so that the price charged lies 
along EC. If the quota lies between Q” 
and V, so that it is *not too restrictive," the 
intersection of marginal revenue and cost 
occurs along the vertical segment of 
marginal revenue, so that the profit-maxi- 
mizing price equals P"(V) and lies along 
AA. If the quota is below V, so that it is 
* very restrictive," then the profit-maximiz- 
ing price lies along BB. Therefore, the price 
charged by the monopolist first rises and 
then falls as the quota is reduced from the 
free trade level. 

The license price is the vertical distance 
between P™(V ) and the demand curve, AA. 
Only when P™(V) falls below AA can L(-), 
the license price, be positive. Since it is only 


when L(-)>0 that auctioning quota li- 
censes can raise revenue, it is only when the 
quota is set below V that welfare can rise. 
Moreover, since welfare falls as the quota 
falls from Q^ to V, an even stronger condi- 
tion is required for auctioning quotas to 
increase welfare relative to free trade. When 
V is set below V, it is apparent from Figure 
2 that the sum of consumer surplus and 
license revenues must fall short of free trade 


' consumer surplus as long as V > V. Even at 


V «V, it is short by GFE, the deadweight 
loss of such z policy. Thus, only if P/(V) < 
P™, can welfare possibly rise. 

This leaves the issue of auctioning exist- 
ing quotas. Here the desirability of this pol- 
icy depends on the existing method of im- 
plementation. If the VER is implemented 


. by giving away licenses to foreign agents 


other than the foreign monopolist and the 
licenses are transferable in a competitive 
market, the monopolist’s problem is the 
same whether or not licenses are auctioned. 
In this case, the only effect of auctioning 
licenses is to transfer license revenues, if 
they exist, from foreign license holders to 
the domestic government. In this case, auc- 
tioning licenses is better than giving them 
away only when the license price is positive, 
that is, when V « V. This gives us Proposi- 
tion 1, which addresses the two policy ques- 
tions raised earlier. 
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PROPOSITION 1: With a foreign monopoly 
and perfect and costless arbitrage, quotas at or 
close to the free trade level implemented by 
auctioning licenses yield no revenues and must 
reduce welfare. Quotas must be very restrictive 
if they are to raise welfare over free trade. 
Moreover, auction quotas dominate VERs at 
the same level only if the restriction is set at 
quite a low level. Note also that if the world 
price rises, a quota by one country reduces the 
welfare of other importing countries as well as 
that of the exporting country whose firm’s 
profits fall with a quota. 


Other methods of implementing a VER 
would lead to a different status quo and so 
a different answer about the desirability of 
auctioning existing quotas. For example, if 
the VER were implemented by giving all 
the licenses to the foreign monopolist, then 
the foreign monopolist would take license 
revenues to be a part of his profits and this 
would affect his pricing behavior. This case 
is analyzed in Krishna (1989a). 

Three further questions naturally arise. 
First, how restrictive must the quota be 
before a license commands a positive price? 
Second, how does the answer to this ques- 
tion depend on demand conditions? And 
finally, under what conditions can a quota 
that is auctioned off raise welfare over free 
trade? Next, a simple example is worked 
out that sheds some light on these ques- 
tions. 


JI. An Illustrative Example 


This example focuses on the role of the 
home market size relative to that of the 
other market and of demand elasticity in 
determining the effects of quota auctions. 

It is assumed that consumers at home and 
abroad have identical constant elasticity de- 
mand functions given by P~*. There are, 
however, N consumers at home and n con- 
sumers abroad so that demand in the home 
market is O(P)=NP~* and that in the 
foreign market is q(P) — nP^*. As before, 
marginal costs are constant at C. Profit max- 
imization in the absence of any quotas re- 
sults in the monopolist charging PM = 
Ce /(e —1), and selling Q" = N(Ce/ 
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(e —1))^* at home. As usual, it is assumed 
that e> 1 so that profits are well behaved. 

The smallest quota for which the license 
price is zero is depicted in Figure 1 by V. 
When V is set as the quota, the marginal 
revenue curve associated with the market 
demand curve when the quota is binding 
intersects the marginal cost curve at exactly 
the level of the quota. If the quota is set at 
V, the marginal revenue corresponding to 
the market demand curve when the quota is 
binding equals the marginal cost curve so 
that 


d|(P —C)V 4 (P - C)nP"*] 
dP 


This implies that 
(1) V-(P — C)enP- 6*? + nP^* =0. 


Moreover, for this to hold at the point where 
the constraint just binds, it must also be 
that 


(2) NP-* =V. 


Substituting (2) into (1) gives 


1-(P-C di i 0 
e Senn. —+—= 
(PaO TR 


so that the price at which this occurs, P’(V), 
is given by 


E Ce 
(0) PO) -— y. 
g£—-1—— 


Hence, 


834 . THE AMERICAN ECONOMIC REVIEW 


and 


€ 


W S- 


There are a few things to notice about 
this expression. First, Q^ always exceeds V 
so that the quota must be set below the free 
trade level of imports for a license price to 
be nonzero. Second, if n= N, 


EUM i £—1Y 
S- i 


If £> 2, say, e = 3, then Q” / V —8 so that 
the quota needs to be quite restrictive for a 
license price to be positive. Third, for any € 
and N, the limit as n>» of QM/V is 1. 
Thus, as the home market becomes small 
relative to the foreign one, the license price 
becomes positive when the quota is not very 
restrictive. Fourth, for any e, as long as 
e-1-(N/n)>0, Q"/V rises as N/n 
rises. Fifth, if e—1—(N/n) «0, the price 
of a license is always zero. As s - 1 — CN /n) 
approaches zero from above, P’(V)> 
and V — 0. Thus, the range where L(-) > 0, 
that is, where V < V, shrinks to zero. 

. If e and e& can differ in the constant 
elasticity case, then the analogue of (1) im- 
plies that py) Ce/(e —1) as long as 
V »0. Since the free trade price, P^, is 
C£ /(é — 1), where E = 0 - (1— 0)e and 8 is 
the share of the home market, P^ lies 
between Ce /(e —1) and Ce /(e — 1). More- 
over, as Ce /(e — 1) rises as e falls, PM(V) 
must exceed P™ if e « e. In this case, auc- 
tion quotas must reduce welfare. If e» e, 
P™(V) can lie below P^ and it is possible 
for welfare to rise when the optimal quota 
is set. The intuition for this is that if foreign 
elasticity of demand, e, is greater than home 
elasticity of demand, e, then raising price to 
make the quota bind has a high cost in 
terms of losing customers in the foreign 
market. This is especially so if the home 
market is small relative to the rest of the 
world. It is possible to construct examples 
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of cases where the optimal policy is a quota 
below the free trade level of imports.? 

The results of this section are summa- 
rized in Proposition 2. - 


PROPOSITION 2: If the elasticity of de- 
mand is constant and equal in both markets, 
then welfare must fall if auction quotas are 
imposed. If the home market demand is less 
elastic than that of the rest of the world, that 
is, e > e, welfare can rise if the optimal quota 
is auctioned off. If home market demand is 
more elastic than the rest of the world, that is, 
e <e, then welfare must fall. 


III. The Importance of Timing 


So far we have assumed that the market 
for licenses clears faster than the monopolist 
sets his price. Thus, tlie monopolist can act 
like a Stackelberg leader and take into ac- 
count the effect of his actions on the equi- 
librium price of a license. One might ask 
whether this timing structure is responsible 
for the results. Here I argue that this is not 
the case. 
© Consider the model of Section I with the 
new timing structure..In the last stage, the 
firm chooses price P taking as given the 
value of L and V. Its profits thus depend on 
how consumer demand is affected given this 
level of L end V. If consumers assume that 
L is fixed and that any number of licenses 
will be available at this price, their demand 
for the gocd is given by Q(P + L) even if P 
is very low. I call this the case with myopic 
consumers. In this case there is no kink in 
the. demand: curve facing the monopolist. 
The equilibrium license price, L(V), can be 


‘shown to be exactly the specific tariff that 


leads to V being demanded. This gives: 
Proposition 3. 


PROPOSITION 3: When firms take L as 
given, the equilibrium price of a license is zero 
if VzV*. It is positive if V «V^ and in- 
creases as V decreases. Thus, for V «V^, a 
license always has a positive price. 


Assuming e=11,e= 8, N/n= 0.0001 yields one 
such example. . 
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If, on the other hand, consumers do not 
assume that L is fixed even if P is very low, 
we have the case with nonmyopic con- 
sumers. Here consumers realize that the 
number of licenses is limited to V. They 
infer that if the monopolist charges a very 
low price so that Q(P + L)» V, that is, 
P < P*(V)— L, then the shadow price of a 
license will exceed L and equal L where 
QCP + L) - V, so that L = PV(V)— P. This 
will give the monopolist a total demand of 
q(P)+V instead of g(P)+ Q(P + L). 

Consumers therefore take the license 
price as given when the product price is 
high, but they realize that a low product 
price creates a black market for licenses 
and raises the effective license price. This 
asymmetry can be shown to create a contin- 
uum of equilibrium license prices. A zero 
license price remains an equilibrium as long 
as the quota is not too restrictive. The re- 
sults are summarized in Proposition 4. 


PROPOSITION 4: With  nonmyopic con- 
sumers, zero remains an equilibrium license 
price as long as the quota is not too restrictive. 
For more restrictive quotas, equilibrium li- 
cense prices are bounded away from zero. 
However, there is a continuum of such license 
prices for any quota below the free trade level." 


Thus, the result that auction quotas may 
not raise revenues unless they are quite 
‘restrictive reemerges even when the timing 
of moves is altered and consumers are not 
myopic. However, it is less compelling here 
as other equilibria with positive license 
prices also exist. 


IV. Conclusion 


The main thrust of this paper is that with 
foreign monopoly the effects of auctioning 
quota licenses must take into account the 
induced effect on the pricing policy of the 
monopolist. Doing so casts doubt on the 
revenue-raising potential of such policies 
and this adversely affects their welfare con- 


"For detailed proofs of Propositions 3 and 4, see 
Krishna (1989b). 
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sequences. Though this paper deals with 
foreign monopoly, the point is more gen- 
eral. Similar results go through with foreign 
oligopoly as well, as shown in Krishna 
(19882). 

Still, much remains to be done to deter- 
mine the desirability of auction quotas. First, 
their desirability under uncertainty needs 
more study. Second, it may be possible to 
use recent work on computable partial equi- 
librium models, such as that of Avinash 
Dixit (1987) and Anthony Venables and 
Alasdair Smith (1986), to help build empiri- 
cally implementable models to give esti- 
mates of the welfare effects of auctioning 
quota rights in’ particular markets. Third, 
while this paper assumes the market for 
licenses is competitive, it would be desirable 
to study the determinants of the market 
structure in the market for licenses. Fourth, 
as the details of the implementation proce- 
dure, for both VERs and auctioning quotas 
are crucial, more attention to these is war- 
ranted. I am currently working along these 
lines. 


SYMBOLIC NOTATION 


P Price in the home market 
Price in the foreign market 
Demand in the home market 
q(p) Demand in the foreign market 
Clq + Q] Cost function 





R(P) Profits in the home market 
r(p) Profits in the foreign market 
E Demand elasticity in the home market 
e Demand elasticity in the foreign market 
9 Share of the home market in sales — 
q*Q 
E Share weighted average of elasticities 
8s - (1— 0)e 
V Quota level 
pM Price under free trade 
PV(V) Price at which the quota just binds 
P"(y) Price charged by monopolist when quota is V 
L(P,V) License price as a function of P and V 
P$ Price at which R(P) is maximized 
p* Price at which r(p) is maximized 
T(P) Tota! profits which equal R(P)+ r(P) 
V Smallest quota where L(P,V) equals zero 
V Quota where PM(V) = PM 
N Size of the home market 
n Size of the foreign market 
L Effective license price with nonmyopic 


consumers | 
Free trade imports 


e 
& 
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Tying, Foreclosure, and Exclusion 


By MicHAEL D. WuInston* 


In recent years, the "leverage theory" of tied good sales has faced heavy and 
influential criticism. In an important sense, though, the models used by its critics 
are actually incapable of addressing the leverage theory's central concerns. Here I 
reconsider the leverage hypothesis and argue that tying can indeed serve as a 
mechanism for leveraging market power. The mechanism through which this 
leverage occurs, its profitability, and its welfare implications are discussed in 


detail. (JEL 610) 


A firm engages in tying when it makes the 
sale (or price) of one of its products condi- 
tional upon the purchaser also buying some 
other product from it. Tying has a long 
history of scrutiny under the antitrust laws 
of the United States, and throughout this 
history it has been harshly treated by the 
courts.’ A primary basis for this condemna- 
tion has been the courts’ belief in what has 
come to be known as the “leverage theory” 
of tying: that is, that tying provides a mech- 
anism whereby a firm with monopoly power 
in one market can use the leverage provided 
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paper was done during the summers of 1986 and 1987. 
I would like to thank the Federal Trade Commission 
and the Graduate School of Business at Stanford Uni- 
versity for their hospitality during that time. I would 
also like to thank Patrick Bolton, Franklin Fisher, 
Louis Kaplow, Richard Schmalensee, and Jean Tirole 
for their helpful comments and George and Elizabeth 
Whinston for their editing assistance. Financial sup- 
port from the National Science Foundation through 
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‘Tying doctrine was originally developed in patent 
cases (Motion Pictures Patents Co. v. Universal Film 
Manufacturing Co., 243 U.S. 502 (1917)). Since then a 
long line of case law has developed under both Section 
1 of the Sherman Act and Section 3 of the Clayton Act. 
(See, for example, International Salt v. U.S., 332 US. 
392 (1947) and Northern Pacific Railway Co. v. U.S., 
356 U.S. 1 (1958).) Similar ideas have also been devel- 
oped under Section 2 of the Sherman Act. (See, for 
example, U.S. v. Griffith, 334 U.S. 100 (1948) and U.S. 
v. United Shoe Machinery Corp., 110 F. Supp. 295 (D. 
Mass. 1953).) Two cases involving less harsh treatment 
are Times-Picayune Publishing Co. v. U.S., 345 U.S. 594 
(1953) and U.S. v. Jerrold Electronics Corp., 365 U.S. 
567 (1961). 
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by this power to foreclose sales in, and 
thereby monopolize, a second market. 

In recent years the leverage theory has 
come under heavy attack from a number of 
authors whose arguments are traceable to 
the University of Chicago oral tradition as- 
sociated with Aaron Director (see, for ex- 
ample, Director and Edward Levi, 1956; 
Ward S. Bowman, 1957; Richard A. Posner, 
1976; and Robert H. Bork, 1978). A typical 
rendition of their criticism goes along the 
following lines: Suppose that a firm is a 
monopolist of some good A that a con- 
sumer values at level v4 and that costs c, to 
produce. The consumer also consumes some 
other competitively supplied product B that 
she values at level v, and that can be pro- 
duced at a unit cost of cg. Now, the 
monopolist could require the consumer to 
purchase good B from him if she wants 
good A, but what will he gain? The con- 
sumer will only purchase such a bundle if its 
price is no larger than v4 cg, and so the 
monopolist can do no better than earning 
(v4 — c4), the level he earns selling good A 
independently. In short, there is only one 
monopoly profit that can be extracted. 

Similar arguments are given for the 
case of complementary products. Richard 
Posner (1976), for example, comments as 
follows: 


[A fatal] weakness of the leverage the- 
ory is its inability to explain why a firm 
with a monopoly of one product would 
. want to monopolize complementary 
products as well. It may seem obvi- 
ous..., but since the products are by 
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hypothesis used in -conjunction with 
one another..., it is not obvious at all. 
If the m. of the tied product is 
higher than the purchaser would have 
to pay on the open market, the dif- 
ference will represent an increase in 
the price of the final product or ser- 
vice to him, and he will demand less of 
it, and will therefore buy less of the 
tying product. To illustrate, let a pur- 
chaser of data processing be willing to 
pay up to $1 per unit of computation, 
requiring the use of 1 second of ma- 
chine time and 10 punch cards, each 
of which costs 10 cents to produce. 
The computer monopolist can rent the 
computer for 90 cents a second and 
allow the user to buy cards on the 
open market for 1 cent, or, if tying is 
permitted, he can require the user to 
buy cards from him at 10 cents a card 
—but in that case he must reduce his 
machine rental charge to nothing, so . 
what has he gained? [p. 173]. 


Thus, the critics contend, if a monopolist 
does employ tying, his motivation cannot be 
leverage. In its place, they point to a num- 
ber of socially beneficial, or at worst am- 
.biguous, alternative explanations for tying: 
for example, price discrimination (Bowman, 
1957), achieving economies of joint sales, 
protection of goodwill, risk sharing, and 
cheating on a cartel price. Almost inadver- 
tently, the more formal economics literature 
on tying (Meyer L. Burstein, 1960; Roger D. 
Blair and David L. Kaserman, 1978; Richard 
Schmalensee, 1982) has reinforced this view 
as a result of its exclusive focus on price 
discrimination motivations for the practice. 
Thus, Posner (1976) goes on to note that 
"the replacement of leverage by price dis- 
crimination in the theory of tie-ins has been 
part of the economic literature for almost 
twenty years.”* These criticisms have, in 


Bork (1978) sums up his discussion of tying more 
emphatically: ' ‘[The leverage] theory of tying arrange- 
ments is merely another example of the discredited 
transfer of power theory, and perhaps no other variety 
of that theory has been so thoroughly and repeatedly 
demolished in the legal and economic literature." ` 
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fact, had a tremendous impact in both legal 
and economic circles.’ 

In an important sense, however, the exist- 
ing literature does not really address the 
central concern inherent in the leverage 
theory, namely, that tying may be an effec- 
tive (and profitable) means for a monopolist 
to affect the market structure of the tied 
good market (i.e., “monopolize” it) by mak- 
ing continued operation unprofitable for tied 
good rivals. The reason lies in the 
literature’s pervasive (and sometimes im- 
plicit) assumption that the tied good market 
has a competitive, constant returns-to-scale 
structure. With this assumption, the use of 
leverage to affect the market structure of 
the tied good market is actually impossible. 
Thus, in contrast to a concern over the 
effects of tying on market structure, the 
existing literature’s focus is on a demand- 
side notion of “leverage”: the idea that, 
taking the prices charged by tied good com- 
petitors as given, a firm might be able to 
extract greater profits from consumers by 
tying. 

In this paper, I reexamine the leverage 
hypothesis. In particular, I examine several 
simple models that depart from the compet- 
itive, constant returns-to-scale structure as- 
sumed in the existing literature. In contrast, 


?In a recen: antitrust textbook, for example, Blair 
and Kaserman (1985) comment that "according to this 
view, somehow the seller expands or levers his 
monopoly power from one market to another. This, of 
course, is not possible. A seller cannot get two 
monopoly profits from one monopoly.... Thus, the 
leverage theory of tying is unsatisfactory." The 1985 
Department of Justice Vertical Restraints Guidelines 
state that “Tying arrangements often serve procompet- 
itive or competitively neutral purposes... . [They] gen- 
erally do not kave a significant anticompetitive poten- 
tial.” For a recent rebuttal to this view in the legal 
literature, however, see Louis Kaplow (1985). 

^Indeed, this is exactly the sense in which the exist- 
ing literature can be said to focus on price discrimina- 
tion aspects of the practice; it analyzes whether tying is 
a profitable strategy given the prices of tied good 
competitors (wich can be thought of as creating an 
induced demand structure for the monopolist). In con- 
trast, here my focus is on the ability of tying to change 
those prices, in particular, by making continued opera- 
tion unprofitable for competitors. 
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here I assume that scale economies exist in 
the production process for the tied good, 
and as a result, the structure of that market 
is oligopolistic. 

In these models I address three basic 
questions. First, can tying succeed in alter- 
ing the market structure of the tied good 
market, and if so, how? Second, is it a 
profitable strategy? Third, what are the wel- 
fare consequences? As we shall see, tying 
can lead to a monopolization of the tied 
good market. Most interestingly, the mecha- 
nism through which this exclusion occurs is 
foreclosure; by tying, the monopolist re- 
duces the sales of its tied good market com- 
petitor, thereby lowering his profits below 
the level that would justify continued opera- 
tion. 

Tying is frequently a profitable strategy 
for the monopolist in these models, and it is 
often so precisely because of its potential 
for altering the market structure of the tied 
good market. The particular circumstances 
in which tying is a desirable strategy for the 
monopolist, however, depend in part on 
whether he is able to make a precommit- 
ment to tie. In many circumstances this is 
indeed possible. One of the primary ways in 
which this can be accomplished is through 
product design and the setting of produc- 
tion processes, both of which may involve 
significant sunk costs. By bundling compo- 
nents of its system together or by mak- 
ing interfaces between the separately sold 
components incompatible with their rivals’ 
components, firms can precommit to their 
marketing strategy. IBM, for example, was 
accused of incorporating increased amounts 
of storage into its central processing units in 
order to prevent sales by plug compatible 
memory manufacturers and also of trying to 
achieve interface incompatibility for the 
same purpose (Franklin M. Fisher, John J. 
McGowan, and Joen E. Greenwood, 1983, 
pp. 332-33). Kodak was accused of design- 
ing its new film and camera in a format 
incompatible with rival manufacturers’ 
products (Berkey Photo v. Eastman Kodak 
Co., 603 F. 2d 263 Cd Cir., 1979). 

On the other hand, in a significant num- 
ber of tying cases little more than an easily 
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changed marketing decision seems to be 
involved. For example, in Times-Picayune 
Publishing Co. v. U.S. (345 U.S. 594 (1953), 
the publisher of the only morning newspa- 
per in New Orleans only sold an advertise- 
ment in hiş morning paper with an adver- 
tisement in that day’s evening newspaper 
(which faced competition from another 
evening newspaper). In U.S. v. Griffith (334 
U.S. 100 (1948)), a movie theater chain re- 
fused to show films in its theaters in towns 
in which it possessed a monopoly if the 
distributor did not give it that film in towns 
where it faced competition. In United Shoe 
Machinery Corp. v. U.S. (110 F. Supp. 295 
(D. Mass. 1953)), United Shoe bundled re- 
pair service with its shoe machinery leases. 

Finally, when tying does lead to exclusion 
of rivals, the welfare effects both for con- 
sumers and for aggregate efficiency are in 
general ambiguous. The loss for consumers 
arises because, when tied market rivals exit, 
prices may rise and the level of variety avail- 
able in the market necessarily falls. Indeed, 
in the models studied here, tying that leads 
to the exit of the monopolist’s tied market 
rival frequently leads to increases in all 
prices, making consumers uniformly worse 
off. More generally, though, as is common 
in models of price discrimination, some con- 
sumers may be made better off by the intro- 
duction of tying. The effect on aggregate 
welfare, on the other hand, is uncertain 
because of both the ambiguous effects of 
price discrimination and the usual ineffi- 
ciences in the number of firms entering an 
industry in the presence of scale economies 
and oligopolistic pricing (A. Michael Spence, 
1976; N. Gregory Mankiw and Whinston, 
1984). 

Though most tying cases involve products 
that are complements (particularly those 
where precommitment is involved), for ex- 
positional purposes I begin below by consid- 
ering the case of independent products. In 
Section I, I first analyze the simple case 
where all consumers have an identical valu- 
ation of the monopolized product, so that 
the monopolist, if he chooses to price his 
goods independently, can fully extract all of 
the surplus from his monopolized good. I 
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show that, absent precommitment, tying is 
not a useful strategy for the monopolist; any 
equilibrium outcome will be equivalent to 
one where only independent pricing is al- 
lowed. Despite this fact, however, a pre- 
commitment to tying can be a profitable 
strategy for the monopolist because of its 
potential for excluding his tied market rival. 
This exclusionary effect arises because of 
what I call “strategic foreclosure”: tying 
represents a commitment to foreclose sales 
in the tied good market, which can drive its 
rival’s profits below the point where remain- 
ing in the market is profitable. This strategic 
incentive to foreclose sales in the tied good 
market occurs because once the monopolist 
has committed to offering only tied sales, it 
can only reap its profit from its monopolized 
product by making a significant number of 
sales of the tied good. Thus, in this model, 
tying necessarily lowers the profits of the 
monopolist’s tied good rival. I then discuss 
the implications of such a commitment to 
tying for the monopolist's profits, for con- 
sumers, and for aggregate efficiency, and 
present a simple example to illustrate these 
points. 

In Section II, I investigate how the pres- 
ence of heterogeneous preferences among 
consumers for the monopolized good affects 
these results. Two basic findings emerge. 
First, with heterogeneous preferences for 
the tying good, tying no longer necessarily 
results in strategic foreclosure and the low- 
ering of the monopolist's tied good rival's 
profits (though it still does in many circum- 
stances). If, for example, a significant num- 
ber of consumers in the tied market have 
low valuations of the tying good, tying (not 
surprisingly) will not be a successful exclu- 
sionary device. In addition, a more subtle 
effect may prevent a commitment to tying 
from lowering the tied good rival's pro- 
fits. This occurs when tying substantially 
decreases the responsiveness of the monop- 
olist’s demand to price changes relative to 
the level previously prevailing in the tied 
good market. 

Second, with heterogeneous valuations, 
tying can now also be a profitable strategy 
in the absence of precommitment. There 
are two senses in which this is true. First, in 
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a purely static sense, the monopolist may 
find tying to be a profitable strategy given 
its rival's price. This motivation for tying is 
analogous to that in the monopolistic 
bundling literature (for example, W. J. 
Adams and J. L. Yellen, 1976; R. Preston 
McAfee, John McMillan, and Whinston, 
1989), but here it can have important com- 
petitive effects: tied product rivals can find 
their sales foreclosed and continued opera- 
tion unprofitable. Second, even when tying 
is not profitable in this static sense, it may 
be in a dynamic sense when the exclusion of 
rivals through predation is possible. In such 
cases, tying can be a profitable strategy for 
the monopolist precisely because it fore- 
closes the sales of the monopolist's tied 
market rival. 

In Section III, I turn to the case of com- 
plementary products used in fixed propor- 
tions. I first consider a model of fixed pro- 
portions that is essentially an extension of 
the simple example quoted above from 
Posner (1976) to the case where the tied 
good market involves scale economies and 
oligopolistic behavior. Despite these differ- 
ences, Posner’s central contention continues 
to hold: a monopolist of one component 
never finds it worthwhile to tie in order to 
reduce the level of competition in the mar- 
ket for the other component. The reason 
lies in the fact that when the monopolized 
product is essential for all uses of the two 
products, the monopolist can always benefit 
from more competition in the non- 
monopolized market through sales of its 
monopolized product. Nevertheless, I then 
show that in two natural extensions of this 
model where the monopolized product is no 
longer essential for all uses of the non- 
monopolized components, tying once again 
emerges as a profitable exclusionary strat- 
egy. In one case, the presence of an infe- 
rior, competitively supplied alternative to 
the monopolized component leads to results 
that parallel those for independent prod- 
ucts. In the other case, the existence of a 
second use for the nonmonopolized product 
(such as a replacement part market) can 
give the monopolist an incentive to tie in 
order to reduce competition in this other 
market. 
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Finally, I conclude in Section IV with a 
brief discussion of the implications of these 
. findings. l 


I. Independent Products 


I begin by considering an extremely sim- 
ple model with independent products. There 
are two markets, which I label A and B. 
Market A is monopolized by firm 1 (say, 
because of a patent). Market B, on the 
other hand, is potentially served by two 
firms, firm 1 and firm 2. The products of 
firms 1 and 2 in market B are. differenti- 
ated. Production.in market B involves fixed 
costs of K; plus an expenditure of cg, per 
unit for firm i. Unit costs for good A are 
c4. For expositional simplicity, I ignore the 
possibility that there are fixed costs for 
product A. 

Consumers, who are indexed by d € (0, 1) 
with total measure 1, each desire at most 
one unit of good A and one unit of good B. 
All consumers have a reservation value of 
y>c, for good A, while a consumer of 
type d has a valuation of vg(d) for a unit 
of firm i’s product B. Resale of products by 
consumers is assumed to be prohibitively 
costly. In the absence of tying by firm 1, 
consumers simply respond to individual 
product prices (P4, P5,, P5;). Firm i’s sales 
of product Bi are then given by some func- 
tion x'(Pg, Pg2) <1, which I assume to be 
everywhere differentiable and satisfy (sub- 
scripts denote partial derivatives) 
x(Pg, P5) z Oif j * i and «Oif j=i, with 
strict inequalities if x'(-,-) € (0,1). That is, 
products B1 and B2 compete with each 
other for consumer purchases. 

When bundling is not permitted (which I 
will refer to below as an “independent pric- 
ing game"), it is easy to see that firm 1 will 
always set P, — y. It is also useful for what 
follows to define each firm i’s best response 
correspondence in market B by P3(Pz,,), 
which solves 


max ( Pg, — Cpi) X ( Pg, Pp2)- 
Ppi l , 


I assume that this correspondence is single- 
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ae continuous, and differentiable with 

PH (Ps) € (0, 1) (so products B1 and B2 
are strategic complements in the sense of 
Jeremy I. Bulow, John D. Geanakoplos, and 
Paul D. Klemperer, 1985). 

In the next two subsections I analyze the 
use of tying both for cases where firm 1 can 
precommit to tie and where it cannot. For 
the case without precommitment, I analyze 
a simple two-stage game. In stage one, each 
firm simultaneously decides whether to be 
active in market B. If firmi decides to be 
active, it incurs the cost K;. In stage two, 
the firms pick prices (simultaneously if both 
are active). If firm 1 is active in market B, it 
can offer three different items for sale: good 
A at a price of P,, good B1 at a price of 
Pg,, and a bundle consisting of one unit of 
good A and one unit of good B1 at a price 
of P. If firm 2 is active, on the other hand, 
it can only offer good B2 at price Pj. 
Throughout I assume that firm 1 is unable 
to monitor customer purchases; this as- 
sumption rules out the use of requirements 
contracts (where a consumer agrees as a 
condition of buying good A not to buy good 
B2) and also implies that a bundle will be 
purchased only if P < P, + Pj. 

To analyze the case where precommit- 
ment is possible, I extend this game to three. 
stages. In the (new) first stage of the game, 
firm 1 commits to which subset of three 
possible products—good A, good B1, and a 
bundle—it will be able to produce. For 
example, firm 1 can commit itself to a posi- 
tion where it will only be able to produce a 
bundle. The second and third stages are 
then identical to the no commitment game, 
but with firm 1 only able to offer for sale 
those items that it is able to produce.? Thus, 
as discussed in the introduction, by setting 
its design and production process, firm 1 is 
able to commit to a tying strategy. 

Finally, at various points below I make 
comparisons between the outcomes of these 
two games and those of a game where firm 
1-only offers goods A and B1 indepen- 


5Note that as long as firm 1 can produce both goods 
A and B1 separately it can still offer a bundle for sale. 
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dently (more precisely, a game that is the 
same as the no precommitment game but 
where bundling is prohibited). I refer to this 
game as the “independent pricing game.” 
Firm 1 is said to tie whenever its pricing is 
not identical (or, more generally, economi- 
cally equivalent) to that arising in this inde- 
pendent pricing game. 


A. Tying Without Precommitment 


Consider first the no commitment game. 
If firm 1 is active in market B, then in the 
second stage of this game it selects three 
(nonnegative) prices: (P4, Pg, P). As the 
following proposition makes clear, however, 
tying is not a useful strategy in this game. 


PROPOSITION 1: Any subgame perfect 
equilibrium outcome of the no commitment 
game is economically equivalent to a subgame 
perfect equilibrium outcome in the indepen- 
dent pricing game. 


PROOF: 

The proposition is established by arguing 
that in the subgames of the no commitment 
game in which firm 1 is active in market B, 
any Nash equilibrium in prices is equivalent 
to a Nash equilibrium ‘in the corresponding 
subgame of the independent pricing game. 
Then, given the equivalence of the equilib- 
ria in the pricing subgames, firms’ decisions 
about whether to be active in market B 
must also be equivalent in the two games. 

Consider the subgame where both firms 
are active in market B. The equivalence of 
equilibria is demonstrated by -arguing that 
for any set of prices ((P9, Ps,, P9); PR) that 
constitute a Nash equilibrium in the no 
commitment game there is a set of indepen- 
dent prices (P,,P,,) such that sales and 
profits are the same for both firms under 
prices Es Psi); Pg;) as under prices 
((P2, PR, P9); Pg.) when Ppa = P$, and are 
the same for firm 2 for any Pg. This implies 
that ((B,, Ppi); PR,) is a Nash equilibrium 
in the independent pricing game (note that 
firm 1 now has fewer possible deviations). 

The equivalence clearly holds if firm 1’s 
equilibrium strategy has P?» P? + P$, so 
suppose that P? < P? + PR,. There are two 
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cases to consider. First, suppose that P? > y. 
If this is firm 1’s best response, then it must 
be that all consumers are buying firm 1’s 
bundle since otherwise firm 1 could do bet- 
ter by setting P, — y while leaving all of its 
other prices unchanged: this price would 
make profitable sales of product A to those 
consumers not buying the bundle, while 
having no effect on firm 1’s sales of either 
good B1 or the bundle (since consumers are 
indifferent about buying good A at this 
price). In addition, since all consumers are 
purchasing the bundle (and therefore none 
are purchasing « either A or B1 alone) it 
cannot be that. P? < y since, if it were, firm 
1 could do better by offering only the bun- 
dle at a price of y. But, if so, then set- 
ting (B, = = y, By, = P? =y) yields identical 
sales and profits to both firms given Pĝ, and 
identical profits to firm 2 for all Pg;. Sec- 
ond, suppose instead that y > P. Note first 
that we must have P? > P? in such an equi- 
librium: otherwise all consumers would be 
buying firm 1’s bundle (all consumers would 
be willing to buy good A individually and 
they can get good A cheaper by buying the 
bundle) and firm 1 would increase its profits 
by offering only the bundle at a price of y. 
But if y > P} and P? > P9, then each con- 
sumer buys either good .4 alone or the 
bundle from firm. 1. In this case, prices of 
(B, = P9, És, = P? — P9) yield identical 
sales and profits for both firms for all Pg. 

A similar argument establishes the equiv- 
alence for the subgame where only firm 1 is 
active. | 


The basic idea behind Proposition 1 is 
fairly straightforward. First, it is always 
worthwhile for firm 1 to make sure that all 
consumers purchase product A either alone 
or in the bundle. Given that all consumers 
are consuming good A, however, if firm 1 
engages in tying, then consumers choose 
between buying only good A or the bundle 
from firm 1. They do so by imputing 


51 assume that all consumers will buy good A when 
P,=y. This assumption can be avoided through the 
use of limiting arguments, but it is made in order to 
ease the exposition. . 


x 
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an effective price of (P, — P4) (P, — y) if 
P4,» y) to the product B1 portion of the 
bundle, so that tying is effectively equivalent 
to an independent pricing strategy. 


B. Commitment and Strategic Foreclosure 


The negative result of Proposition 1 
changes dramatically if firm 1 is able to 
precommit to tying through its choice of 
which goods it will be able to produce. In 
the three-stage game that I have described 
above, firm 1 can choose to produce seven 
different sets of goods: both goods individu- 
ally, both goods individually and also a bun- 
dle, the bundle only, the bundle and prod- 
uct A, the bundle and product B1, A only, 
and B1 only. The argument in Proposition 1 
implies that the first two options both yield 
outcomes equivalent to those in the inde- 
pendent pricing game and so they are strictly 
better for firm 1 than the last two (which 
yield lower profits to firm 1 in any subgame 
where it is active and at least as large profits 
to firm 2 when it is active). In fact, the 
following two lemmas indicate that firm 1's 
choice is essentially between producing in- 
dependent goods and producing only the 
bundle. 


LEMMA 1: Any subgame perfect equilib- 
rium outcome in the subgame of the commit- 
ment game where firm 1 can produce only the 
bundle and product A is equivalent to a sub- 
game perfect equilibrium outcome of the inde- 
pendent pricing game. 


PROOF: 
The argument closely parallels that used 
to prove Proposition 1 and is omitted here. 
O 


LEMMA 2: Any subgame perfect equilibrium 
outcome in the subgame of the commitment 
game where firm 1 can only produce the bun- 
dle and product B1 is equivalent to a subgame 
perfect equilibrium outcome that arises in the 
subgame of the commitment game where firm 


` 1 can only produce a bundle. 


PROOF: 
In Appendix A. oO 
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Given these results, firm 1 can restrict its 
attention to either producing goods A and 
B1 separately, which yields an outcome 
equivalent to that in the independent pric- 
ing game, or to committing to producing 
only a bundle. J now turn to an investigation 
of the competitive effects of this tying strat- 
egy. As the following result makes clear, 
such a commitment may make it unattrac- 
tive for firm 2 to be active in the market. 


PROPOSITION 2: In the subgame of the 


‘commitment game where both firms are active 


and firm 1 has committed itself to producing 
only the bundle, firm 2 earns less than it does 
in the independent pricing game. 


PROOF: 
In Appendix A. o 


One might at first think that bundling in 
this context would have no effect at all: if 
firm 1 were charging independent prices of 
P,=y and Pg, a switch to bundling at a 
total price of y + Pg, would not change the 
demand for good B1 at all. The intuition 
for Proposition 2, however, centers on the 
way in which firm 1’s pricing incentives 
change when it bundles. In an independent 
pricing game, firm 1’s best response Pj Pg;) 
satisfies, 


(1) [PC Peo) 7 ei] X1 PAC Paz), Pao) 
+ x'(P#( Pao), Pao) =0. 


By contrast, when firm 1 bundles and sets 
price P, the demand for its bundle is given 
by x'(P — y, Pg) and its best response to 
firm 2’s price Pg; by P*(Pg;) such that 


(2) [P*(P5;)—c, E €i] 
X xi( P*(Pg;) — y, Ps2) 
t x!(P*(P5,) — y, Paz) = 0. 


Note first that if y=c,, then P*(P,.)= 
P&(CPg;)- y. However, if y » c4, then at 
P = Pg (Pg) y the left-hand side of (2) is 
strictly negative. Thus, it must be that 
P*(Pg5) < P#(Pp2)+ y: firm 1’s optimal ef- 
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fective price for good B1 is lower under 
bundling than under independent good pric- 
ing. The reason is straightforward: when 
firm 1 is bundling, in order to make prof- 
itable sales of its monopolized product, good 
A, it must also make sales of good B1. This 
leads it to cut price in an effort to take sales 
away from firm 2, an effect I call "strategic 
foreclosure.”’ The effect on the equilibrium 
can be seen in Figure 1, where the equilib- 
rium effective price for B1 and actual price 
for B2 both fall as a result of firm 1’s 
bundling, thereby lowering firm 2's profits. 
Thus, by committing to tie by producing 
only a bundle, firm 1 may make continued 
operation unprofitable for its tied good ri- 
val. This point emerges particularly clearly 
in the following simple example, which is a 
limiting case of the above model. 


"Jean Tirole has pointed out a nice analogy to 
situations in which firms can invest in cost reduction. 
Here, by bundling, firm 1 can incur an "investment 
cost" of (y — c4) (the lost good A sales) but there- 
by lowers its effective marginal cost in market B by 
(y ~ c4). This lowering of marginal cost makes firm 2 
more agressive in market B. As noted in Drew Fun- 
denberg and Tirole (1984) (see also Tirole, 1988), with 
price competition (Strategic complements) and entry 
deterrence/exit inducement, firms overinvest in cost 
reduction relative to what they would do absent this 
strategic effect (a “top dog” strategy), a comparison 
analogous to my commitment versus no commitment 
games. 
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Example 1. Suppose that all consumers view 
products B1 and B2 as perfect substitutes 
with value v, that (cg, — cg.) » K, > 0 and, 
to focus attention on firm 2’s activity deci- 
sion, that K, — 0 (this could be a situation 
of entry deterrence where only firm 1 has 
already sunk its market B set-up costs). 
Then the subzame perfect equilibrium out- 
come of the independent pricing game has 
firm 2 being active in market B, making all 
sales in that market, and earning profits 
of (cg, — cp) - K,>0.8 By contrast, if 
(cg; — cg) tly — c4) » 0 and firm 1 com- 
mits to bundling, firm 2 earns zero if it is 
active, and so the unique equilibrium out- 
come involves firm 2 being inactive and firm 
1 extracting all of the consumers’ surplus. 

a 


Note that if both firms are active, firm 1’s 
profits are also lower in the bundling regime 
than under independent pricing. This is true 
because bundling not only loses some prof- 
itable sales of good A, but also causes firm 
2 to lower its price? Thus, in this model, 
firm 1 would never commit to tying unless 
this would succeed in driving firm 2 out of 
the market.!? 


5] am ignoring subgame perfect equilibria in which a 
firm prices below cost and makes no sales. These 
equilibria involve weakly dominated strategies and can 
be formally eliminated here through the use of R. 
Selten’s (1975) notion of trembling-hand perfect equi- 
libria (formally one examines discrete approximations 
to the game considered in the text where prices must 
be named in some discrete unit of account). 

?Note that these lower profits can potentially force 
firm 1 to exit should it commit to tying and have 
positive fixed costs in market B (if it believes that firm 
2 will be active). If product A is very profitable, how- 
ever, this effect is unlikely to occur. 

10One special feature of this model, however, is that 
firm 2 can “conzede” to firm 1 only by fully withdraw- 
ing from the market. In other models in which conces- 
sion can be partial, this need not be true. For example, 
if market competition is of the form described in David 
M. Kreps and Jose A. Scheinkman (1983) (product 
production followed by output constrained price com- 
petition), ther. firm 2 will respond to firm 1's more 
aggressive behavior by reducing its production level, 
which can meke tying profitable even in the absence of 
complete exclusion. 
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When tying would drive firm 2 out of the 
market, firm 1 may or may not find it prof- 
‘ itable to do so. The advantage of tying in 
such an instance is the gain from converting 
market B from duopoly into a monopoly: 
The potential loss, however, comes from the 
fact that firm 1 will be a monopolist who 
can only offer a bundle. Thus, the presence 
of a large number of consumers who strongly 
dislike product B1 may make a commitment 
to bundling unprofitable, even when it leads 
to exclusion. 

At the same time, the welfare conse- 
quences of allowing tying in this circum- 
stance are unclear both for consumers and 
for aggregate efficiency. First, consumers can 
lose both because of the price effects stem- 

ming from the exclusion and also because 
` there is less variety available in market B. 
The price effect, however, can potentially go 
either way. The reason is that the same 
incentive to lower the effective price of good 
B1 that drives firm 2 from the market is also 
present when firm 1 becomes a monopolist 
in market B. In general, though, one should 
expect that if the gains from monopoly in 
market B are large, the standard price 
movement should be upward, making con- 
sumers uniformly worse off. The effect on 
aggregate efficiency is still less certain. This 
is due to two different common welfare 
ambiguities. First, the biases associated with 
the free entry process (Spence, 1976; Salop, 
1979; Mankiw and Whinston, 1986) imply 
that exclusion of firms does not necessarily 
reduce aggregate welfare. Second, it is 
known from the monopolistic bundling lit- 
erature (Adams and Yellen, 1976) that 
bundling in a monopoly setting has ambigu- 
ous welfare consequences. 

The following example illustrates these 
points more concretely and also helps to set 
up the discussion in Section II. 


Example 2. Suppose that a consumer of 
type d has a valuation for good Bi of vg; — 
w — ad, and that d is uniformly distributed 
on [0,1]. Assuming that we have all con- 
sumers purchasing from some firm and both 
firms making sales (so that our earlier as- 
sumptions hold in the relevant range), it is 
straightforward to show that equilibrium 
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prices and profits (gross of fixed costs) in an 
independent pricing game are given by 


Pi=y 
Pj, = cg; + (1/3) 

x [3a; + (a; — a5) + (eg; — Cai) 
II! = (y — c4) * [1/9(2, + a5)] 

x [Bay * (e, — a5) * (eg — c5))]^ 


II = [1/9 a, F a,)] 


x [3a, —(a, re a) — (cpp "T SE 


In contrast, profits (gross of fixed costs) for 
firm 2 when firm 1 bundles are given by 


I m [1/9(a, E a;)] [Ba -(o,—- a5) 


—(€p2— Cp) - (Y =c), 


which is lower than in the independent pric- 
ing case. Note also that firm 2’s profits 
fall as the surplus associated with good A, 
(y— c4), rises. In order to illustrate the 
other points made above, I consider three 
special cases of this model in turn: a, =0, 
a, =a, and a, = 0. 

Consider first the case where a, = 0. In 
this case, firm 1 always increases its profits 
by excluding firm 2 (that is, monopoly prof- 
its with bundling are greater than duopoly 
profits with independent good pricing). This 
is because as a monopolist firm 1 suffers no 
loss from bundling. Furthermore, the 
monopoly bundle price of (w + y) leaves all 
consumers with zero surplus. While all con- 
sumers are made worse off, aggregate wel- 
fare may rise or fall: if cp. > cg, aggregate 
welfare must rise since all consumers are 
still served, and production costs fall. When 
Cg; € Cpi the change in aggregate efficiency 
is given by AW = K, — a (cg, — Cpr)”. 

For simplicity, assume now that cp; = cg. 
When a, a, = a, the independent pricing 
equilibrium has full coverage of market B 
whenever w > cg - (3/2)a. For simplicity, I 
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also assume that (2/2) «(y — c), which 
implies that firm 1 will always sell its bundle 
to all consumers when it is a monopolist 
(the qualitative results in the other case are 
similar) In that case, firm 1’s price and 
profits are given by 


P°=wt+y-a 
W=w+y—-a-cy—Cp. 


Comparison of these expressions with those 
for the independent pricing game (setting 
a@,=a,) reveals that firm 1 always gains 
from exclusion. The effective price of good 
B1 (P — y), however, falls whenever (cp + 
(3/2)a) < w € (cg +2a), so that some con- 
sumers (for example, those who were al- 
ready buying B1) are made better off in 
these cases. Aggregate consumer surplus, 
however, never rises here: with an indepen- 
dent pricing duopoly, consumer surplus is 
Ww — cg —(5/4)a, while it is (a /2) with a 
‚bundling monopolist. 

Finally, when a, — 0, firm 1 profits in an 
independent goods pricing duopoly are given 
by II? — (y — c)-- (1/3)a, (an interior so- 
lution arises whenever w>c,+(2/3)a,). 
Then, assuming again that (o,/2)« 
(y — c4), we have that firm 1’s profits rise 
from this exclusion if and only if w > cg + 
(4/3)a,. Notice that exclusion is more likely 
to be profitable as the value of monopoliz- 
ing market B rises (increases in (w — cp) 
and decreases in o) and as the competitive 
constraint that firm 2 imposes when it is in 
the market becomes more severe (decreases 
in a). E 


II. Heterogeneous Consumer Preferences 
for Good A 


The results of Section I provide two im- 
portant lessons. First, tying can be prof- 
itably used as an exclusionary device. Sec- 
ond, there may be important differences in 
the likelihood of its use, depending on 
whether a commitment to tying is possible. 
A feature of that model, however, was the 
strong assumption that all consumers have 
the same valuation of the tying good. In this 
section, I investigate the effects of relaxing 
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that assumption. Two points emerge from 
this investigation. First, a commitment to 
tying need not always result in foreclosure 
as it did in the model of Section I. Second, 
when consumer valuations for the tying good 
differ, tying can be a profitable strategy for 
firm 1 even in the absence of an ability to 
commit, and when it is, it may lower firm 2's 
profitability ia a similar manner to that ob- 
served earlier. In the following two subsec- 
tions I consider first the case of commit- 
ment and then that of no commitment. 


A. Commitment 


In the model of Section I, firm 1’s com- 
mitment to offering only a bundle lowered 
firm 2’s sales because it created an incentive 
for firm 1 tc price more aggressively, lower- 
ing its bundle price below P4(Pg)- y 
(strategic foreclosure). More generally, when 
consumers have heterogeneous preferences 
for good A, the impact of tying on firm 2's 
profits can be determined by asking whether, 
at the bundle price P' such that firm 
2’s sales equal its independent pricing lev- 
el (e, P' such that x?(P’—y, Pp) = 
x?(P#(Pp2), Pp2)), firm 1 has an incentive to 
lower its price further.! This will be true 
when 


B d Bundle Sales 
(3) ( — £47 Cg dP 


+ x'(P#(Pp2),Pp2) « 0. 


With homogeneous preferences for good A, 
for example, the inequality in condition 
(3) is satisfied because (P'—c4— cg)» 
PE Pg;)— cg; and (d Bundle Sales/dP) = 
«(PHP pa), Paa). 


‘Here, unlike in Section I, firm 1 may prefer to 
commit to producing the bundle plus one of the two 
goods independently as part of an exclusionary strategy 
(i.e., Lemmas Ż and 2 do not hold here). I focus on the 
case of a commitment to pure bundling here to provide 
a comparison with the result in Section I. These other 
strategies may also lower firm 2’s equilibrium profits. If 
they do so sufficiently to exclude firm 2 from the 
market, then they will actually be preferred by firm 1 to 
pure bundling, since they restrict its pricing to a lesser 
degree when firm 2 is out of the market. 


i 
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Condition (3) indicates that, with hetero- 
geneous valuations for good A, a commit- 
: ment to offering only a bundle may fail to 
lower firm 2’s profits for two distinct rea- 
sons. First, enough consumers may find good 
A unattractive (may have valuations below 
the cost of production) so that firm 1 may 
have a lower, rather than a higher, margin 
at price P’. In such a case, firm 1’s monopoly 
of good A is too weak for bundling to be an 
effective exclusionary threat in market B; 
bundling would help rather than hurt firm 
2. This effect, of course, is exactly what one 
should expect a priori. 

The second reason is a bit more subtle. 
As noted above, with homogeneous valua- 
tions, the derivative of bundle demand at 
price P' is identical to that arising in market 
` B with independent goods pricing. With 
heterogeneous valuations, however, this de- 
mand derivative can change when firm 1 
bundles, potentially counteracting the 
price-cost margin effect. The clearest exam- 
ple of this occurs in the limiting case where 
products B1 and B2 are nearly homoge- 
neous.^ Then bundling essentially trans- 
forms a nearly homogeneous market B into 
a setting with vertical differentiation (since 
all consumers value the bundle more than 
B2, but they differ in how large this valua- 
tion difference is—see, for example, Avner 
Shaked and John Sutton, 1982) and can 
thereby raise firm 2’s profits. 

The following example, which is an exten- 
sion of Example 2, illustrates these points. 


Example 3. The model considered here is 
identical to that in Example 2 except that I 
now allow there to be different possible 
levels of consumer valuations for product 
A. I assume that the distribution of y in the 
population is described by F(y) and that for 
all d, Prob(y x s|d) ^ F(s) (i.e, types are 
independently distributed across the two 
markets) In the discussion that follows, I 
assume that w is large enough so that (in 
the relevant range) all consumers purchase 
product B from one of the firms. 


Note that this requires that the K;'s are close to 
zero if independent pricing would result in a duopoly. 
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Suppose that firm 1 commits to tying by 
producing only a bundle. A consumer of 
type Cy, d) will buy the bundle if and only if 
d €x Q/ a)CP — y)— Pp]. It is useful to first 
assume that for any level of y some con- 
sumers of that type are buying from each of 
the firms ("interior equilibria"). For interior 
equilibria, equilibrium profits for the two 
firms are given by 


TI? = [1/9( a, + a)|[3a,+(a@,— a5) 
+ (6g; — cg) + (Ey — cJ) 


Il = [1/9(a, + a;)] [3a — (a; — az) 

2 si 

— (Cg. 7 Cg) -(Ev- c4)]', 

where Ey = fsdF(s). Comparing firm 2’s 
profits to its level under independent goods 
pricing (derived in Section I), we see that 
firm 2’s profits are lower in the bundling 
equilibrium as long as Ey>c,. Thus, as 
one would expect, if there are enough con- 
sumers who dislike product A (have y lev- 
els below c4), tying raises rather than low- 
ers firm 2’s profits. Relating this finding to 
condition (3), we see that bundling has no 
effect on the demand derivative, but that P' 
is now given by P'— P#(P,,)+ Ey so that 
the inequality in (3) holds if and only if 
Ey » c4. 

It is worth noting, however, that the lack 
of any effect on the demand derivative term 
in (3) relies heavily on the linearity of the 
demand structure assumed here. If bundling 
were to lower this average derivative (in 
absolute value), this would work against the 
incentive for more aggressive pricing that 
arises in this linear model. In fact, this is 
why interiority of the equilibrium is impor- 
tant for the characterization above. When 
bundling causes all consumers with some 
values of y strictly to prefer either the 
bundle or good B2, this lowers the deriva- 
tive of firm 1’s demand with respect to its 
bundle price (since none of these consumers 
are marginal). Intuitively, this effect seems 
more likely to occur when the dispersion of 
valuations for good .4 increases and the 
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Tam 1 differentiation effect). Note also that firm 
rou Bca T's profits may now rise with bundling even 

if bundling does not drive firm 2 from the 
y z3IB -al ngono <n>  DUNPCIDNP market. In fact, in this example, whenever 
TIBUND < [IND ; bundling causes firm 2's profits to rise, firm 

2 « I5 if ; : > 
y S316 -al and only if: BUND cgwo — ls profits rise as well and, further, firm 1’s 
: rise in some cases where firm 2's profits fall 

38-7 ) zu (this is shown in Appendix B). a 


differentiation between products B1 and B2 
decreases. 

To investigate this effect further, consider 
the special case where y is uniformly dis- 
tributed on the interval (y — B,y + B), 
where B X y, €g,770cg5; 70, and a,;=a,= 
a > 0. In this example, the issue addressed 
above, that some consumers may value good 
A at less than its production cost, does not 
arise (here Ey > c ,). Rather, the focus here 
is on the effects of the level of valuation 
dispersion for good .4 and the level of 
product differentiation in market B. Te- 
dious calculations (an example of which is 
provided in Appendix B) reveal that the 
effect of a commitment by firm 1 to offering 
only a bundle on firm 2's profits can be 
summarized as in Table 1. Examination of 
the condition in the lower left-hand box 
(the only case where firm 2's profits are not 
necessarily lowered by firm 1’s bundling) 
confirms that high levels of dispersion of 
valuations for good .4 and low levels of 
differentiation in market B are necessary 
for firm 2’s profits to rise when firm 1 bun- 
dles. Interestingly, though, even when a is 
close to zero, we need y not to be too large 
for this to occur (so that the incentive to 
make sales of A does not outweigh the 


P My investigation of this example is motivated in 
part by the example analyzed in independent work by 
J. Carbajo, D. DeMeza, and D. J. Seidmann (1987). 
They illustrate the differentiation effect in an example 
with homogeneous goods in market B and valuations 
for goods A and B that are perfectly correlated and 
uniformly distributed across consumers. Earlier ver- 
sions of this paper pointed out the implications of 
noninteriority for the derivative of demand in the con- 
text of a two-type (of y) example. i 


B. No Commitment 


The presence of heterogeneous valua- 
tions of product A can also cause tying to 
be firm 1’s optimal strategy. even in the 
absence of an ability to commit to this strat- 
egy. To see this more clearly, consider first 
the no commitment game analyzed in Sec- 
tion I. In that game, when both firms are 
active in market B, firm 1 selects its prices 
taking firm 2’s price as given and acting 
as a monopolist on the residual demand 
structure. Given the literature on bund- 
ling by multiproduct monopolists (Adams 
and Yellen, 1976; McAfee, McMillan, and 
Whinston, 1989), which has found bundling 
to be a profitable strategy quite generally, it 
should not be surprising that firm 1 may 
now find some form of bundling to be its 
best response to firm 2’s price choice. What 
is interesting from our perspective, how- 
ever, is that this tying strategy by firm 1 may 
have detrimental effects on firm 2’s profits 
since, when firm 1 does decide to bundle, it 
may have an incentive to foreclose sales in 
market B in a manner similar to that dis- 
cussed in Section I. 

In Whinston (1987), for example, I con- 
sidered the structure described in Example 
3 with two types of valuations for good A, 
y, and yy with yy > c4 and Prob(yj) =A. 
For this case I showed that any equilibrium 
of the no precommitment game is equiva- 
lent to an equilibrium of a game where firm 
1 is allowed to either sell A and B1 inde- 
pendently or to offer only the bundle and 


“The reader may be puzzled by this point since it 
seems that when a= 0 firm 2’s profits would always 
rise with bundling. In fact, when a = 0, the upper left 
box would have I[PUND = [INP and firm 1 making all 
sales when it bundles. 
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product A at price P, € (y,, yj]. In addi- 
tion, the equilibrium may involve firm 1 
pursuing the latter (bundling) strategy, 
though a necessary condition for this is that 
yp? €4 (see Whinston, 1987, for details). 
When the equilibrium does involve bundling 
and is "interior" in the sense discussed 
above, firm 2's profits are 


I? = [1/9(a, + e3)] 3a, - (à, — @) 


— (Cg. — €) - (1 A) (9, - c4)] 


Thus, when firm 1 does tie here, it fore- 
closes firm 2's sales in a similar manner to 
that observed earlier. Note, though, that 
firm 2’s equilibrium profits are larger here 
than when firm 1 commits to only offering a 
bundle. The reason is that when firm 1 also 
offers product A independently, it is as- 
sured of making sales of product A to all 
type H consumers regardless of whether 
they buy product B1; thus, here the incen- 
tive for foreclosure arises only from the L 
types and firm 2's profits fall only if y, > c ;. 

Though the effect of firm 1’s tying here 
may be exclusionary (firm 2, anticipating 
that firm 1 will tie, may choose to be inac- 
tive), one might argue that its motives are in 
some sense "innocent" since its decision to 
tie is never affected by the possibility that 
firm 2 might be excluded from the market. 
Such dynamic considerations, however, may 
be important even when firm 1 cannot pre- 
commit to tying. For example, if firm 2 faces 
a financial constraint that it must meet in 
order to remain active in the market (as in 
the work of J. P. Benoit, 1984; and Drew 
Fudenberg and Jean Tirole, 1986), firm 1 


2 


SThat is, we can without loss of generality restrict 
firm 1’s pricing strategy choices to one of these two 
forms. This equivalence actually holds for any market 
B structure that satisfies the assumptions made in 
Section I] (a proof of this fact is available from the 
author upon request). It is worth noting that a bundling 
strategy of this sort may not appear to be tying at all 
since firm 1 does offer to sell product A at a price that 
some consumers are willing to pay. For type L con- 
sumers, however, this offer is unattractive, putting them 
in exactly the same situation as when firm 1 offers only 
a bundle. 
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may be led to use tying in order to lower 
firm 2’s profits and increase the likelihood 
that firm 2 will be forced to exit the market, 
even when tying is not profit-maximizing in 
a static sense. 

To formalize this idea, consider a simple 
extension of the earlier no commitment 
model in which there are two production 
periods. If firm 2 decides to be active and 
incurs the set-up cost K, prior to period 1, 
it may face a financial constraint that it 
must meet after period 1 in order to be able 
to remain in the market in period 2. In 
particular, suppose that with probability 
1—8 firm 2 will not face a financial con- 
straint, while with probability 8f(II) firm 2 
will face a constraint that prohibits contin- 
ued participation if first-period profits were 
less than II and assume that f'(ID) = 0 (there 
is a diminishing marginal return to preda- 
tion). 

In this setting, what is the effect of an 
increase in 0 on the attractiveness of tying 
for firm 1? It is not difficult to see that for 
the two type example if y, >c, (and out- 
comes are "interior") then increases in 6 
make tying a relatively more attractive pol- 
icy for firm 1 in period 1 for any given level 
of Pg,. The central (and very general) idea 
is that increases in 0 make firm 1 care more 
about foreclosure relative to current profits. 

To see this more formally, let G denote 
the benefit to firm 1 if firm 2 does not meet 
its financial constraint and fix some initial 
level of 0 and Pg;. Suppose, first, that firm 
] pursues its best independent pricing pol- 
icy and that this results in a profit level for 


firm 2 of II}. Then, firm 1’s price choices 


are equal to the level that it would choose 
in the simple one production period model 
if its marginal costs of production for B1 
were cg, — 8Gf(ILPg; — c55) instead of 
Cay. Likewise, if firm 1 pursues its optimal 
bundling strategy and thereby gives firm 2 
profits of IŽ, then its prices are equal 
to those it would pick in the static game 
if its marginal cost was 


CBT Gf (T1?)( Ps; — Cg). 


Since we have seen that the optimal 
bundling strategy in the one period no pre- 
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commitment game results in lower profits 
for firm 2 then does the optimal indepen- 
dent pricing policy for any given level of cp, 
(since y; >c,), it must be that II? <T 
(that is, that bundling leads to foreclosure). 
But the envelope theorem then implies that 
a small increase in 6 raises the profits from 
the optimal bundling best response by more 
than it raises the profits from the optimal 
independent pricing best response (since the 
derivative of firm 1 profits with respect to @ 
is GF(IL,)). Thus, in this example, increases 
in @ strictly increase the likelihood that 
firm 1 will find bundling to be its best re- 
sponse (since bundling is never optimal if 
y, < c4). 


IH. Complementary Products 


I now turn to the case of complementery 
products used in fixed proportions. I first 
consider a model of fixed proportions that is 
essentially an extension of the simple exam- 
ple quoted above from Posner (1976) to the 
case where the tied good market involves 
differentiated products with scale economies 
in production and an oligopolistic, rather 
than a competitive, market structure. De- 
spite these differences, I show that Posner's 
central contention continues to hold: a 
monopolist of one component never finds it 
worthwhile to tie in order to reduce the 
level of competition in the market for the 
other component. The key point is that with 
complementary products used in fixed pro- 
portions, the monopolist can actually derive 
greater profits when its rival is in the market 
than when it is not because it can benefit 
through sales of its monopolized product 
from the additional surplus that its rival's 
presence generates (due to product differ- 
entiation). 


The two-type of y example considered here is 
special in one sense. With more general distributions 
of y, firm 1’s best response in the one period no 
precommitment game will quite generally involve 
some form of bundling (see McAfee, McMillan, and 
Whinston, 1989). In such cases, one would have to 
examine how increases in @ affected the degree of 
bundling (i.e., the difference between P and P4 + Pg). 
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Nevertheless, 1 then show that in two 
natural extensions of this model in which 
the monopol:zed product is no longer essen- 
tial for all uses of other components, tying 
once again emerges as a profitable exclu- 
sionary strategy. In one case, the presence 
of an inferior, competitively supplied alter- 
native to the “monopolized” component 
leads to results that parallel those of the 
independent products case. In the other 
case, the existence of a second use for the 
nonmonopolized product (such as a replace- 
ment part market) can give the monopolist 
an incentive to tie in order to eliminate 
competition in this other market. 

The discussion in the text focuses on the 
case of precommitment. In fact, for each of 
the models considered here, any no pre- 
commitment outcome is equivalent to an 
equilibrium of the independent pricing 
game." Of course, this is therefore also true 
when firm 1 produces A and B1 indepen- 
dently in the commitment game. In order to 
simplify the exposition, in Parts B and C 
below, I will use this fact and simply com- 
pare bundling outcomes: to the independent 
pricing game equilibria when investigating 
whether firm 1 would find a commitment to 
bundling to be a profitable exclusionary de- 
vice. 


A. The Basic Model 


Consider the following simple model. 
There are two components needed to com- 
prise a system, A and B: a system consists 
of one unit of each. As before, firm 1 is a 
monopolist of component A, and two dif- 
ferent versions of component B could po- 
tentially be available, B1 and B2. The pro- 
duction technology for these products.is as 
before. 


"The proofs of this fact for the three models pre- 
sented in this section are available from the author 
upon request. For the model of Part C, the result 
requires the use of Selten’s (1975) notion of trembling- 
hand perfection in order to eliminate the use of weakly 
dominated strategies. In Parts B and C this equiva- 
lence is a consequence of the homogeneity of valua- 
tions assumed there (as in Section D. 
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The set of consumers is the same as in 
Section I. Each consumer demands at most 
one unit of the system. A consumer of type 
d’s valuation of a system with product Bi is 
U4 pid). When goods A, B1, and B2 are 
independently priced, consumers' demand 
for an A / Bi system is given by some func- 
tion x'(P, + Ppi P4-- Pg), where xiC, ) > 
0 if i*j and <0 if i=j, with strict in- 
equalities whenever x'(,.)€ (0,1), and 
where (x}(-,-)+ x26, « 0. 

In the case of independent products we 
implicitly assumed that purchase of a pro- 
duced bundled unit allowed the indepen- 
dent use of either of the products (the proof 
of Lemma 1, for example, uses this fact). 
Though natural in the case of independent 
products, this assumption is less so when 
products must be used together. For exam- 
ple, the bundling of a stereo tuner and a 
stereo amplifier into a stereo receiver may 
not allow the buyer to use just the amplifier 
in conjunction with another manufacturer's 
tuner. Thus, here I assume that production 
of a bundled good does not allow the user 
to use only part of the bundle. 

In this model, since component A is es- 
sential to any system, firm 1 is trivially able 
to exclude firm 2 by committing to produce 
only a bundle. Nevertheless, as the follow- 
ing proposition indicates, firm 1 never finds 
it worthwhile to tie in order to exclude 
firm 2. 


PROPOSITION 3: /f a commitment to tying 
causes firm 2 to be inactive, firm 1 can do no 
worse — and possibly better — by committing 
to producing only independent components. 


PROOF: 

Suppose that firn 1’s precommitment to 
tying (by not producing one or both of the 
components individually) causes firm 2 to be 
inactive. In this case, since only firm 1’s 
bundle price is relevant once firm 2 is inac- 
tive, firm l's profits given its optimal bundle 
price of P* are 


(P*— 64 — cgi )x!(P*,). 


Suppose that firm 1 instead commits to only 
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producing components A and B1 indepen- 
dently. One pricing policy that it can always 
follow, regardless of whether firm 2 is ac- 
tive, is to set individual cómponent prices of 
Pg, 7 €g,— € (where £2 0) and P,= P*— 
Ppi- If firm 2 is inactive, this pricing scheme 
leads to exactly the same level of profits as 
did the bundling outcome. If firm 2 is active, 
however, firm 1’s profits will be at least as 
large as those in the bundling outcome since 
they are given by 


(P*— CA4-— ex eC PS Ê, T Ps) 
+ x'(P* P, + P5;)] 
t ex?(P*,P,+ Pp»), 


when firm 2 names price P5, (since x'(-,-)+ 
x?(-,:) weakly increases when prices fall 
and x?(-,-) is nonnegative). o 


The basic idea behind this result is fairly 
simple to see. If firm 2 did not exist, firm 1 
could do as well as it does through bundling 
by setting independent prices that had com- 
ponent B1 priced at or below cost and 
component A’s price set at a high level; it 
would simply earn all of its profits on sales 
of component A (consumers' purchases de- 
pend only on the sum of the prices). But, if 
pricing in this manner leads firm 2 to be 
active, this can only raise firm 1’s profits 
since firm 1 would then sell more compo- 
nent A's (on which it makes profits) and 
fewer component B's (on which it has a 
negative margin). Intuitively, firm 1 is able 
to benefit through sales of its product A 
from the increase in surplus generated by 
firm 2’s presence. - 

While firm 1 never gains from committing 
to tying here if this forces firm 2 to be 
inactive, firm 1 may commit to tying in or- 
der to price discriminate. For example, sup- 
pose that some set of consumers get positive 
benefits only out of an A /B1 system, while 
the remainder get positive benefits only out 
of an A/B2 system, and that the latter 
group’s valuation of its desired system is 
much higher. Then firm 1 will want to set a 
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very high price for good A in order to 
extract surplus from this latter group, and a 
very low price for good B1 in order to get 
an optimal .A / B1 system price for the for- 
mer group. If this attempt hits the nonnega- 
tivity constraint on P5, however, then firm 
1 will find it worthwhile to tie by offering a 
bundle with price P < P,.! 

Firm 1’s lack of desire to use tying as an 
exclusionary device can change dramati- 
cally, however, when firm 1’s monopolized 
component is not essential for all uses of 
product B2. I now consider two natural 
extensions of the above model in which 
tying can prove to be not only an effective 
exclusionary device but also a profitable one. 


B. An Inferior, Competitively Supplied 
Component A: Strategic Foreclosure 


Suppose that there exists a uniformly in- 
ferior, competitively supplied alternative to 
firm 1’s product A, denoted as product A2 
(henceforth, firm 1’s product A will be de- 
noted by A1). The cost of component A2 is 
also c,, but compared with the valuations 
described above for A1/B1 and A1/B2 
systems, a consumer’s valuation for a system 
that has product 42 in it rather than 41 is 
y — c4) lower (i.e., v45, 5 (d) = v4,,p/(d)— 
(y — c4)) where y » c,. 

Consider, first, the independent pricing 
game (which, as noted above, yields an out- 
come identical to what occurs if firm 1 pro- 
duces A and B1 only independently). In 
this game, firm 1 always sets P4, < y and 
makes all component A sales. When firm 1 
sets P4, < y in this equilibrium, the inferior 
alternative (product A2) is irrelevant for 
pricing and profits. In the case where 
P4, = y, however, the presence of the infe- 


This point is analogous to the observation that an 
upstream monopolist may wish to integrate vertically 
forward into one of the industries that uses its product 
in order to achieve price discrimination across users 
(see, for example, Tirole, 1988, p. 141). Note that 
bundled production is essential for this purpose since 
otherwise the second set of consumers would buy the 
bundle to get their component A whenever the bundle 
price was lower than the price of good A alone. 
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rior product A2 constrains firm 1’s equilib- 
rium pricing and profits. This could mean 
that, contrary to Proposition 3, firm 1 would 
prefer to have firm 2 out of the market (firm 
1 can no longer necessarily benefit through 
its component 41 sales from the surplus 
created by the presence of firm 2).? Exam- 
ple 4 illustrates this point and shows how 
the presence of component A2 can make 
competitive interaction here look very much 
like the independent products case consid- 
ered earlier. 


Example 4. Suppose that v4,/5(d) - w — 
ad, c,» 0 and cg,— cg; = cg > 0, and that 
w>2at+c,+C, (to ensure that all con- 
sumers buy a system; note the parallel to 
Example 2). Ignoring the constraint im- 
posed by the presence of product .42, the 
independent pricing equilibrium level of P4; 
is increasing in w. When w» y cg 
(3/2)a, the unique equilibrium involves 
prices of. P?=y and PR - Pb; -cg-a, 
and all consumers receive positive surplus 
(see Figure 2). Profits (gross of fixed costs) 
are given by 


Hj =(y- c4) +(a/2) 
TIS = (a /2). 


Note that this equilibrium essentially repli- 
cates the independent goods outcome from 
Section I] (Example 2 with a,=a, and 
Cgi 7 Cg). That is, the presence of a com- 
petitive constraint from product AZ serves 
to “uncouple” the two component markets. 
As in the independent products case, if w is 
large, firm 1’s profits are increased by firm 2 
being inactive (firm 1 then acts as a systems 


The fact that the presence of an inferior competi- 
tively supplied product A can potentially prevent firm 
A from deriving maximal (two-product monopoly)’ 
profits has also been noted in Ordover, Sykes, and 
Willig (1985). : 

More precisely, though multiple equilibria exist in 
the game when component 42 does not exist (corre- 
sponding to a range of values for Pg, and Pg; that is 
independent of w), in any such equilibrium the level of 
P4 is given by Pf —w-(1/21a +c -3PAT. 
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w-y= reservation value 
for compgnent B 
given PA -=y 


(excluding costs 
ad and a(1-d)) 


o 
Py2* a(1-d) 





d-0 


d-l 


FiGURE 2 


monopolist setting an 41/B1 system price 
of w).2! 


Now consider the commitment game. 
When firm 1 would prefer firm 2 to be out 
of the market, can a commitment to tying by 
firm 1 force firm 2 out of the market (note 
that firm 1’s component 41 is not essential)? 
The answer is yes, and for the same basic 
reason as in Section I: when it is only able 
to sell a bundle, firm 1 can only gain its 
profits from component A1 if it also sells 
component B1; this causes firm 1 to fore- 
close sales in the component B market. To 
see this formally, suppose that firm 1 can 
only produce a bundle. When the presence 
of product 442 constrains firm 1’s pricing 
in an independent pricing game (so that 
P4, y), firm ls price for component B1 
given firm 2's price Pg, is given by P3\( Pp.) 
such that 


(4) ([ Pi( Peo) — es] xi (v + Pj, y + Pao) 
xl(y- Py + Pg;)) 
+(y—ce,4)[xi(y + PA. y + Ppa) 


+x?(y + P#,y + P5;)] =0. 


?'Unlike the independent products case, firm 1 suf- 
fers no loss from bing restricted to bundle when it is a 
monopolist. Rather, here the cost of exclusion of firm 
2 is that firm 1 is unable to capture any of the sur- 
plus created by firm 2 (through firm 1’s sales of compo- 
nent A). 


The first term of this expression represents 
the effect on sales of component B1 of 
marginally changing P,,, while the second 
is the effect on sales of component 41. 
Note that this second change is due to the 
total change in system sales. In contrast, 
when firm 1 commits to bundling, its opti- 
mal bundle price given firm 2's price, P5;, is 
given by P*(P,,) such that 


(5) [P*(P5;) ^ c4 — cg] xi (P*, y + Poo) 
t x!(P*,y + Py.) - 0. 


At P=P#(P,.)+y, this expression be- 
comes 


(6) ([PACP22) - esi] xily + PA v + Pao) 
+x! (y+ Pf y + Pa;)) 


t(y-ca)xi(v + Py + Pg;). 


Note that x?(-,-) does not appear in the 
second term of (6). This represents the fact 
that when firm 1 bundles, only by increasing 
sales of A1/B1 systems does it increase the 
sales of component 41. Firm 1 is therefore 
led to set P*( Pp.) < Pg Pg) + y, foreclos- 
ing sales in market B and lowering firm 2's 
profits.7 Thus, by committing firm 1 to 


?^This incentive for foreclosure is similar to the 
effects studied in Carmen Matutes and Pierre 
Regibeau (1986). They study product compatibility in a 
symmetric duopoly and identify a collusive incentive to 
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“strategic foreclosure,” tying can exclude 
firm 2 from market B and thereby raise firm 
1’s profits.” 


Example 4 cont. If firm 1 commits to only 
producing a bundle, firm 2’s equilibrium 
profit when both firms are active is given by 


y-c4Y 
z. 
which is lower than its profit under inde- 
pendent pricing. Note that if firm 1 bundles 
and forces firm 2 to be inactive, all con- 
sumers receive zero surplus here (although, 


as usual, aggregate welfare may either fall 
or rise). " 





TI} = max{0,(1/2a)(a ~ 


C. An Alternative Use for Product B: 
Direct Foreclosure 


. Next, consider an alternative variation in 
the basic model. Suppose that there exists 


have compatibility. This corresponds to the "puppy 
dog" strategy in the Fudenberg and Tirole (1984) tax- 
onomy for the case of accommodation under price 
competition, in contrast to the “top dog" strategy that 
I focus on here (see fn. 7). 

The discussion in the text has only compared 
producing A and B1 independently with producing 
only a bundle. One might wonder about other alterna- 
tives. In fact, it can be shown that as long as the sort of 
price discrimination motivation discussed in Part A is 
not present, any of the other alternatives are either 
equivalent to independent pricing (Bundle and A1; 
Bundle, A1, and B1), equivalent to producing only a 
bundle (Bundle and B1), or clearly inferior to these 
options (41 only, B1 only). 

^A long-standing issue in the legal treatment of 
tying is when to treat the tying and tied products as 
distinct products. A common argument is that the tied 
product must be one that consumers might want to 
purchase separately, without also purchasing the tying 
product. In Justice O'Connor's concurrence in Jeffer- 
son Parish Hospital District No. 2 v. Hyde, 466 U.S. 2 
(1984), for example, she argues this position because 
*When the tied product has no use other than in 
conjunction with the tying product, a seller of the tying 
product can acquire no additional market power by 
selling the two products together." The model analyzed 
here illustrates that this view is incorrect unless. one 
defines "other uses," contrary to Justice O'Connor's 
meaning, to include use with other producers’ compo- 
nent A’s. 
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an alternative use for component B that 
does not rely on the simultaneous purchase 
of componen: A. One example of such a 
use is a replacement parts market for exist- 
ing owners of a system who need to replace 
only component B. Because component A 
is not essential for the use of product B in 
that market, firm 1 is not able to benefit 
from firm 2’s presence in this market 
through sales of good A and the logic of 
Proposition 3 therefore breaks down. Firm 
1 may now find it worthwhile to exclude 
firm 2, if it can, in order to monopolize this 
other market for product B. Furthermore, 
because component A is still essential for 
certain uses cf product B, firm 1 may have 
the means to accomplish this end: by offer- 
ing to sell compoent A only in a bundle 
with component B1, firm 1 directly fore- 
closes firm 2's sales in the joint use market 
(foreclosure of these sales is complete re- 
gardless of firm 1’s bundle price), which 
may drive firm 2’s profits below the level 
that justifies its continued operation. The 
following simple example illustrates these 
points. 


Example 5. Suppose that there are two types 
of consumers. Type I consumers desire a 
system. There are a continuum of type I 
consumers indexed by the uniformly dis- 
tributed variable d &€[0,1] with total mea- 
sure 1. Consumer d has valuations for the 
two possible systems of v, (d) = w:d and 
U4 ,pY(d) = w'd + y,. Type II consumers, of 
which there are a total measure of 6, only 
desire product B. Each type IÍ consumer 
has valuations for products B1 and B2 of 
Ug; ^ and Ug, — q + y2. The firms are un- 
able to discriminate (in a third degree sense) 
across these consumers in their pricing. The 
cost structure has c4 > 0, cg, = Cpa = cg > O, 
K,>0, and K,=0. Finally, I make two 
further assumptions: 


(Al) (1+ 9)¥2>¥1> Y2 


and 
(A2) w> max(Ay; — yı * C4 Cp. 


yt cA eg). 
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Consider, first, the outcome of the inde- 
pendent pricing game. The unique equilib- 
rium outcome when both firms are active 
involves prices of 


Pj; = Cp 
Ph = Cg t Ya 


P2-[w t (yi— y2) * c4— cg]/2. 


In this equilibrium, all consumers buying a 
component B buy product B2, and profits 
for the two firms are given by 


TI? = [w+(¥1—- y3) —c4— cg] /4w 


W-t(yi-y2) 7C4—6p 

II9 = y; 0+ — — ——— — — | - kK. 
27 Y2 Jw 2 
Suppose, instead, that firm 1 commits to 

producing only a bundle and product B1 

alone. In this case the unique equilibrium 

prices when firm 2 is active are given by 


Ps = 65 
Pg, = Cpt y 
P? =(w+ce,+c,)/2, 


and profits are 


Thus, by committing to tie, firm 1 denies 
firm 2 its profitable sales to type I con- 
sumers, lowering firm 2’s profits, and possi- 
bly forcing firm 2 to be inactive. Further- 
more, if tying does force firm 2 to be 
inactive, firm 1’s profit is ((w — c4 — cp) / 


57 am ignoring equilibria here that involve firm 1 
pricing its component B1 below cost and making no 
sales. As earlier, these equilibria involve the use of a 
weakly dominated strategy by firm 1 and can be elimi- 
nated through the use of Selten’s (1975) notion of 
trembling-hand perfection. 
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4w)+ 6(o — cp), which is larger than its in- 
dependent pricing profits if o, the gain from 
monopolizing the type II market, is large.” 


. Finally, if firm 1 does exclude firm 2 in this 


manner, all consumers are made worse off 
here, although aggregate welfare may either 
fall or rise. E 


IV. Conclusion 


The above results demonstrate, in my 
view, that the leverage hypothesis can be 
formally modeled in a coherent and appeal- 
ing way. Once one allows for scale 
economies and strategic interaction, tying 
can make continued operation by a 
monopolist’s tied market rival unprofitable 
by leading to the foreclosure of tied good 
sales. As the models above have indicated, 
such a strategy can be a profitable one for a 
monopolist, often precisely because of this 
exclusionary effect on market structure. 

While the analysis vindicates the leverage 
hypothesis on a positive level, its normative 
implications are less clear. Even in the sim- 
ple models considered here, which ignore a 
number of other possible motivations for 
the practice, the impact of this exclusion on 


©The reader may be wondering about other alter- 
natives available to firm 1. A commitment to producing 
A only B1 only, or just a bundle is worse as an 
exclusionary strategy for firm 1 than committing to 
produce the bundle and B1 since firm 2's profits are 
higher when it is active under these strategies than 
when firm 1 commits to produce the bundle and B1, 
and firm 1’s profits are lower under these options if 
firm 2 is inactive. They also are less attractive as an 
accommodation strategy for firm 1 than independent 
production of A and B1. Producing A, B1, and a 
bundle yields an outcome equivalent to the indepen- 
dent production outcome (restricting attention to trem- 
bling-hand perfect equilibria). Finally, producing a 
bundle and 4 is less effective as an exclusionary strat- 
egy than producing a bundle and B1 (it gives firm 2 
higher profits if it is active and firm 1 lower profits 
when firm 2 is not active), and when firm 2 is active, no 
pure strategy (trembling-hand perfect) equilibrium with 
this product offering can give firm 1 higher profits than 
when it produces A and B1 independently. However, 
a pure strategy equilibrium may not exist here. A 
sufficient condition for a pure strategy equilibrium to 
exist is that y; «(o — cg). Thus, when this condition 
holds, firm 1 can effectively limit itself to the two 
options considered in the text. 
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welfare is uncertain. This fact, combined 
with the difficulty of sorting out the lever- 
age-based instances of tying from other 
cases, makes the specification of a practical 
legal standard extremely difficult. 

Finally, it should be noted that the lever- 
age debate is not limited to the practice of 
tying, but rather arises in numerous areas of 
antitrust analysis. With the practice of re- 
ciprocity, for example, a monopsonistic 
buyer of some product refuses to buy from 
his suppliers unless they also buy a product 
(in which he may face competition) from 
him. Alternatively, when a vertically inte- 
grated monopolistic input supplier can sell 
his input to both his own downstream man- 
ufacturer and to a rival manufacturer, a 
refusal to supply this rival manufacturer is 
similar to the tying of complementary goods. 
The results here raise the possibililty that 
the use of leverage as an effective and prof- 
itable exclusionary device, could arise in 
these other settings as well.”” 


APPENDIX A 


PROOF OF LEMMA 2: Suppose, first, that firm 2 i R 
active and that the equilibrium prices are ((P9 o 

P9; P2,). There are two cases to consider. If P? < P}, 
+y, then all consumers prefer the bundle to buying 
only good B1 from firm 1 (again, for expositional 
reasons, I assume here that consumers buy the bundle 
when they are indifferent). If so, then firm 1 selling 
only the bundle at price P? generates identical sales 
and profits for both firms for all P5. 1f P°> PD, +y, 
then it must be that firm 1 is making no sales since 
otherwise it could do better by setting P = P, + y (it 
would make exactly the same number of sales of the 
bundle as it did of B1, but at a larger margin since 
y? c4). In this case, firm 1 selling only the bundle at 
price PR, + y generates identical sales and profits for 
both firms when Pg; = Pĝ, and for firm 2 for all Po. 


?"In fact, the models analyzed above can frequently 
be reinterpreted to apply to these other settings. Some 
differences do arise, however, in modeling the varicus 
practices. For example, the extent to which commit- 
ment is possible is likely to vary by practice. Also, in 
the case of vertical integration discussed in the text, 
the upstream monopolist sells his component (input) to 
the unintegrated final goods producer, who then sets a 
price for the entire system (finished good) rather than 
selling directly to consumers who put a system together 
themselves. Furthermore, in such a setting, more com- 
plicated wholesale contracts may be possible than the 
simple linear pricing considered here. 
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A similar argument holds if firm 2 is not active. There- 
fore, any perfect equilibrium outcome (including deci- 
sions regarding zctivity in market B) is equivalent to 
one that arises after firm 1 has committed to producing 
only the bundle. 


PROOF OF PROPOSITION 2: The argument is a 


simple comparative statics exercise. Letting $ = P, — y, 
firm 1’s problem, given Pp;, can be written 


max[($ — cg) t Qr 7 c4)]: x! (0, Poa) 


The bundling equilibrium is then characterized by the 
following two equations, which have a unique solution 
(with positive sales by both firms) under our assump- 
tions. 


[(6**- c) +(x- 0 ]1xCo**, PR) 
+2'(6", Pi) =0 
(PR — cna) xi (**, PR) + x* (o**, Pj) = 0. 
Note that if y — 4, then ($**, Pjs*) = (Pei, Pip), the 


independent pricing equilibrium. Now define (omitting 
arguments of functions): 


A= 2x1 - [(6**- es) t (v - c)]xhi 


B e 2x5 + (Pi* — cg) x30 


C 2 xi xp[(9** - 08) € (v ca] 
Dzxttx$(PE* 


— €g2). 


The assumption that P'(P,;)€(0,1) implies that 
(4, B) «(- C, — D) « 0. This then implies that 


. dó** 
sign i 





= sign{ - Bri} <0 





dP 
sign 7 = sign{ Dx]) <0, 
so that both firms’ profits fall relative to the indepen- 
dent pricing equilibrium. a 


APPENDIX B 


Here I work aut the example for the cases where 
B =a. In this class of cases, the division of consumers 
between the two firms can be represented diagrammat- 
ically as in Figure 4. Consider first equilibria that are in 
region (i), that is, that satisfy P «(y — B) (Pg; + a). 
In.this region, the first-order conditions for the two 
firms are as follows (these conditions are sufficient for 
a maximum since at any point where these conditions 


bi 
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Indifferent consumers when 
bundle price is 


P = (7+8) + (Paga) 


Indifferent consumers when 
bundle price is 


P = (M) (Byg-2) +€ 


Indifferent consumers when 
bundle price is 


P = (7-8) + (Pyote) 


Pg2*? 


Reservation Values for Good Bl Induced by Pa 


FIGURE 3 


hold, the firms’ profit functions are concave): 
Firm 1: 4aB-(1/2)[P - (y - 8) - (Pg - e] 

- PEP - (y- B) - (Pg; - a)]- 0 
Firm 2: [P-(y- 8)- (£g; - a)] -2Pg; - 0. 


From firm 2’s first-order condition we see that we are 
in region (i) if and only if Pg, x a. Solving the two 
first-order conditions for Pg; yields the following ex- 
pression: 


—8(Pg5)^ * 2[a — (y — B)] P52 * 4aB — 0. 


This expression is strictly concave and is nonnegative at 
P5; — 0. Hence, P5; < a if and only if the value of this 
expression is nonpositive at Pp; = a. Substituting yields 
the requirement that y = 3(8 — o). Firm 2's profits in 
this region under bundling are (1/2a8)(P52)? com- 
pared with its profits of (æ /2) under independent 
goods pricing. Since Pg; «a in this region, firm 2's 
profits must fall. : 

Consider now bundling equilibria that fall in region 
(ii), that is, where P &((y — B) (Pg; +a), (y+ B 
(Pg; — a). Straightforward analysis of the firms’ 
first-order conditions reveals that in equilibrium we 
must have 3P5; = 38 — y. In addition, to be in region 
(ii), Pg; must satisfy 28 — a > Pg; > a, or substituting 
for Pg;: 3(8—a)z y z Xa — B). The first of these 
inequalities is just the reverse of our region (i) condi- 
tion, while the second, which assures that we are not in 


region (iii), is always satisfied since B > a (in fact, the 
bundling equilibrium can never be in region (iii)). Firm 
2’s profits under bundling in this region are given by 
(1/2B)(Pg2)* compared with (æ /2) under indepen- 
dent goods pricing. Substituting for Pj; yields the 
condition in the text. Firm 1’s profits under bundling in 
this region are given by 


II = P(1- (1/28)[P - (v - 8)- P5;]) 


38tyY(1 v. 
ise — T — 
(lates) 
while under the parameter values of this region its 
independent goods pricing profits are given by 





+B 
2 





ems ) 0729. 


Bundling then yields firm 1 larger profits than indepen- 
dent pricing (assuming that firm 2 remains active) if 
and only if 








(= 


6 6 Jaap. 


But the expression on the left side of this inequality is 
strictly larger than (38 — y)/3)?, which implies that 
whenever firm 2’s profits are higher under bundling, so 
are firm 1’s. There is also clearly an area of the 
parameter space where firm 1 is better off and firm 2 is 
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worse off under bundling compared to independent 
‘ goods pricing. 
The analysis of cases where a> proceeds in a 
similar manner. 
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The Economics of Product Patents 


By MicHAEL WATERSON* 


This paper develops a model of product patents in which the patent changes the 
nature of market entry behavior rather than preventing entry entirely. The model 
incorporates three levels of action, namely, the innovator's patenting decision, the 
potential entrants location decision, and possible court action. The analysis 
demonstrates how the characteristics of the patents system and the enforcement 
framework can influence rivals' variety choices and thus the market equilibrium. 
It also considers how the system can in principle be adjusted to improve social 


welfare. (YEL 621) 


The purpose of this paper is to reexamine 
the question—what does a patent do? Cer- 
tainly it acts as a deterrent to entry, as the 
established literature suggests, but more 
` precisely a patent on a product makes entry 
into the market by “close” substitutes costly. 
It prevents (or discourages) anything that is 
sufficiently close from being produced but 
does not foreclose a whole area in most 
cases. One might think of the case of a new 
child’s toy, a new car safety restraint, or a 
new type of bottle opener. Here there is 


*Department of Economics, University of Reading, 
Reading, RG6 2AA, England. I should like to thank 
Yannis Katsoulakos, Katrien Kesteloot, Sumner La 
Croix, Ian Motho, Norman Strong, participants at the 
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Research in Industrial Economics in Berlin, at an 
ESRC Industrial Economics Study Group at the Uni- 
versity of Sussex, at staff seminars at the Universities of 
Manchester, Melbourne, and York, Murdoch Univer- 
sity Western Australia, the AGSM at the University of 
New South Wales, and the RSSS at the Australian 
National University and, last but by no means least, the 
Review’s anonymous referees, for helpful comments 
and suggestions at various stages of preparation of this 
paper. 

'The classic welfare analysis of patents appears in 
W. D. Nordhaus (1969). Some more recent studies 
have adopted a deterrence framework, in which patents 
provide a specific perpetual monopoly (see, for exam- 
ple, Richard Gilbert and David Newbery, 1982; Jen- 
nifer Reinganum, 1983; John Vickers, 1985), but these 
papers, and also those considering “patent race” issues 
(for example, Gene Grossman and Carl Shapiro, 1987), 
are largely concerned with the evolution of concentra- 
tion over time and the question of whether monopolists 
persist. 
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potential competition between the product 
that is patented (that suits some people very 
well, others not so well) and other products 
in this area. Thus we argue that the main 
impact of a product patent is not to create a 
monopoly but rather to affect the variety 
choices that rivals make. Moreover, the par- 
ticular impact on variety choices is heavily 
influenced by the particular legal mecha- 
nisms that are used to enforce patent rights. 
The paper provides a simple framework for 
analyzing these variety issues, specifically 
addressing how patents affect the equilib- 
rium number and nature of products pro- 
duced, and whether it is socially desirable 
that they have these effects. 

One major feature of the present paper is 
that the patent issued provides incomplete 
coverage for the patentee’s idea in a specific 
sense—coverage is limited but inexact, and 
it can in principle be tested in the courts. In 
an important complementary theoretical 
analysis, Ignatius Horstmann, Glenn Mac- 
Donald, and Alan Slivinski (1985) assume 
limited but exact coverage.” In the jargon of 
patent law, they assume a very strict 
“fencepost” rather than a “signpost” system 
of patents. With a true fencepost system, 
there would be no need to refer to the 
courts over questions of interpretation, but 
in practice, court cases occur. And, .as 


There are also some links between this paper anda 
recent welfare analysis of patent width by Paul Klem- 
perer (1990). 


VOL. 80 NO. 4 


Richard Levin (1986, p. 199) says, “There is 
no theoretical presumption that improving 
appropriability is desirable." This is one of 
the issues we examine below within our 
more fully structured version of the legal 
process. Our modeling approach enables 
predictions regarding changes in the legal 
system to be generated. 

We choose to analyze the case of product 
patents because the idea of imperfect ap- 
propriability arises naturally in a product 
setting and because it enables us to raise 
interesting and largely unexplored issues re- 
garding the effects of patents upon product 
locations. Admittedly, not all product 
patents fit within the precise framework 
adopted here, but this point is explored in 
an appendix (available from the author on 
request). Also, despite the focus on product 
patents, at least some of the issues raised 
here carry over to the case of patents on 
processes. 

Section I sets out the model and develops 
its positive predictions. Section II develops 
some social welfare points through the 
medium of a specific example, and Section 
III contains a few concluding remarks. 


I. The Model 


Our model of patenting, like those cited 
in fn. 1, is rather stylized in its modeling of 
the inventive process. Firms are aware of 
what they will end up with; there is little 
uncertainty about the technology or de- 
mand.’ All agents are assumed to maximize 
expected utility /profit arising from their ac- 
tions. There is no dynamic learning process 
involved in producing new products. And 
the model largely abstracts from any strate- 
gic pricing behavior by the firms. We make 
this simplification in order to focus on 
strategic considerations of greater immedi- 
ate interest. Finally, for simplicity, we have 
only two firms in the game, each of which 
produces only one product. Thus, our game 
is perhaps best thought of as a microcosm 


?]n relation to uncertainty, the Horstmann et al. 
paper is clearly an advance on most previous work. 


WATERSON: ECONOMICS OF PRODUCT PATENTS 861 


of a broader contest in which several firms 
compete, each with an array of products. 

We make the following specific assump- 
tions. 

(i) The products in question are at partic- 
ular points on a one-dimensional line of 
finite length 2x, with customers evenly 
spaced along the line. The farther a cus- 
tomer is from a product, the higher is the 
“delivered price." This is most directly 
thought of as users obtaining lower con- 
sumer surplus at any price (F. M. Scherer, 
1979) but might also fit with the firm in 
question having to incur additional promo- 
tional costs in order to make more sales, or 
more extensive product development (dif- 
ferent types or styles), for instance. 

(ii) The gross profits obtained by the two 
firms are concave functions of the extent of 
the market that is served to each side, given 
price, and for simplicity, the prices set are 
such that the entire market will be served. 

(ii) The form of the patent naturally 
bounds the total market area in question 
(previously latent demand) by the broadest 
claim of the potential patentee. Thus the 
first firm, J, irreversibly selects a market 
area and a product location, y. Absent en- 
try, it earns profits (gross of any patent fee) 
that are written II(y,2x — y), as a function 
of the market areas, y and 2x — y to each 
side, over which its product is sold. (Notice 
price is implicit in the profit function.) 

(iv) Entry is assumed to involve sinking 
substantial costs," so it is irreversible and 
therefore not lightly undertaken. Assuming 
the potential entrant E locates 28 units to 
one side of J (0€0 x y/2), by convention 
the left, and that both firms set the same 
price, then J and E sell to distances to left 


^The structure of UK patent applications, for exam- 
ple, is that they commence with a broad claim, then 
successively narrow this down in subsequent claims, as 
they proceed to a finer definition of the precise article. 
The courts are more likely to hold narrow claims as 
valid, which is what prompts our modeling framework 
for the legal process developed below. 

In order to economize on notation, in this section 
we assume that entry involves sinking costs equal to 
those the incumbent has incurred. This is not a mate- 
rial assumption, and it is dropped in Section II. 


862 THE AMERICAN ECONOMIC REVIEW 


“delivered 
price" 





y "distance" —® 2x 


Figure 1. Tug Basic FRAMEWORK: / Is THE 
INNOVATOR AND E Is THE ENTRANT 


and right of (6,2x — y) and (y —20,0), re- 
spectively. Thus the scene in product space 
is as pictured in Figure 1. 

(v) For much of the paper, we shall as- 
sume that, should firm J decide not to 
patent, the entrant will locate so that 0 = 0, 
as a result of the centripetal tendencies 
created by the finite market length. (This 
assumption is reexamined later.) 

(vi) If the first firm patents, it incurs a 
nontrivial cost p (essentially, the legal fees 
plus a patent agent’s charges), but benefits 
from the fact that @>0. Since no further 
entry is envisaged, the potential entrant 
need not patent. 

(vii) If J takes E to court over violation 
of patent rights, there are probabilities that 
I will win and E will lose with given expen- 
ditures on representation in court, and asso- 
ciated with the outcomes are payoffs. Fol- 
lowing the action, both may remain in the 
market, or alternatively only 7 may actually 
be allowed to continue producing. However, 
it is still sensible to think of the payoffs in 
expected value terms, given risk neutrality 
on the part of the players. To put some 
structure onto this, we assume the in- 
cumbent receives expected net benefits of 
C6) = A(8) — K((8) from court action, 
while the entrant expects C,(@)=— A(8) 
— K.(8X < 0), A being a nonnegative trans- 
fer of damages and K; and K, being dead- 
weight legal costs to the parties. Then the 
respective payoffs are 


[II(6, x) — p+C,(6), 
I(x —26,0) + C,(0)]. 


It is natural to assume that C; «0 with 
C,(@) > 0 for 6 small, but C,(@) < 0 for large 
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8 due to negligible damages in this case. It 
also seems reasonable to assert that Cj > 
0,C/ <0,C,(0)< —II(x,0); that is, court 
costs increase at an increasing rate as E 
moves toward /(8 0), and direct copying 
is unprofitable. 

The sequence of events is as follows. Firm 
I selects its location and decides whether to 
patent. Firm E then decides whether, and if 
so where, to enter. If J did not patent and 
E wishes to enter, then 7 cannot prevent 
entry. If J has patented, then after E has 
entered, J has the option of taking E to 
court over patent violation. It will save time 
if we note at this stage that firm J’s equilib- 
rium location is at x, the center of the 
market and analyze subsequent decisions 
with /’s location given. 

In analyzing 7’s and E's decisions based 
upon the model outlined above, we utilize 
the now-standard subgame perfect equilib- 
rium concept. The game tree corresponding 
to this model is represented in extensive 
form in Figure 2. It is clear that there is 
more than one potential equilibrium out- 
come. In fact, the only possibility that can 
be ruled out without further consideration 
is that where 7 does not patent but E stays 
out, since E does better by entering know- 
ing that J will accommodate it, and any 
threat by Z will not be credible in this case.’ 

However, we can proceed further by mak- 
ing use of the assumptions, in particular 
(vii). Simple functions consistent with these 
properties are illustrated in Figure 3. These 
enable us to determine both ð and the 
range of possible equilibria. A maximum on 
[I(x —20,0)-- C.(0)] occurs at a value of 
8*, point C in the figure, representing E's 
best position if it expects a court battle. E 
will only expect this if C(0) > 0; the threat 
to fight must be believable (Dixit, 1982; 
Steven Salop, 1979). E's alternative is to 


SObserve that if J were to locate at x 4 A, rather 
than x, then without entry, its profits would be I(x + 
A,x—-A)«Il(x.x), and with entry, II(0,x — A) « 
TCE, x). 

Thus for example we might allow firm / to threaten 
to price down to marginal cost, but since this involves 
both J and E in losses, it is not credible (see, for 
example, Avinash Dixit, 1982). 
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stays out [II (x) - p, 0] 
rm court case [I1(0,3) - p+ G (8),TI( x-20,0) + C, (6)] 
x- 20 

patents 
no court case 
b II(0, x) - p, TI - 20,0)] 
does not patent 

stays out m (x,x), 0] 





enters at — : | 


FIGURE 2. THE GAME IN ExTENSIVE FORM 


T1(x,0) = 
T1(0,x) 







IIx - 28,8) +C (8) 


accommodates — [II (0,x), I1 (x,0)] 
entry 


K II (0, x) 


TI(@, x) + C (8) 


IKx -20,0) 


FiGURE 3. THE PROFIT FUNCTIONS 


choose a 0 value, here called 6, which gives 
the highest profits E can expect conditional 
on not triggering a court action, that 
is, where C,(6)=0. This yields E profits 
of Il(x —26,8). As the figure is drawn, 
I(x — 26, 6) > II(x — 20*, 0*) + C,(0*) 
(point D is higher than C), meaning that 


should J patent, the best E can do in the 
subgame in which it decides to enter is to 
choose the variety 8. But it is clear that the 
opposite could be true. Notice also that 
AE < CF; in the event of a court action, 
there is a net loss to the pair of firms, 
representing their legal costs. 


864 THE AMERICAN ECONOMIC REVIEW 


SEPTEMBER 1990 


TABLE 1— EQUILIBRIUM OUTCOMES OF THE GAME 





Relationship Between Profit Values 


T(x — 26,6) > 0; II, x)— p > IIO, x) 
IK, x)— p < TH, x) 


IKx —28,8) < 0; II0,x) — p z TICO, x) 


IKx —26*, 0*) + C (0*) > 0, IK(0*, x)— p + C(9*) > IIO, x) 


TI(6*, x) - p + C,(6*) « TIO, x) 








Text 
Outcome Reference 

Ix —26, ê) > T(x —20*,0*)-- C,(6*), that is, Ó Relevant 

Patent; No Court Case (1) 

No Patent; No Court Case 

But Entry (2) 

Patent; E Stays Out (3) 
T(x — 28,8) < I(x —26*, 6*)+ C,(0*), that is, 0* Relevant 

Patent, But 

Court Case (4) 

No Patent; No Court Case 

But Entry (2) 

Patent; E Stays Out (3) 


Ilx —26*, 6*) + C,(6*) < 0, II(6*, x)— p  CK6*) 2 TICO, x) 


The various possible situations arising 
from Figure 2 are written down in Table 1. 
Several features of interest emerge from 
these calculations. Most obviously, if the 
best E can do given the patent is to make a 
loss, E cannot credibly threaten to enter. 
Nevertheless 7 will patent, as long as we 
assume that II(x,x)— p » II(0, x), a most 
likely occurrence at current fee levels. 
Therefore equilibrium 3 results. However, if 
E can make a profit, any one of the other 
three equilibria can arise. Obviously, if J is 
better off in terms of final profits not 
patenting, then equilibrium 2 results. Given 
the current sizes of patent fees though, it is 
more likely that Z is better off patenting, in 
which case the two firms may implicitly agree 
not to fight (equilibrium 1) or, interest- 
ingly, they both may actually be better off 
going to court (equilibrium 4). This last out- 
come may be viewed as an example of a 
curious type of prisoners’ dilemma. The 
parties jointly would be better off at Ó with 
profit-sharing but E has an individual inter- 
est in being at 0* even in the knowledge of 
the court case, while location at 0* means / 
has an incentive to bring the case. 


Notice that between 9 = 0 and 6, TICO, x) + C6)» 
TI(@, x), which is greater than the profits earned over 
the same distance with price competition, so that price 
competition by / as a response to entry is never credi- 
ble in a single-shot game. Moreover, price competition 
would make entry, at 8 = 0 at least, unprofitable for E. 
Hence, we ignore this possibility. 


Thus a variety of equilibria are possible, 
with the specific outcome dependent upon 
characteristics of the patent system itself. 

Let us now consider some comparative 
static properties of the patents system and 
its associated legal apparatus. Note first that 
a reduction in the deadweight legal costs 
involved in court action will raise curve CJ 
in Figure 3 as well as raising GABH. These 
movements lead to some straightforward 
effects but also to others less obvious. One 
effect is to shift Ó to the right. However, it 
also leads to an increased likelihood of the 
0* equilibrium? Point C in Figure 3 is 
raised, at the same time as D shifts down 
line FD. Thus streamlining court procedure 
makes the potential entrant more wary of 
introducing a close substitute for /'s prod- 
uct, but also releases some of the latent 
demand for court action. 

So far, it has been assumed that, even if 
the legal system may award costs (as it can 
in the UK), the costs of engaging in the 
action far exceed those incurred in, and 
therefore in principle recoverable through, 
the courts. But consider the effect of intro- 
ducing a purely contingent-fee based legal 
cost framework (rather unrealistically, ig- 
noring unrecoverable costs) In that case, 
C,(8) = A(@)— K{6)>0. Therefore curve 


?It is fairly clear that, under a wide range of circum- 
stances, the set of equilibria is socially to be pre- 
ferred to the 0* set. 
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GABH lies entirely above line EBK. In 
consequence, the 6 equilibrium disappears, 
so that entry, if it occurs, must be at the 
edge of the market, or a court case will 
necessarily ensue. l 

Notice that changes in the magnitude of 
p leave both 6 (and 6*) equilibria un- 
changed, since lines GABH and EBK are 
net of patenting costs. Thus the only direct 
effect of a reduction in patent fees is to 
increase the likelihood of patenting. How- 
ever, if we make a mild additional assump- 
tion, we may say something rather more 
interesting regarding the propensity to 
patent. 

To do this, consider the hitherto ne- 
glected possibility that the desire for variety 
is sufficiently strong for E's most profitable 
position to be some way from /. Return to 
the "normal" penalty structure situation, 
where C(x /2)<0<C,0) and A*(8)« 0, 
and assume and not 6* is relevant. In that 
case, I’s propensity to patent will be lower 
when customers' desire for variety is strong. 
This transpires because, under the stated 
assumptions, E's optimal value of 0 in the 
absence of a patent, 0", say, is positive, 
which implies J would find itself some way 
up the line II(8, x). However, there would 
be no reason for Ó to be different. P S 
comparison in deciding whether or not to 
patent is between II(0", x), which increases 
with 0", and II(8, x)— p. Therefore as.the 
desire for variety, and so 0", increases, the 
former value is increasingly likely to be the 
larger, so leading J to choose not to patent. 

Casual empiricism is perhaps dangerous, 
but interindustry comparisons of the ratio 
between the proportion of firms with patents 
and the proportion with R&D programs, 
calculated from data reported in Zvi 
Griliches, Ariel Pakes, and Bronwyn Hall 
(1987), are suggestive. The lowest propen- 
sity to patent, defined in this sense, comes 
from the industry group “office, computing 
and accounting equipment," and low values 
are also featured in “communication equip- 
ment" and “professional and scientific 
equipment." Goods like “primary metals" 
and "petroleum refining" have very high 
values. Each of the low propensity groups 
could be considered as cases where cus- 
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tomers' desire for variety is Ven strong, and 
vice versa. 

The outcomes in our model may also be 
compared with those arising in Horstmann 
et al.'s rather different model. Like us, they 
predict that not all innovations will be 
patented, but they find those that are not 
patented will be imitated, whereas our 
model suggests they will often be copied. 
We would argue that at least some observa- 
tions support our version (for example, drugs 
in Italy; pirating of computer software). 
Their third major prediction is that patented 
goods or processes will be neither imitated 
nor duplicated. This rather counterintuitive 
outcome arises because in their model there 
is some positive probability that imitation is 
unprofitable, and the occurrence of a patent 
signals a case where the entrant's expected 
profit is negative; they make no claims for 
their prediction in those cases where imita- 
tion or duplication would always be prof- 
itable in the absence of a patent but, given 
the patent, imitation need not be. In sum- 
mary, the deterrence effect of patents arises 
in very different ways in their framework 
and ours. 


II. À Specific Illustration 


This section describes a simple example 
that illustrates a number of results in the 
preceding section but that focuses upon the 
welfare effects of location and firm numbers 
raised by the model. Since duopoly frame- 
works commonly will involve prices lower 
than under monopoly, whereas here by na- 
ture of the example prices are parametric, 
this is not an innocuous simplification. Our 
justification is that it enables us to concen- 
trate upon issues of current interest regard- 
ing numbers and locations of products and 
moreover that, by contrast with effects aris- 
ing from proportionately large changes in 
product numbers, deadweight welfare loss is 
a second-order effect. 

The example uses a formulation popular- 
ized by Curtis Eaton and Richard Lipsey 
(1978) among others—though we differ in 
employing the assumption of a bounded 
market, given the nature of the patent ap- 
plication. Consumers at some point in the 
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product space represented in Figure 1 have 
an (indirect) utility function: 


(1) V=ePe/Po+ I/P), 
where 


P, being the price set by the firm selling the 
nearest product, t the unit cost associated 
with that product being a distance z from 
the consumer’s ideal, J income and P, the 
price of the numeraire good, set equal to 
unity. All consumers are alike save in their 
views regarding the best product, and they 
are uniformly distributed across the market. 
Each firm is assumed to face costs | 


(3) C;=cQ+ Eij-i,e 


linear in total output Q, with a fixed ele- 
ment F, that may be smaller for firm E 
than for firm J. Deriving demand from (1) 
and integrating across area served yields 
profits, before deduction of any patenting or 
legal costs, as 


€ 
(4) II;- (Pp - c) — 
x[2— e Put — e-Pe]- F, 


D, and Dj being the left- and right-hand 
limits to sales. By differentiation, optimal 
price assuming price matching is 


(5) Př =1+c. 


Using (1) and (5), consumer surplus gener- 
ated over the area covered by a specific firm 
operating D, units to its left and Dp units 
to its right is 


e 0*9 


(6 CS= 


[2- e Pu = e Pr]. 
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Social surplus arising from the production 
of a particular product is S — CS + II, ob- 
tainable from (4) and (6). Thus we write 
down the surplus created when one cen- 
trally placed firm J, begins operations as 


-(-*c) 


(1) 5)(2,2)=— — [2-2e™] - E. 


Suppose now that there are two firms, in- 
cumbent and entrant. The entrant has to 
decide where it would be best to come in. 
Using (4) its profit is 


1 
(8) IL(x —280,0) rm 


x[2- e7720% e")—F, 


Assuming there is no patent constraining 
location, this is optimized for a value of 0 
given by 


x 1 


G) =Ma 5 ~ 


in2,0).. 


This implies locating right next to the in- 

cumbent unless the "transport cost" ex- 

ceeds approximately 0.7 per unit distance 
10 

x. 


V Tt may be useful to illustrate the possibility that a 
6* equilibrium may arise, within the context of this 
specific example. Assume 


C,(8) = TL (x 20,0) k 


6«04 
-01Il(x-20,0)-k 6201 
C(8)--11,(x-20,0)-k | 8404 


--04IL(x-20,0)-k 6204 


and consider the following parameter values: x = c= 
1, F, = 0.01,¢ = 0.1, k = 0.0083. Then 6 — 0.3, at which 
point E's net profits are 0.083. 8* is 0.1. clearly, at 
which point E's net profits are greater, at 0.0885. 
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With both firms in the market, the gen- 
eral expression for social surplus is 


(10) 5(0,x;x—20,0) 


2e 0*9 


t 


x [4—-2e^9 E e m ertt 20v] 
-E-EF,. 


Here it is easily confirmed that the socially 
optimal value for 0 is x /3. Hence by com- 
parison with (9), there is always a cen- 
tripetal tendency in this model. Thus there 
are two potential sources of static social 
welfare loss: having the wrong number of 
incumbents and having the wrong array of 
products. 

Assume for the time being that £ is suf- 
ficiently low for E, left to itself, to desire a 
location next to 7. Then it is immediately 
seen, by comparing (10) with 6 = 0, with (7), 
that society would be better off in a static 
sense with a monopolist, for any F,> 0. 
This also provides 7 with more profit, and 
hence more incentive, than does freer entry, 
since ILíx, x) » II,(0, x). We have: 


PROPOSITION 1: There are circumstances 
where if entry can be prevented by use of a 
patent system, the innovator will receive more 
(dynamic) encouragement in the form of ex- 
pected profits as well as society being better off 
in the sense of having larger social surplus. 


When we introduce the possibility of the 
incumbent patenting, then the entrant's lo- 
cation will never optimally be at 0 — 0, as- 
suming there are penalties for transgressing 
the law. Where 870, it is possible that 
society benefits from having two suppliers. 
The patent, as we shall see, increases this 
tendency. 

In society's eyes, it is worthwhile having 
two firms rather than one if S(0,x;x— 
20,0)» S(x,x) (ie. the value of (10) is 
greater than the value of (7). From the 
viewpoint of the potential entrant, it is 
worthwhile entering if the value of II, (x — 
28,0), that is, (8), is positive. It is clear that 
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E may enter when entry is socially undesir- 
able since there will be a “business stealing” 
effect (Gregory Mankiw and Michael Whin- 
ston, 1986), which may or may not be out- 
weighted by the gain in variety. The key 
factors here are the size of the entrant’s 
fixed costs, consumers’ desire for variety, 
and the form of the penalty scheme. 

Consider first a stylized “signpost” system 
of patents that does not designate any par- 
ticular product, save an exact copy, as 
outlawed, but that involves an increasing 
expected value of penalty as the entrant 
product impinges more strongly upon the 
patented one." Specifically, take two ver- 
sions: 


A. A penalty on £’s profits that is propor- 
tional to closeness, varying between no 
penalty at 0—x/2 and 100 percent 
penalty at 0 —0, namely, (1—20/x)x 
IL(x —26,6). Alternatively, this may be 
thought of as a penalty of II,(x — 26,6) 
with probability (1— 26 / x) 

B. A penalty equal to Ps loss in profit 
IL(x, x) — I1, (6, x), imposed with proba- 
bility (1 —20/x) 


These are stylized versions of the two 
common forms of damages in patent claims, 
namely, breach of contract (scheme A, sec- 
ond interpretation) and tort (scheme B). 
They may be contrasted with a perfect 
"fencepost" system that bars entry within a 
complete specified area, say, up to the edge 
of the market (with the penalty being a fine 
equal to the profits made by the entrants), 
and with blockaded entry. . 

In order to analyze the relative welfare 
effects of these schemes, we adopt a numer- 
ical approach.!? Some examples are given in 
Table 2. 


"As W, R. Cornish (1981, p. 128) puts it, “to some 
extent it is left to the court to work out the proper 
Scope of the monopoly from the description of the 
invention." 

7 Details of the calculations involved are omitted, as 
they are straightforward but tedious. For simplicity in 
calculating social welfare, legal costs are assumed to be 
lump-sum transfers to lawyers, and patenting costs are 
ignored. 
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TABLE 2—SOME SIMULATION RESULTS 














Fencepost 

Blockaded No Penalty Scheme A Scheme B (6 = 0.5) 

Case t F, F, S 85 S 6 $ 8 S S 

1 0.05 0.05 0.05 0.4780 0 0.4280 0.32 0.4324 048 0.4316 0.4313 
2 005 0.1 0.05 0.4280 0 0.3780 0.32 0.3824 048 0.3816 0.3813 
3 005 01 0.1 0.4280 0 0.3280 0.13 0.3308 N 0.4280 N 0.4280 
4 0.5 003 0 0.3960 0 0.3960 0.45 0.4276 0.40 0.4308 0.4225 
5 0.5 0.08 0.03 0.3960 0 0.3660 0.38 0.4016 0.40 0.4008 0.3925 
6 0.5 0.05 0.03 0.3760 0 0.3460 0.38 0.3816 0.40 0.3808 0.3725 
7 0.5 0.05 0.05 0.3760 0 0.3260 0.32 0.3623 0.40 0.3608 0.3525 
8 0.5 0.1 0.05 0,3260 0 0.2760 0.32 0.3123 040 0.3108 0.3025 
9 0.5 0.1 0.1 0.3260 0 0.2260 0.08 0.2411 N 0.3260 N 0.3260 
10 1 0.03 0.03 0.3122 0.10 0.3117 037 0.3405 0.37 . 0.3405 0.3241 
11 1 0.05 0.05 0.2922 0.10 0.2717 0.32 0.3012 040 0.2986 0.2841 
12 1 0.1 0.05 0.2422 0.10 0.2217 0.32 0.2518 0.37 0.2505 0.2341 
13 1 0.1 0.1 0.2422 N 0.2422 N 0.2422 N 0.2422 N 0.2422 
14 2 0.05 0.05 0.1840 0.22 0.2046 (0.31 0.2141 0.34 0.2145 0.1881 
15 4 0.05 0.05 0.0829 0.28 0.1136 028 0.1136 N 0.0829 N 0.0829 
16 4 0.1 0.05 0.0329 0.28 0.0636 0.28 1.0636 N 0.0329 N 0.0329 
17 4 0.1 0.1 0.0329 N 0.0329 N 0.0329 N 0.0329 N 0.0329 
18 5 0.05 0.05 0.0575 0.29 — 0.0841 N 0.0575 N 0.0575 N 0.0575 


Notes: c = x =1 throughout. 


S = social welfare, calculated from equation (10). 


N «entry does not occur; monopolist remains. 


Various points of note emerge from the 
table. Most obviously, the patent system, 
like free entry, commonly allows the second 
product to achieve some position in the 
market rather than barring it completely. 
Particularly for the lower t values, where 
consumers value variety only moderately, 
social welfare is markedly improved thereby 
over the free entry (and, sometimes, the 
blockaded entry) position. We have: 


PROPOSITION 2: The presence of a patents 
system can lead to products being better !o- 
cated in economic space than in its absence. 


For very low ¢ values, where consumers 
care little about variety, Proposition 1 domi- 
nates and blockaded entry is usually better, 
of course. 

It is also noticeable that, once we move 
toward rather high values for ¢ (for exam- 
ple, t=4 per unit distance), this effect is 
reduced since a substantial separation be- 
tween products would exist independent of 
the patent system, and any penalty imposed 
is as likely to discourage entry as it is to 
improve separation. In particular, a rather 


limited degree of protection, if any, seems 
desirable once we move to high t values, as 
cases 10-18 show. 

Generally, the fencepost system produces 
over-much separation of the products while 
permitting entry and so performs poorly over 
the range of values listed in the table. 
Scheme B appears the most resilient to pa- 
tameter changes in the sense that it always 
does better than either scheme A or the 
fencepost system (or both) for the range of 
values given. This is interesting since scheme 
B can be thought of as a proportional tax on 
(privately but not socially desirable) busi- 
ness stealing. Scheme A performs well ex- 
cept when F, is large, in which circum- 
stance the penalty will be small since E’s 
profits are low. 


III. Concluding Remarks 


This paper has pointed out the very com- 
plex effects of a patent system. Giving a 
monopoly right can often prove socially very 
worthwhile both in terms of encouraging 
firms to design innovative products and in 
terms of providing society with the right 
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number of products. However, there remain 
the very important questions of deciding 
. how extensive the right should be, and de- 
signing the system by which those rights are 
exercised. The positive desire for variety in 
some industries, coupled with the influence 
of the patent on the extent of variety that 
results, seems a noteworthy addition to the 
range of relevant considerations in a patent 
system. So also does the structural point 
that a contingent fee system for patent court 
cases might well largely destroy the poten- 
tial gains from increased variety. 

This paper, like many others concerned 
with welfare aspects of patents, faces one 
major problem in drawing conclusions: it is 
hardly likely that patent law can be made 
industry specific. Yet one implication may 
still be drawn. Broadly, patents provide 
wide-band protection as a result of consid- 
erable effort in formalization, and, if neces- 
sary, defense on the patentee’s part, whereas 
copyright provides easier but more narrowly 
defined protection. In some product areas, 
(for example, computer software), there is 
debate as to which is the more appropriate 
system since.it is not clear whether one or 
the other should apply. Here, that feature 
of the model, which tentatively suggests that 
where variety is valued very highly, broad 
band protection is likely to be socially infe- 
rior, would imply that copyright is the more 
appropriate form of protection for those 
cases where consumers desire a plethora of 
specific applications for specific situations. 
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Are Treble Damages Neutral? 
Sequential Equilibrium and Private Antitrust Enforcement 


By Davip BESANKO AND DANIEL F. SPULBER* 


A sequential equilibrium model of private antitrust enforcement is presented. 
Consumers have incomplete information about cartel costs and cannot accurately 
estimate a priori the damage recovery from an antitrust action. Consumers are 
able to infer cartel costs from the equilibrium pricing strategy of firms. The 
universal divinity criterion is used to characterize the sequential equilibrium. It is . 
shown that for a sufficiently large damage multiple, antitrust enforcement effec- 
tively increases social welfare. (TEL 026, 612) 


The dominant form of antitrust enforce- 
ment in the United States is the private 
party lawsuit. In the 1980s private lawsuits 
outnumbered public lawsuits by over 10 to 1 
(Steven Salop and Lawrence White, 1988). 
The large volume of private antitrust activ- 
ity has engendered heated debates in recent 
years regarding the welfare and efficiency 
effects of antitrust law.! Richard Posner 
(1976, p. 35) observes that the number of 
private actions has “grown explosively,” and 
he expresses “concern about the overexpan- 
sion of the antitrust laws.” Some research 
has raised the possibility that private an- 
titrust enforcement is neutral, with no de- 
terrent effects on competitive behavior. This 
would suggest that the antitrust laws are 
ineffective, and the large legal costs of pri- 
vate enforcement constitute a significant 
deadweight welfare loss. While not attempt- 
ing to resolve the ongoing antitrust policy 
debate, the present paper provides a game- 
theoretic model of private antitrust that ad- 
dresses the issue of neutrality. It is shown 
that in a sequential equilibrium with asym- 


*Department of Business Economics and Public 
Policy, Indiana University; and Kellogg Graduate 
School of Management, Northwestern University. We 
would like to acknowledge the helpful comments and 
suggestions of Jeffrey Banks and Jonathan Baker. Re- 
sponsibility for any remaining errors is ours. 

‘William Breit and Kenneth Elzinga (1985, p. 405) 
observe that the antitrust literature and congressional 
debates have been “almost uniformly critical" of pri- 
vate treble-damages actions. 
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metric information, private antitrust en- 
forcément can increase social welfare if the 
probability of conviction is sufficiently large. 

It is widely recognized that the treble 
damages remedy can create “perverse in- 
centives” for private enforcers. William 
Breit and Kenneth Elzinga (1974, 1985) have 
argued that under a multiple damages rem- 
edy, buyers may have no incentive to avoid 
the damage of monopolistic overcharges by 
searching for competitive substitutes. In- 
deed, buyers may even seek to increase their 
damages by i increasing their purchases from 
a price-fixing cartel? Frank Easterbrook 
(1985, p. 451) notes that to the extent that 
buyers “have perfect information and en- 
forcement is costless, they view the future 
recovery as a cents-off coupon attached to 
each purchase." Recognizing these incen- 
tives, Jonathan Baker (1988) and Steven 
Salant (1987) analyze models of private an- 
titrust enforcement and find that the multi- 
ple damages remedy has neutral welfare 
consequences. In their models, consumers 
increase demand in anticipation of collect- 
ing treble damages, and this offsets the 
increase in the cartel's price due to the 
expected penalty. As a result, the cartel's 
output exactly equals the unconstrained 
monopoly level. Consumers’ and producers’ 


Elzinga and Breit (1976) suggest that these incen- 
tives may have been at work in the electrical equip- 
ment conspiracy of the 1950s. 
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surpluses are unchanged because the ex- 


pected damage award is a pure transfer.’ 

The neutrality of various legal institutions 
has often been. stressed in the law and eco- 
nomics literature. The classic analysis of 
Ronald Coase (1960) argued that the out- 
come of private bargaining over an external- 
ity was unaffected by whether property rights 
were assigned to the polluter or to the vic- 
tim.^ As Coase stressed, the neutrality of 
legal. institutions depends upon the absence 
of transaction costs, including availability of 
full information. Private antitrust enforce- 
ment differs from enforcement of criminal 
law (see David Besanko and Daniel Spul- 
ber, 1989b) in that the two parties are in a 
contractual relationship. Private antitrust 
enforcement is similar in this sense to ac- 
tions under contract or tort law that involve 
a buyer and a seller. Transaction costs can 
affect contractual terms between cartel 
members and their customers and thus de- 
termine the effectiveness of private an- 
titrust. The purpose of.the present analysis 
is to examine the effects of imperfect infor- 
mation on the neutrality of private antitrust 
enforcement. i 

This paper presents a model of private 
antitrust enforcement in which the cartel 
has better information about its production 
costs than prospective buyers. Consumers 
thus make purchase decisions with imper- 
fect information about whether price-fixing 
has occurred and the extent of the antitrust 
violation (the markup of price over marginal 
cost). A sequential equilibrium is obtained 
in a game-theoretic model in which the car- 
tel sets a price and consumers then decide 
how much to purchase and whether to take 
legal action against the cartel. The universal 
divinity criterion due to Jeffrey Banks and 
Joel Sobel (1987) is used to restrict con- 


The “perverse incentive” also operates in Pablo 
Spiller’s (1986) dynamic model in which buyers make 
repeated purchases from a cartel over time in anticipa- 
tion of obtaining treble damages. Buyers stop making 
purchases and sue the cartel only when total damages 
reach the limits of the cartel’s liability. Limited liability 
leads to an outcome that is not welfare neutral. 

^See Spulber (1987) for a related analysis of damage 
remedies for breach of contract. 
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sumers’ “off the equilibrium path” beliefs. 
In equilibrium, there is a positive relation- 
ship between the cartel's price and con- 
sumers' expectations of the cartel’s costs. In 
contrast to the situation in which consumers 
have full information about the cartel's cost, 
a higher price will not necessarily lead con- 
sumers to believe that the antitrust violation 
is more severe and thus to expect a larger 
damage award. Asymmetric information 
about the cartel's costs thus introduces a 
friction that offsets the neutrality due to the 
“perverse incentive.” As a result, a multiple 
damages remedy can result.in welfare im- 
provements if the probability of conviction 
or damage multiple is sufficiently large. 
The paper is organized as follows. The 
model of private antitrust enforcement is 
set out in Section I. Neutrality of private 
antitrust under full information is examined 
in Section II. The sequential equilibrium 
under asymmetric information is presented 
in Section III and further characterized in 
Section IV. Conclusions are in Section V. 


I. A Model of Private Antitrust Enforcement 


A. group of identical firms in a given in- 
dustry must decide whether or not to form a 
cartel. If a cartel is formed, the market 
price p is chosen cooperatively. Establish- 
ing the cartel is costless but carries the risk 
that the firms may be sued by consumers for 
price-fixing and the prospect of multiple 
damages. If the firms choose not to form a 
cartel, Bertrand competition ensues, and the 
market equilibrium price equals marginal 
cost. Besanko and Spulber (19892) analyze a 
similar cartel model in a public enforcement 
setting. 

The cartel's marginal cost can take values 
in the set {6,,...,6,,}, where 6, «60,, i € 
(1,...,7m — 1). Cartel costs are not known to 
consumers. Let y; be a consumer's prior 
probability that the cartel’s cost is 6;. The 
prior probability is the: same for all con- 
sumers and is common knowledge. 

If a lawsuit results in the cartel being 
found guilty of fixing price p above marginal 
cost, the courts are assumed to determine 
perfectly the overcharge p — 0. The winning 
consumers collect damages t( p — 8)g, where 
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t>0 is a damage multiple. Let B €(0,1), 
represent the probability that the cartel will 
be found guilty by the courts. This allows 
for legal rules that require evidence of col- 
lusion in addition to the existence of 
markups above marginal cost. We assume 
that 8 is common knowledge to consumers 
and firms. If 8t z 1, expected profits from 
cartelization would always be nonpositive. 
Thus, if the conviction probability or the 
damage multiple is sufficiently large, multi- 
ple damages result in total deterrence of 
price-fixing. Accordingly, it is assumed 
throughout that 8: «1. For example, with 
treble damages the probability of conviction 
of a guilty cartel must be less than one-third. 
The issue is then to determine whether pri- 
vate antitrust enforcement has marginal de- 
terrence effects on industry behavior. 

The cartel and consumers interact over 
two dates. At date zero, the firms decide 
whether to form a cartel. If they do, the 
market price p is chosen cooperatively and 
announced to consumers. Upon observing 
this price, consumers make a utility-maxi- 
mizing purchase decision. Then, at date one, 
consumers can sue for price fixing.ó A con- 
sumer compares the expected return from 
suit with the legal costs of suit. If legal costs 
are normalized to zero, then under a multi- 
ple damages remedy, the consumer will sue 
if and only if p> 0. If the consumer knows 
8, he can adjust his purchases to reflect the 
anticipated damage award. If cartel costs 8 
are unknown, the consumer's purchase level 
and decision to sue depend on his expecta- 


‘The probability of conviction is assumed to be 
independent of the seller's markup, and thus of the 
cost uncertainty. This is more restrictive than Baker 
(1988) and Salant (1987) who allow the probability of 
conviction to depend on cartel output. The joint proba- 
bility of detection and conviction may increase with the 
markup. The present model makes detection exoge- 
nous, An endogenous conviction rate based on cartel 
behavior would be of interest for future work. 

It is assumed that consumers and the cartel have 
the same discount factor and that the discount factor is 
included in the damage multiple t. (That is, let ¢ = ôf’, 
where 6 is the discount factor and ¢’ is the nominal 
damage multiple.) 
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tions about €. The consumer's expectations 
will depend on the observed equilibrium 
cartel price. 

The cartel is assumed to face a market of 
identical consumers. Industry demand at 
date zero thus can be derived from the 
maximization problem of a representative 
consumer. The representative consumer is 
assumed to have quasi-linear preferences 


- V(g)4 x over the cartel’s product q and a 


numeraire good x. The total willingness to 
pay V(q) satisfies V(0) = 0; V'(q) > 0, V"(q) 
<0. Let P(q) 5 V(q) and O(p)= Pp). 
In the absence of antitrust enforcement, 
Q(p) would be the industry demand func- 
tion, and P(g) would be the industry in- 
verse demand. 

The setting we consider has two notewor- 
thy features. First, notice that consumers 
cannot make demand decisions or choose 
possible legal actions until the market price 
has been observed. This implies that con- 
sumers cannot make commitments to a par- 
ticular course of action. In this way the 
situation differs from principal agent-type 
models of regulation or antitrust (for exam- 
ple, Besanko and Spulber, 1989a) in which 
enforcement authorities can commit to a 
probability of suit so as to deter monopoly 
pricing. Public authorities can issue an- 
titrust guidelines, invest agency funds, or 
target markets for investigation and prose- 
cution, although public agencies can also 
encounter problems in making credible 
commitments; see Besanko and Spulber 
(1989b). Since private enforcement efforts 
are costly to implement and are of:en unob- 
servable to violators, enforcement efforts are 
not credible threats. This suggests the pre- 
sent sequential equilibrium model in which 
consumers make purchase and enforcement 
decisions after firms have made price-fixing 
decisions. 

Second, since industry costs are unknown 
to consumers, consumers may gain informa- 
tion about industry costs (and thus the ex- 
tent of possible antitrust violations) from 
the market price. In equilibrium, a con- 
sumer's demand and decision to sue are 
based upon the information conveyed by 
prices. The information content of prices 
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plays an important role in the cartel’s pric- 
ing strategy.’ 


H. Neutrality Under Symmetric 
Information About Industry Costs 


The neutrality of private antitrust en- 
forcement in now examined. It is then shown 
that neutrality can be eliminated by various 
institutional constraints, even under full in- 
formation. These constraints are relaxed in 
the next section to highlight the effects of 
asymmetric information. 

Baker (1988) and Salant (1987) show that 
in the full-information, setting with zero le- 
gal costs, private antitrust enforcement will 
have neutral welfare and distributional con- 
sequences. Their result can be derived as 
follows. 

The consumer’s utility maximization 
problem at date zero is 


ma (a) — pq $ Bt( p- 9) allo, 24 p)}, 


where Jø, (p) is an indicator function tak- 
ing on value one for p>@ and a value of 
zero otherwise. If the expected damage 
multiple Bt is less than one and price is at 
least as great as marginal cost, the con- 
sumer's demand q/( p, 6) is given by 


(1) q/(p,0) 7 Q((1— Bt) p + Bt0). 


For future reference, note that q'(p,0) is 
decreasing in 8. That is, a consumer will 
purchase more from a cartel with lower 
production costs because at any given mar- 
ket price the consumer expects a greater 
damage award. 

The consumer's marginal willingness to pay 
for the cartel's output under multiple dam- 
ages is 


p/(q,0) =[P(q) - 8:6] /(1— Bt). 


Baker (1988) recognizes that private information 
may be inferred from the cartel’s market actions but 
exogenously specifies buyer expectations to reflect ei- 
ther full information or a windfall damage recovery. 
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For any quantity of output below the per- 
fectly competitive level, a multiple damages 
award increases the consumer's marginal 
willingness to pay. This. follows from the 
fact that with a multiple damages remedy, 
consumer marginal willingness to pay 
p/(q,0) exceeds the marginal benefit P(q) 
from consumption. 

If the cartel forms, the profit-maximizing 
price solves? 


(2) max(p— 6)a/ (p, 6) (1- BO. 
The solution is given by 
p!(8,) - (p"(6)) - Bre)(1- Br) " 
> p"(6,) 
q!(6,) = a/(p/(6,),0) - a"(6)), 


where qg™(0,) and p^(0, denote the 
profit-maximizing monopoly quantity and 
price for 0 = 6,. The additional demand due 
to expected damages exactly compensates 
for the deterrent effects of damages. Thus, 
if a multiple damages remedy does not to- 
tally deter price-fixing (i.e., Bt « 1), the car- 
tel chooses the monopoly output and 
charges a price that is greater than the 
monopoly price. Moreover, it is straightfor- 
ward to verify that consumers’ surplus 
csf(0;) and cartel profits 7/(0;) under pri- 
vate antitrust are equal to their respective 
levels, cs(@,) and "(6,), in the absence of 
antitrust. The gain to consumers from ex- 
pected damage recovery is exactly offset by 
the loss in consumers' surplus due to the 
higher price. One can show that if the prob- 
ability of conviction 8 depends on the 
markup, as in Baker (1988) and Salant 
(1987), the neutrality result is preserved. 

A number of institutional factors can 
eliminate the neutrality of treble damages. 


*We assume that the profit function (p — 
84 p,9,(1— Bt) is well-behaved: twice continuously 
differentiable in p with a unique interior local and 
global maximum on (6,,). 
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Limited liability can cause the cartel price 
to fall below the monopoly level. Spulber 
(1989) shows that for the intermediate lia- 
bility levels, some moderation of markups 
may occur. However, relatively high liability 
levels do not affect the equilibrium and 
relatively low liability levels are simply de- 
ducted from cartel profits with no equilib- 
rium effects. In a dynamic setting, Spiller 
(1986) shows that with limited liability, con- 
sumers may postpone their suits perhaps 
indefinitely or until damages rise to equal 
the firm’s liability. Spulber (1989) also con- 
siders the case of fixed costs of suit. Fixed 
legal costs can cause the cartel to set a price 
p such that the expected damage payment 
exactly equals the expected cost of suit k, 


Bt( B —0)a( P) =k, 


where q(p) is market demand. Thus, 
markups are positive, but the consumer has 
no incentive to sue. For example, under 
treble damages, the cartels profits equal 
one-third of consumer legal costs divided by 
the likelihood of conviction; that is, m = 
k /3B. This holds even if the costs of a 
successful suit are borne by the defendant 
(by Section 4 of the Clayton Act), because 
we can let kK=(1—£)k' where k’ is the 
actual cost of suit. 

Suppose that damages are calculated on 
the basis of a reference price p in excess of 
marginal cost (i.e., (p — p)q). The cartel is 
thus allowed a maximum positive markup. 
Then cartel actions can be affected by a 
variety of institutional constraints. “Decou- 
pling" of damage payments and penalties 
for price fixing, as advocated by William 
Schwartz (1980), A. Mitchell Polinsky (1986), 

- and others, moderates cartel markups. Un- 
der decoupling, the government imposes a 
fine on price fixers that drives a wedge 
between damage multiples for the cartel 
and for consumers, (^ # t^. Similarly, if con- 
sumers and firms face legal costs that are 
contingent and proportional to damages, 
and these legal costs are not equal, treble 
damages are no longer neutral. Finally, sup- 
pose that the consumer and firm have dif- 
fering estimates of the likelihood of convic- 
tion, B^s Bf. Then, the damage remedy 
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moderates markups and is no longer neu- 
tral? In each of these cases, setting the 
reference price equal to marginal cost re- 
stores the neutrality result. 


III. Sequential Equilibrium Under 
Asymmetric Information 


Consumers who have imperfect infor- 
mation about the cartel's production costs 
cannot accurately assess the extent of the 
violation and the size of damages. In this 
section, we examine the impact of asymmet- 
ric information using a sequential equilib- 
rium model of the interaction of a represen- 
tative consumer and the cartel. The analysis 
indicates that asymmetric information intro- 
duces a friction that can eliminate the neu- 
trality of a multiple damages remedy. 

The game between the cartel and con- 
sumers is defined as follows. A cartel with 
cost 6; € (6,....,0,.) chooses a pricing strat- 
egy p=p(6;).'° The representative con- 
sumer observes the price p set by the cartel 
and makes an inference, represented by a 
posterior probability y,(p), that the cartel's 
costs are equal to 60, Let y(p)= 
(y,(p),..-:%n(p)) represent the posterior 
probability distribution across cartel types.'! 
The consumer makes his purchase decision 
q(p) and his decision to sue to maximize 
expected utility, given the observed price p 
and the posterior probability distribution 
y(p) on the cartel’s costs. To keep the focus 
entirely on the effects of asymmetric infor- 
mation, assume zero legal costs for both the 
cartel and the consumer. Because legal costs 
are zero, the consumer will sue as long the 
observed price p exceeds the lowest possi- 
ble cost 0, af the cartel. 

A sequential equilibrium ^ of the game 
consists of strategies and beliefs {p*(6;), 


12 


This is shown in Spulber (1989). Baker (1988) also 
examines the case of divergent estimates of the likeli- 
hood of conviction. 

“Attention is restricted to pure strategies for ease 
of presentation although it can be shown that the 
cartel will not employ randomized strategies. 

"Thus Ox y(p) xli—1,...,m and E? iy; 
(p) 71. i 

See David Kreps and Robert Wilson (1982) for a 
general definition of sequential equilibrium. 
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q*Cp), y*Cp)) such that the following condi- 
tions hold. 


(a) For each 6;€(0,,...,0,), the cartel's 
pricing strategy p*(0,) is a profit-maxi- 
mizing best response to the consumer's 
purchasing strategy. That is, p*(0;) max- 
imizes 


(p-6;)q*(p)(1— Bt) 


for every 0; € (6,,...,6, ). 

(b) For any p x 0 the consumer's purchas- 
ing strategy q*(p), given beliefs y"(p) 
and the observed market price p, maxi- 
mizes net expected benefits, 


V(q)-— pa + Bt(p —8*( »))als, (p); 


where 9* p)s Xjf.,0;y*Cp) is the con- 
sumer's posterior expectation of 
marginal cost. This condition implies 
that for p > 04, 


q*(p) 7 a'(p,0*( p)). 


(c) For prices p on the. equilibrium 
path—that is, p*~'(p)#@—the con- 
sumer's beliefs y*( p) are consistent with 
Bayes' rule and the cartel's equilibrium 

. Strategy q*(p). This implies 


Yi 


Yj 
U:p*(6) — p) 


if p*(0;) = p 
0 
if p*(6;) * p. 


A price p is said to be “off the equilib- 
rium path" if it would not be chosen by a 
os following the equilibrium strategy 

*(-). That is, a price p such that p* (p) 
= are is off the equilibrium path. The defini- 
tion of a sequential equilibrium requires 
specification of the consumer’s beliefs in 
response to off the equilibrium path prices. 
This is required to verify that the cartel will 
not have an incentive to deviate from its 


yf Cp) € 
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equilibrium strategy. In many sequential 
equilibrium models, the degrees of freedom 
afforded by off the equilibrium path beliefs 
results in multiple sequential equilibria. In 
this analysis, we restrict off the equilibrium 
path beliefs to satisfy the universal divinity 
criterion due to Banks and Sobel (1987). 
Following Banks (1988), universal divinity is 
defined as follows. 
Let p*(0) and q*(p) be equilibrium 
strategies, and let p be a price such that 
p* (p) Ø. Define the function A(6;, p) 


A the level of consumer demand that makes 


a cartel with cost 0, indifferent between 
following its equilibrium strategy p*(8;) and 
deviating from that strategy by charging 
price p; that is, A(6,, p) is given by 


(3 (»-8)h(68, p) (1- Bt) 
i (p*(8;) = 0,)a* (p*(8,))(1— Bt), 


for each 6; € (0,,...,0,,).? Universal divinity 
requires that consumer beliefs place posi- 
tive probability only on those types that are 
most likely to. deviate from the optimal 
strategy. For any p, this requires finding the 
lowest critical output level h(0,, p) across all 


' possible 6;. This is because a cartel antici- 


pating: a demand response larger than 
h(0; p) will also choose to deviate. Thus, 
given the universal divinity criterion, out- 
of-equilibrium beliefs place positive proba- 
bility on those types 6’ such tliat! 


@earg min A(6,p). 
) 


8 € (0,,...,0,, 


We develop the characterization of the 
sequential equilibrium in a sequence of 
lemmas, culminating in a characterization 


“Because the cartel's equilibrium profit must be 
positive, condition (3) defines A(6;, p) only for p> 6, 
For p x 6j, we define h(6;, p) — o. 

14Note that by defining (0, p) o» for 8,» p, we 
ensure that the consumer places zero probability weight 
on those types 0; for which 6; p. This is as it should 
be. A firm with cost type. 0; would never wish to 
deviate from its equilibrium price to a price p below its 
marginal cost, no matter what the anticipated demand 
response. i 
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of the equilibrium in Propositions 1, 2, and 
3. A discussion of the intuition underlying 
the equilibrium follows Proposition 3. 

Let 7*(6;) denote the equilibrium profits 
of a cartel with cost 0;: 


(4) «-*(8) = (o*(8)) — 6,)a*(p*(6;)) 
X(1- Bt),i € (1,...,m). 


For the pricing strategy p*(0;) to be a se- 
quential equilibrium strategy, it must be in- 
centive compatible. That is, a cartel with cost 
6, must prefer the price p*(6;) to the price 

p*(6, ) that a cartel without 6; would choose 
in equilibrium. Thus, 


(S) 7*(6,) > (o*(8,) — 6,)a*(p*(6)) 
x(1- Bt),i,j €(1,...,m),i *j. 


Incentive compatibility has the following 
important implications. 


LEMMA 1: The cartel's equilibrium pricing 
strategy is nondecreasing in 0,; that is, 


p*(01) < p*(02) x =: x p*(6,). 


Along the equilibrium path, the equilibrium 


quantity demanded is nonincreasing in 0;; that 
is, i : 


q*(p*(8,)) > 4*(0*(6,)) 
s z q*(p*(8,,)). 


Finally, the cartel’s equilibrium profit is de- 
creasing in 0j; that is, 
T*(0,) » 7*(0,) > ++: » v*(0,). 
The proofs of this and all subsequent 
lemmas and propositions appear in the Ap- 
pendix. 
The next step in the analysis is to use the 


universal divinity criterion to pin down equi- 
librium posterior beliefs. 


LEMMA 2: Let (p*(-),9* C), y* C] be a se- 
quential equilibrium satisfying universal divin- 
ity. Suppose that the equilibrium is fully sepa- 
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rating; that is, p*(@,)-< p*(@,)< = < 

p*(6,,). Then equilibrium posterior beliefs are 
given by? 

yt(»)-1 


ytCo) =] 


pe(0, pi 
pe (BB, 
i€(2,...,m- 1) 
Ya(P)=1 D> By 
where 


= 7*(0)(0;,, — 9) 
GETOON 


i € (1,...,m— 1}, 


Di S Pisi: 
and , 


p, c [0* (8). 0*(8,..] fori e (1... m - 1). 


This lemma says that for off-the-equi- 
librium path prices within a neighborhood 
(p; Dj) of the. equilibrium price p*(@,), a 
consumer infers that the cartel's cost is 6,, 
with probability one. For prices less than- 
(greater than) the smallest (largest) possible 
equilibrium price, the consumer infers that 
the cartel has the lowest (highest) 8. 

The possibility that the sequential equi- 
librium involves pooling is ruled out by the 
universal divinity restriction on beliefs. 


LEMMA 3: Let (p*C),q* C), y*C-)} bea se- . 
quential equilibrium satisfying universal divin- 
ity. The equilibrium must be fully separating; 
that is, 


p*(6,) « p*(62) € >+ €«p'(8,) 
q*(p*(8, 3» e*Co*(8)) 
MET m (8, )). 


For p «6, consumer beliefs are not relevant be- 


' cause consumers will not sue for price fixing. 
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The universally divine sequential equilib- 
rium is now characterized in Proposition 1. 


PROPOSITION 1: There is a unique se- 
quential equilibrium satisfying universal divin- 
ity. That equilibrium has the following fea- 
tures. 

(a) A cartel with the highest possible cost 6,, 
produces the full-information (i.e., 
monopoly) quantity, charges the full- 
information price, and earns its full- 
information (i.e., monopoly) profits, 


q*(p*(0,)) = 4/(8,) = a" (0m), 
: p*(8,) =P Om) > P (On), 
7*(0,) 7 7/(0,) =T” (0n). 


(b) The equilibrium price p*(0;) for a cartel 
with cost 0;,i &(1,...,m —1), solves the 
following constrained optimization prob- 
lem 


max(p = 6)9/(p,9)(1— Bt) 


(6) subject to: (p — 6;,,)q/( p,06)) (1— Bt) 
<17*(6;41). 


(c) Equilibrium consumer demand is given by 


Q(p) p €[0,0,] 
4'(p,0) pe(0, pil 
q4*'(p)7-14/(5,68) pPe€(B.v,Bil 
ie(2,...,m—-1) 
q(p,0,) P»P,. 


A multiple damages remedy is only neu- 
tral in general when the cartel has the high- 
est possible cost, as shown by: part (a) of 
Proposition 1. A cartel with the highest 
possible cost chooses the monopoly output 
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and earns monopoly profit. The highest- 
cost cartel's price exceeds the monopoly 
price by just the amount needed to compen- 
sate for the damage award, as in the full- 
information case. A cartel with any lower 
cost behaves as if it were solving a con-: 
strained optimization problem, given as 
equation (6) in part (b). The constraint arises 
as a result of consumer expectations. 
Clearly, a cartel would like to choose the 
profit-maximizing price. However, if the 
cartel's price is set too high, consumers will 
incorrectly infer that the cartel's costs are 
higher than they actually are. This will re- 
duce consumer demand because consumers 
would then anticipate a lower damage 
award. The reduction in demand would re- 
sult in lower cartel profits. Therefore, the 
cartel’s pricing decision is constrained. 

The effects of the constraint imposed on 
the cartel by consumer expectations are re- 
flected in part (c) of Proposition 1. It is 
shown that if the cartel’s price falls below a 
critical value, consumers’ expectations of 
cartel costs are revised downward and con- 
sumers adjust their demand accordingly.!’ 
This yields a modified market demand 
schedule, q*( p), that cuts across the family 
of full-information demand schedules, as 
shown in Figure 1. 

The unique sequential equilibrium ob- 
tained in Proposition 1 has a significant 
economic implication. Private antitrust en- 
forcement effectively alters the market equi- 
librium and improves social welfare when- 
ever consumer expectations constrain cartel 
pricing behavior. 


‘©The observation that the equilibrium outcome at 
one end of the range of agent types corresponds to the 
full-information outcome is relatively standard in sig- 
naling models. See, particularly, George Mailath (1987) 
on separating equilibria in a continuous-types signaling 
game. It is expected that our result will carry over to 
the case of a continuum of types, as in Mailath. Then 
in any separating equilibrium, all lower cost types will 
price below their full-information price, thus generat- 
ing welfare gains. 

"Tf constraint (6) in the optimization problem in 
Proposition 1 is binding for a given i, then p; = p*(6;). 
This implies that an increase in price by a cartel with 
cost 6; above its equilibrium price will induce con- 
sumers to infer that its cost is greater than 6,. 
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FIGURE 2. SEQUENTIAL EQUILIBRIUM FOR THE THREE-TvPE CASE 
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6, p' (8j) 





FIGURE 3. THE SEQUENTIAL EQUILIBRIUM IS WELFARE NEUTRAL WHEN 
CaRrTEL's Cost ts 6; 


PROPOSITION 2: Private antitrust enforce- 
ment under a multiple damages remedy lowers 
the equilibrium price, increases output, and 
raises social welfare whenever the consumer 
expectations constraint (6) is binding. That is, 
if (6) is binding for any given i, then p*(0;) < 
D/(8;) and q*(p*(8,)) > q/(6;) = a6). 


The sequential equilibrium for a three- 
type example is illustrated in Figure 2. The 
downward sloping convex curves are profit 
indifference curves for the cartel in quan- 
tity-price space. The downward sloping lines 
are the full-information demand functions, 
q/(p,60;). The bold line indicates the equi- 
librium consumer demand function, g*(p). 
The figure shows the case in which con- 
straint (6) is binding for i —1 and 2. In this 
case, the equilibrium price for a cartel with 
either of these cost levels is less than the 
full-information price. 


Proposition 2 establishes a condition for 
the nonneutrality of private antitrust en- 
forcement. This is useful in establishing a 
critical value for the damage multiple that 
yields effective private enforcement. This is 
the subject of the next section. 


IV. Welfare-Improving Antitrust Enforcement 


The situations in which private antitrust 
enforcement. enhances social welfare can be 
illustrated with the aid of Figures 3 and 4. 
Figure 3 depicts the constrained. optimiza- 
tion problem for a cartel with costs 8,. The 
hill JK shows attainable price-profit combi- 
nations for the cartel. Suppose private en- 
forcement is welfare neutral for the next 
highest cost level 8;,,. The locus GH then 
shows the boundary of the set of points 
satisfying (6). The full-information solution 
is optimal if and only if the locus GH lies 
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FIGURE 4. THE SEQUENTIAL EQUILIBRIUM IS WELFARE ENHANCING WHEN 
CarTEL’s Cost 1s 6; 


above the profit hill at the full-information 
price p/(6;), as shown in Figure 3. Alge- 
braically this is equivalent to!? 


T"(6;) <7 (6,41) 
+ (6:41 = 9;)9™"(6;)(1— Bt) 
or 
(7) Btx4;1-(v"(6)- v"(86,,)) 
/ (8... — 6)a" (6,)) € (0,1). 


If condition (7) is not satisfied, then the 
situation is as shown in Figure 4, and it 


'8The proof that wir 0 is as follows. From the 
envelope theorem, v M(8) is decreasing and convex in 8 
with da ^(0) d8 = — q^ (0). By convexity, 4" (805 
G7" (8)— mO 0/(0,, , — 8). 


follows that zhe sequential equilibrium price 
p*(6,) is strictly less than the full-informa- 
tion price p/(0,). 
The condition in (7) enables us to prove 
the following result. 


PROPOSITION 3. 


(a) If the expected damage multiple Bt is less 
than or equal to a critical value $* 
min(o,, .. Pm i}, then for all i €{1,...,m)} 
p*(8) = p/(6), and q*(p*(8,) = q™(6,). 
That is, pe antitrust enforcement is wel- 
fare neutral for any possible realization of 
cartel cost. 
(b) If the expected damage multiple Bt ex- 
ceeds the critical value @*—that is, Bt € 
($*,1)—then there exists at least one i€ 
(L,...,m —1) such that p"(8)) « p/(0)) and 
a" p*(6, 9 » q™(0,). That is, private antitrust 
enforcement results in welfare improvements 
for some realizations of cartel cost. 
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Proposition 3 indicates that a relatively 
high damage multiple, such as treble dam- 
ages, is more likely to lead to welfare im- 
provements. Detrebling antitrust damages 
may push the expected multiple below the 
critical level, $*. 

Given a sufficiently high conviction prob- 
ability or damage multiple, private antitrust 
enforcement is unlikely to be welfare neu- 
tral under asymmetric information. More- 
over, as 6;,, approaches 6; for each i, the 
critical value $* approaches zero and pri- 
vate enforcement results in marginal deter- 
rence for nearly all values of B and t. Thus, 


unless significant differences exist between ' 


possible cartel costs, private antitrust en- 
forcement is unlikely to be neutral. 

The intuition for why asymmetric infor- 
mation can upset the neutrality results has 
to do with the relationship between the 
cartel’s price and consumers’ expectations 
of the magnitude of the antitrust violation. 
When consumers do not know the cartel’s 
cost, they do not know with certainty the 
damages they will recover if the cartel is 
convicted of a violation. However, they know 
that for any given market price, damages 
will be greater the lower is the cartel’s cost. 
In equilibrium, a cartel with a lower cost 
charges a lower market price, and con- 
sumers anticipate this. As the cartel lowers 
its market price, the consumer’s net 
price—price minus the expected damage 
award—falls.? Due to the informational 
effect, the net price falls by a greater amount 
than it would under full information.?? Thus, 


We would like to thank Jonathan Baker for sug- 
gesting this verbal exposition. 


The buyer's net price, p^*7, is given by 


pNET = p— Bt( p—6*(p)) 
7 (1- Bt)p + Bt0*(p), 
where p is the market price. Under full information, 


ApP£1 /A p — 1— Bt, while under asymmetric informa- 
tion, 


ApNET/A p = (1— Bt) + BIA8* /Ap. 
From Lemma 2, A@*/Ap, the change in consumers’ 


expectations with respect to price, is positive. Thus 
ApNE* /A p is larger under asymmetric information. 
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under asymmetric information, a given de- 
crease in market price generates more ad- 
ditional demand than it would under full 
information. In effect, asymmetric informa- 
tion makes the market demand curve more 
elastic, which leads the cartel to set a lower 
price than it would under full information. 
This effect can be seen most clearly for the 
case in which the possible realizations of 
the cartel's costs are very close together. 
In this case, the equilibrium demand curve 
approaches the smooth curve shown in Fig- 
ure5. — 

An example can be given to illustrate the 
situation where antitrust is always effective 
for low cost realizations. For the case of 
linear demand, P(g)-a-— bg, and a dis- 
crete uniform distribution of types with 6, , 
—6,—£ for iec(1,...,m—1) and Bt» 
3€(a—6,)7', there exists a cost type 6; > 6; 
such that private antitrust enforcement re- 
sults in a welfare improvement for all cost 
realizations 8; «60; and results in welfare 
neutrality for all cost realizations 0; = 6,.”" 
That is, 


p*(8) <p!) and 
q*(p*(9;)) > a" (8,) fori & (1,...,j—1), 

p*(6;) = p/(6;) and 
q*(p*(6,)) - 4" (0;) for iE (j,..., m]. 


The expected damage multiple Br has 
two effects on the market equilibrium. First, 
a higher expected damage multiple can raise 
the likelihood that antitrust policy will be 
welfare enhancing by increasing the set of 
cost types for which nonneutrality holds, as 
in the preceding example. Second, as shown 
in the next proposition, given nonneutrality, 
a higher expected damage multiple induces 
the cartel to moderate its markup. 


PROPOSITION 4: Suppose q*(0) = 


q*(p*(6;)) <q™@(6,); that is, the equilibrium 
results in a welfare improvement when the 


?!$ee Appendix for a proof. 
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FIGURE 5. SEQUENTIAL EQUILIBRIUM DEMAND CURVE AS 6; — 0;,, FOR ALL i 


cartel’s cost is 0;. An increase in the convic- 
tion probability B or the damage multiple t 
increases the equilibrium quantity q*(6,), de- 
creases the equilibrium price p*(0;), and thus 
increases social welfare. 


Proposition 4 emphasizes the marginal 
deterrent effect of antitrust under asymmet- 
ric information. This implies that policy 
makers can influence both cartel pricing 
behavior and consumer enforcement deci- 
sions by the expected damage multiple. In 
particular, policymakers can increase the 
damage multiple ¢ or raise the likelihood of 
conviction 8 through such procedural 
changes as rules of evidence or through 
public enforcement efforts. l 

The basic model can be generalized to 
incorporate various institutional features. It 
is expected that the nonneutrality of private 
antitrust enforcement would be robust to 
these changes. Legal costs would be ex- 
pected to moderate markups if they lie in a 
critical range, as in the full-information set- 


ting. A reference price for’ calculating 
damages that lies above marginal cost would 
strongly reinforce our nonneutrality result. 
If the consumer and firm differ in their 
estimates of the likelihood of. conviction, 
this would further enhance the effects of 
private enforcement. Decoupling of damage 
payments and penalties would also be ex- 
pected to enhance the effectiveness of pri- 
vate enforcement under asymmetric in- 
formation, at least in the presence of a 
reference price. An interesting extension of 


22an interesting aspect of positive legal costs for 
consumers and firms is that it may lead to a selection 
bias in terms of the cost types that are actually sued. If 
consumers have positive fixed legal costs (as opposed 
to legal costs that are proportional to the damage 
award) it will not be worthwhile for them to sue if the 
expected markup is sufficiently low. This suggests that, 
in equilibrium, there will be some cartel types that are 
not sued. Nevertheless, private antitrust enforcement 
may still be effective even though a cartel is not sued 
since the cartel may lower its markup to avoid incur- 
ring the legal costs of suit. 
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the present model would be to allow for 
pretrial settlements. This might affect the 
deterrence properties of private enforce- 
ment. Also, the presence of asymmetric in- 
formation would complicate the bargaining 
over the out-of-court settlement. 


V. Conclusion 


Private antitrust enforcement with a mul- 
tiple damages remedy can enhance social 
welfare if consumers lack complete infor- 
mation about the cartel's production costs. 
At the sequential equilibrium, the cartel 
price conveys cost information to con- 
sumers. On the basis of their updated ex- 
pectations, consumers make demand deci- 
sions and estimate potential damage awards. 
This yields a modified market demand 
schedule for the cartel. At the resulting 
market equilibrium, the cartel's markup is 
moderated, leading to increased output and 
social welfare. 

The sequential equilibrium model of pri- 
vate antitrust enforcement indicates that 
asymmetric information can increase social 
welfare. This is in contrast to the full-infor- 
mation models of Baker (1988) and Salant 
(1987) in which neutrality arises. In many 
applied game-theoretic models, informa- 
tional asymmetries create distortions in de- 
cision making that reduce social welfare. 
The gains from information asymmetry ob- 
tained here are the result of mitigation of 
monopoly price fixing. Under full informa- 
tion, consumer demand and cartel mark- 
ups perfectly adjust for antitrust damages. 
However, it is reasonable to expect that 
consumers can only imperfectly estimate 
potential damages. In fact, the presence of 
asymmetric information alleviates the “per- 
verse effect" of increased consumer pur- 
chases to alter the damage award. Collusive 
firms must then moderate price increases in 
equilibrium to avoid raising consumer esti- 
mates of costs and the resulting lowering of 
demand. . 

The analysis suggests a connection be- 
tween public and private antitrust enforce- 
ment. Private antitrust enforcement efforts 
generally occur after successful public en- 
forcement. Elzinga and Breit (1976, p. 142) 
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attribute this to the fact that "the private 
sector still does not have the ‘fishing rights’ 
available to the government to investigate 
possible antitrust violations prior to an ac- 
tual case." Suppose that the likelihood of a 
private plaintiff winning a case, B,, rises to 
B,» B, following a successful public an- 
titrust suit. Proposition 3 obtained a critical 
minimum level $* of the expected damage 
multiple for which private antitrust enforce- 
ment can enhance welfare. Suppose that 
B2t > $* » Br. Then, public enforcement 
effort, by raising the expected damage mul- 
tiple in price-fixing cases, can enhance the 
effectiveness of private enforcement. 


APPENDIX 


PROOF OF LEMMA 1: The incentive compatibil- 
ity condition (5) implies, for 6; > 6;, 


(6; -6))a* (o*(6))) (1 Bt) < «*(6,) - 7*(8)) 
x (8; —6,)a* (0*(6)) 1 - Bt). 


This implies 4*(p*(6))2 q*(p*(6)) and 7*(0;)> 
m*(0,). 

Now, suppose 0; > 6;. We have just established that 
a*(p*(6,;)) < q*(p*(0,)). A necessary condition for (5) to 
hold is p*(6,) > p*(G;). y o 


PROOF OF LEMMA 2: As a first step, we prove 
that p; €[p*(8)), p*(6;,.,)] for any i € (1,...,m — 1}. The 
incentive compatibility condition (5) implies 


(A1) 7*(6;)— 7*(0,.1) 
< (6;41- 9;)4"(p*(8;)) (1 - Br). 


Thus 


v*(60;) 
4*(p*(8)))(1- Bt) 


D;2=0;+ = p*(6;). 


A symmetric argument establishes p; > p*(6;,,). Thus, 
in general (that is, whether the equilibrium is fully 
separating or not), p; < p*(0)) < p; < p*(0;4.1) € Dj, 1. 
Now, in a fully separating equilibrium, p*(@) is strictly 
increasing in 6, so it follows immediately that p; < p;. ;. 
Moreover, because o*(0;) &[D;. ,,P;] for prices that 
are on the equilibrium path, the beliefs specified in the 
lemma are consistent with Bayes' rule. 

Now, as a next step, fix an arbitrary i € Q,..., m — 1) 
and p €(p; ,,p;]. We will now show that 





(A2) 
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To prove (A2) note, first, that for any k 


a*(8,) S 7*(0,,1) 


A3) h(0,,p)—h(0,,.,, p) 
(A3) h(6,,p)- R(0,.1, p) p-9, Pr, 


n m*(6,) —7* (81.1) 
(5p-8,)(p—9,.1) 


We are given that p x p;, so it is indeed the case that”? 
h(0,, p) <h(6;1,P). 


Now, recall p; s p; ,.,. Employing (A3) again (now with 
k — i4 1) yields 


hbi’ p) € h(9;42,P)- 


Iterating upward in this fashion for i+2, i+3,...,m 
yields the result that 


(A4) h(0,,p) <h(8;,p) for p< Dj, 
J € (i 1,..., m). 


Now, we are also given that p> p; ,. Again using 
(A3) (with k = i — 1), it follows that 


h(8;.,, p) = h(6;, p). 


Moreover, from our result above, p» p; 2 Dj. 5. 
Once again using (A3) (with k =i —2) yields 


h(0, 5. p) > h(8, 1. p). 


Iterating downward in this fashion for į —3,...,1 yields 
the result that 


(A5) h(0;, p) s h(8), p) for p > Bii 
je{i-1,...,m}. 


For p € (pj..,, Bj] , conditions (A4) and (A5) simulta- 
neously hold, so 


6,= arg min 
8 €(0,,...,0,, 


pU 


The definition of universal divinity thus requires beliefs 


Straightforward algebra establishes p,_, > 6, 
Thus, p > 6. Moreover, without loss of generality, we 
have focused on the case in which p > 6;,,. If p'« 6; ,, 
then A(8;, p) =~ for all j 2 i -- 1 and the result that we 
are trying to establish in this part of the proof, A(8;, p) 
< h(8,, p) for j x i +1, follows trivially. 


[» - 5,]. 


SEPTEMBER 1990 


given by 
y' (p) » AMforpe( pj, ji e Q,..., m1). 


For p €(6;, p,], logic that is identical to that just 
employed leads to the result that 


h(0,,p) xh(8j, p), for 
FE (2,...,m}, pe (6, p). 


Universal divinity thus requires y#(p)=1 for p < By. 
Similarly, for p > p,, 1 condition (A3) can be shown to 
imply 


h(8,,p) sh(0,p), forje(L....m—-1), 
P> Pm-1. 
Thus, universal divinity requires 7,*(p)=1 for p> 


Dm-1. a 


PROOF OF LEMMA 3: Suppose, to the contrary, 
that the equilibrium involves pooling. Let T c 
(6,,...,0,,) be a pooling set whose smallest element is 
0r. For 8€ T, 


(A6) v*'(8)- (p*(07)— 9)a*(o*(07)) 
= (p*(07) — 0)a/(p*(67),8(T)). 


where: 


(A7) &n-| Y "| / | 2 sper 
ij:8,e T) U:0€ T) 


That is, &(T) is the conditional expectation of the 
cartel’s cost, given that 0 is in the set T. Condition 
(A7) follows because equilibrium beliefs must satisfy 
Bayes rule. . 

Now consider a price p(e) = p*(87.) — where e > 0. 
Using the logic in the proof of the previous lemma, one 
can show that 


(A8) | 68rzarg min 
0 «(8 


pee 


h(8, p(e)). 
Om? 


By (A8), universal divinity requires that for p slightly 
less than p*(07), consumers place positive probability 
weight only on cost realizations of 07. or smaller. This 
implies 6*( ple)! x 67 < &(T). Thus 
(A9) lim ( p(e)— 8)4*( p(e)) 
E 
-(o*(-) - 8)e/(o*(67). im^ p(e))) 
€ 
` z(p*(87)- 0)a/(0*(07),07) 
» (o*(97) - 9)a/(p*(07),8(T)), 
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where both inequalities follow because. 9/( p,0) is de- 

creasing in 0. But (A9) implies that a cartel in the pool 

can increase profits by charging a slightly lower price, a 

contradiction of the conditions for a sequential equilib- 

rium, QO 
PROOF OF PROPOSITION 1: 


(a) Suppose, to the contrary, that p*(6,,)# p/(6,,). 
Then, 


a*(»/(8,)) = a/( p/(8,),9*( »/(8,,))) 


= q/( /(8,).9,.), 


because 0*( p/(6,.)) < Om and q/(p,6) is decreasing in 
0. Thus 


(A10). (5/(8,)—6,)a*( »/(6,.)) 
> (»/(8,) - On)! p/(8,),8,) = 1100n), 


where 7r/(6,,) denotes full-information monopoly prof- 
its. But by Lemma 3 the equilibrium is fully separating. 
Thus 6*(p*(9,,)) = Om, which implies 


T*(6,,) = (p*(6,)— 6,,)a/(p*(8,,),6,.) 


<1!(6,,): 


(A11) 


The inequality i in (A11) follows because p/(6,,) is the 
unique maximizer of ( p — 6, )g/( p,0,,). But (A10) and 
(A1D imply that the cartel can increase profits by 
charging p/(6,,), contradicting the definition of a se- 
quential equilibrium. ü 
(b) Let us refer to the constrained optimization prob- 
lem in the statement of the proposition as Problem-(b). 
Let p(0,) be a solution to Problem-(b). One can show 
that a solution to Problem-(b) has the following fea- 
tures? 


(A12) B(8;) < p/(6;) 


(A13) For p « A(6;), Zlo- 8,)a/( p,6,)] > 0. 


(A14) For p< (6), (p — 9:41)4/(P,6;)(1 - Bt) 
& *(6;,,). 

(A15) B(6;) is unique. 

If we can show that a sequential equilibrium price 


p*(@;) solves Problem-(b), property (A15) implies that 
the sequential equilibrium must be unique. 


Proofs of these properties are available on re- 
quest. 
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Suppose, to the contrary, that the sequential equi- 
librium price p*(6;) is not the solution to Problem-(b); 
that is, 5(6;) + p*(6;). There are two possibilities: ei- 
ther p(8) < p*(8;) or p(0;)> p*(6;). 

If BG, )« p* (8), then given the structure of beliefs 
in Lemma 2, 8*( 5(8))) < 0. This implies 


q* (&(6,)) = a! ( 6(6;),6;), 
which, in turn, implies 
(8(6,) - 6)a* CPO- BF) 
= (5(6,) — 6,)a/ ( (8;),6,) (1— Bt) 
> (p*(6;) - &;)a/(p* (6.).6/)(1 — Br) 
= (0*(8)) - 6;)a*(o*(6,)) C1 - BD. 


The second inequality follows because the equilibrium 
price p*(8;) must satisfy (6) (the equilibrium must be 
incentive compatible) and is thus a feasible, though not 
optimal (given our contrapositive assumption), solution 
to Problem-(b). But the condition (A16) contradicts the 
definition of a sequential equilibrium, so we cannot 
have p(0,) < p*(6,). If p(8,)> p*(6,), then (A13) and 
(A14) are relevant. In particular, (A14) can be shown: 
to imply 


(A16) 


p*(8;) < Pi 


where p; is defined as in Lemma 2. For a price p 
above p*(6;) but less than pj, y*(p)=1 so q*(p)= 
q/(p,6;) in this neighborhood. But (A13) would then 
imply that : 


(p-6;)a*(p)Q— Bt) 
> (»*(8) - 8,)a* (p*(8))) (1 ~ Bt), 


for p €(p*(0)), pj), contradicting the definition of a 
sequential equilibrium. a 

(c) The specification of equilibrium consumer de- 
mand follows directly from the specification of con- 
sumer beliefs in Lemma 2. a 


PROOF OF PROPOSITION 2: See part (b) of the 
previous proof. g 


PROOF OF PROPOSITION 3: 

(a) Note that 8t < $* implies 8t € $; for all i € 
{1,... m — 1}. Straightforward induction arguments es- 
tablish that constraint (6) will not bind at the optimal 
solution to Problem-(b), for all i € (1,...,m —1}. ü 

(b) If Bt » ó*, there exists some ie... m-1) 
such that Bt > $;. Let k be the largest such i. For j > k 
we thus have welfare neutrality and 7*(8,) — e M(8,) 
for j>k, But the Problem-(b) constraint will bind 
because Bf > $, implies 


7" (0,) > m" Oka) + (Orai Ora (6,)(1— Bt). 
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PROFIT 







T (8) 


m*(8) 


m* (0,0) 


: MOa 


SEPTEMBER 1990 


Toc mOi) +O- Bati- Bt) 


shift due to increase 
in Bt 


—— 
š 
— —À 


8) QUANTITY 
q*(0i 


FIGURE Al. COMPARATIVE STATICS ON CARTEL’S CONSTRAINED 
OPTIMIZATION PROBLEM 


When constraint (6) binds, welfare neutrality no 
longer obtains and p*(8,) < p/(6,). n 


Linear Demand / Uniform Distribution Example. For lin- 
ear demand and a uniform distribution of cost types 


die (a7 8), 


SO $;,,» $j Find the lowest i such that $;z i. 
Label that by the index j. For i &(j,...,m — 1) condi- 
tion (7) holds. Arguing by induction, one can prove 
that the equilibrium is welfare neutral for i € {j,..., m). 
For i=j-—1, condition (7) does not hold and the 
equilibrium will not be welfare neutral. Thus, 


7* (6.1) « 7" (6,.,). 


Now we have $,.5 € $;., « Bt. It therefore follows 
that i : 


(A17) (6-2) 
»«M(68, 4) l 
—6;.5)a"(8j.;)(1 - Bt). 


> v*(80,.:) 


+(6;-1 


+ (9;-1 P. j-2)4"(8-,)(1 — t). 


Examination of Problem-(b) reveals that when (A17) 
holds, the equilibrium must result in a welfare im- 
provement; that is, p*(6;..,) < p/(6;...). Repeating this 
logic for j —3,...,1 implies that the equilibrium results 
in a welfare improvement forallie(j—1,....1). D 


PROOF OF PROPOSITION 4: Using the expressions 


for q/(p,6;) and p/(q,6;) from Section Il, we can 
restate Problem-(b) as i 


7*(8;) 2 max(P(a) —6)a 
q 
subject to: (P(q)—6,)q x 7"(8,,,) 
- *(06,,1—8)(1— Bt)a. 
As a first step, we show that 


dz" (8) 


(A18) dO 





<0 foralli. 


The proof is by induction. Condition (A18) holds for 
i= m (since 7*(6,,)= «(8,), so dr*(8,,)/ a(B1) = 0). 
Suppose (A18) holds for i= j+1. We will now prove 
that it also holds ior i= j, which establishes the result. 
By the envelope theorem 


da*(0;) _ (oe 1) 
j 


d(Bt) s d(Bt) (rer (0). 
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where Aj2=0 is the Lagrange multiplier for the con- 
straint in Problem-(b) and q*(@;) is the solution. 
Clearly, d *(6,)/d(Bt) <0, so, by induction, the re- 
sult holds for all i. 

Next, suppose that the equilibrium is not welfare 
neutral at 0j. Then the solution to Problem-(b) is as 
shown in Figure Al. Given (A18), an increase in Bt 
shifts the line segment ABC downward and flattens it 
out. It follows that q*(0;) increases and p*(0;) de- 
creases. a 
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Ex Post Liability for Harm vs. Ex Ante Safety Regulation: 
Substitutes or Complements? 


By Cuarzes D. KorsrAp, THomas S. ULEN, AND Gary V. JoHNSON* 


This paper concerns the regulation of hazardous economic activities. Economists 
have generally viewed ex ante regulations (safety standards, Pigouvian fees) that 
regulate an activity before an accident occurs as substitutes for ex post policies 
(exposure to tort liability) for correcting externalities. This paper shows that where 
there is uncertainty, there are inefficiencies associated with the exclusive use of 
negligence liability and that ex ante regulation can correct the. inefficiencies. In 
such a case it is efficient to set the safety standard below the level of precaution 
that would be called for if the standard were used alone. (JEL 619) 


One of the main issues that dominates 
the economic literature on optimal regula- 
tion is the choice of the most efficient policy 


for correcting an externality. From its be- . 


ginnings the literature has focused on alter- 
native forms of what may be called ex ante 
policies (for example, safety standards, 
Pigouvian taxes, and transferable discharge 
permits) that affect an activity before the 
externality is generated. But in the past 
decade researchers have analyzed the abil- 
ity of what may be called ex post policies 
(for example, exposure to tort liability) to 
control externalities.! These latter policies 
regulate the externality only after it has 
been generated and harm has occurred. The 
threat of suit causes the potential injurer to 
internalize the expected social damages and 
thus to take optimal precaution. 


*Department of Economics and Institute for Envi- 
ronmental Studies, University of Illinois, Urbana, IL 
61801; Department of Economics and College of Law, 
University of Illinois, Champaign, IL 61820; Food Mar- 
keting Policy Center, Department of Agricultural Eco- 
nomics, University of Connecticut, Storrs, CT 06268. 
We have benefited from comments by two anonymous 
referees, Robert Cooter, Richard Craswell, Robert 
Gritz, Judy Lachman, Steven Shavell, V. Kerry Smith, 
and participants in workshops at MIT, Colby, and the 
Universities of British Columbia, Chicago, Illinois, and 
Washington. 

‘John Brown (1973) and Peter Diamond (1974) were 
among the first to mathematically articulate Guido 
Calabresi's (1970) theories of liability as a means of 
controlling externalities, 
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Economists have generally viewed ex ante 
and ex post policies as substitutes for cor- 
recting externalities. The usual policy rec- 
ommendation has been to choose the less 
costly regulatory policy to administer. For 
instance, in the case of chopping down a 
tree in one's yard, it is less costly to use 
threat of suit to force appropriate caution 
than to construct a myriad of permits and 
regulations covering tree-falling. An' exam- 
ple at the oiher extreme is air pollution, 
where it is less costly to promulgate well- 
thought-out regulations than to let each in- 
jured party take injurers to court. Rarely is 
the joint use of ex ante and ex post policies 
recommended for a given externality.’ 


This theoretical conclusion, however, 
stands in stark contrast to actual policy. 
One of the most noticeable features of cur- 
rent policy dealing with externality-gener- 
ating activities in a wide number of areas is 
that ex ante and ex post policies are very 
frequently used jointly. Consider the follow- 
ing examples. The potential inefficiencies of 
incompatible neighboring property uses— 
for example, a hospital located next to a 
noisy, dusty cement-manufacturing plant— 
are minimized by zoning ordinances (a form 
of ex ante regulation) and by simultaneously 
exposing the externality generator to nui- 


?An exception is the recent work of Steven Shavell 
(1984a,b), which is discussed in more detail later in this 


paper. 
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sance liability (a form of ex post regulation)? 
Similarly, society attempts to minimize the 
harms that new pharmaceuticals may inflict 
on users by requiring the manufacturers of 
drugs to engage in specific tests before the 
drugs are licensed by the federal Food and 
Drug Administration for prescription and 
sale (a form of ex ante regulation) and also 
by thereafter exposing the drug manufactur- 
ers to strict products liability (ex post regu- 
lation). In the field of environmental exter- 
nalities, the potential harms of toxic wastes 
are regulated at the federal level by the 
Resource Conservation and Recovery Act 
(1982), which imposes ex ante siting and 
technological regulations on the generation 
and disposal of hazardous wastes, and the 
Comprehensive Environmental Resource, 
Compensation, and Liability Act of 1979, 
which establishes ex post liability rules for 
the recovery of compensatory and punitive 
damages for harms imposed by hazardous 
wastes. 

This phenomenon of complementary use 
of ex ante and ex post regulatory policies is 
so widespread that the dearth of persuasive 
theoretical arguments for this joint use is 
glaring. Various authors have identified in- 
efficiencies associated with one or the other 
regulatory policy. In the case of ex ante 
regulation, the typical criticism is that the 
central regulator has imperfect information 
on accident costs and damages (Martin 
Weitzman, 1974; William Baumol and Wal- 
lace Oates, 1971; Susan Rose-Ackerman, 
1973; Shavell, 1984b), which may lead to 
inefficient undercontrol of some wrongdoers 
and overcontrol of others. The typical criti- 
cisms of tort liability have been that suit 
may not always be brought against injurers, 
that bankruptcy provides an incentive for 
underprotection, and that uncertainty re- 
garding the legal standard leads to over- or 
underprotection, depending on the circum- 
stances.* Shavell (1984b, 1987) appears to 


>The classic comparison of the efficiency aspects of 
these alternate methods of minimizing this type of 
externality is given by Robert Ellikson (1973). 

See Brown, 1973; Robert Cooter et al, 1979; 
Richard Craswell and John Calfee, 1986; Shavell, 
1984b; 1987; Donald Wittman, 1977. 
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be alone in suggesting that ex ante and 
ex post regulation can complement each 
other in that their joint use can correct the 
inefficiencies of using either alone to cor- 
rect an externality. 

This paper first identifies a set of ineffi- 
ciencies associated with ex post liability. 
These inefficiencies are due to a potential 
injurer's being uncertain about whether a 
court will hold him liable in the event of an 
accident and suit. Our discussion formalizes 
and extends the results and conjectures of 
Craswell and Calfee (1985) and Calfee and 
Craswell (1984). In contrast to Shavell 
(1984b, 1987), we do not base our analysis 
upon the inefficiencies due to the potential 
bankruptcy of injurers and the uncertainty 
of suit by victims. Nor do we assume risk 
aversion. Having identified inefficiencies as- 
sociated with tort liability, we then demon- 
strate how ex ante regulation, if used jointly 
with tort liability, can correct some of those 
inefficiencies. 

One of our strongest conclusions, and a 
startling one, is that when ex ante and 
ex post policies should be used jointly, effi- 
ciency generally requires that the ex ante 
regulatory standard be set on a level that, if 
regulation were used alone, would provide a 
socially suboptimal level of safety or precau- 
tion. Put somewhat differently to emphasize 
this unconventional conclusion, when tort 
liability rules are in place, it is inefficient to 
set ex ante regulatory standards at the so- 
cially optimal level (where marginal costs of 
precaution. equal the expected marginal 
benefits). The only instances when the 
ex ante regulatory standard should be set 
at the social optimum are when there is no 
ex post liability or, equivalently, when there 
is a zero probability of a judgment against a 
rational injurer under ex post liability. 


I. A Model of Negligence and Safety Regulation 


` Consider the case in which a risk-neutral 

firm (or any other economic agent) engages 
in a risky activity, from which accidents can 
occur. The firm can reduce the dangers 
associated with this activity by taking pre- 
caution. Precaution reduces expected acci- 
dent costs but is costly to the firm. 
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Let x be the level of the firm’s precaution. 


in preventing an accident or reducing its 
severity. For simplicity, we will not consider 
the decisions of the potential victim by as- 
suming she always takes the socially opti- 
mal level of precaution? The injuring firm's 
costs of taking precaution are given by the 
function C(x), which is upward sloping 
[C'(x) > 0] and convex over the relevant re- 
gion. An accident will occur with probability 
p(x,e) and will be of size (cost) D(x, «) 
where e is a random variable representing 
the view-of-the-court and is distributed with 
density function g,. Assume the expected 
value of e is zero. By this assumption we 
mean to suggest that, on average, courts 
evaluate the precautionary behavior of the 
injurer and the size of the accident (D(-)) 
accurately. Define A(x) as the expectation 
of p(x,e)D(x, &) over e. Thus, A(x) embod- 
ies both the accident size (D) and the prob- 
ability of the accident occurring (p). The 
view-of-the-court is only revealed after a 
court has heard evidence about x and the 
extent of damage after an accident has oc- 
curred. Assume A(x) is convex and down- 
ward sloping over the relevant region (A’(x) 
« 0). Assume that (C(x)+ A(x)) is strictly 
convex. l 

To avoid confusion, it is useful to preview 
the three fundamentally different levels of 
precaution we. will consider. We first define 
the socially optimal amount of precaution, 
x*, where the expected social costs of acci- 
dents are minimized. We then define the 
legal standard of care, x(&), which is the 
court's interpretation of the social optimum, 
a function of e since it is only revealed after 
an accident and litigation occurs. The third 
type of precaution is the firm's precaution 
level, £, chosen to. minimize expected pri- 
vate costs to the firm. Our goal will be to 
compare X and x*. 

The socially optimal amount of precau- 
tion for the potential injurer can be ob- 
tained by minimizing expected social costs; 


> Many authors (for example, Diamond, 1974; Brown, 
1973; Cooter et al., 1979) explicitly consider the level of 
precaution taken by the potential injured party. While 
this is realistic and leads to richer conclusions in many 
analyses, it is tangential to the purposes of this paper. 
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that is, 


(1) minE[C(x) + pr, e) Dx. &)] 
= min[C(x) * A(x)]. 


At the. unique level of x that minimizes 
equation (1), x*, the marginal cost of pre- 
caution equals the negative of the marginal 
expected cost cf the accident; that is, 


Q C-A”), 


assuming the solution of equation (1) is 
greater than zero. . 

The legal standard, as opposed to the 
social optimum, is an ex post parameter, 
revealed by the courts after an accident has 
occurred. Thus the legal standard is param- 
eterized by the view-of-the-court: ¥(e). In 
fact X(e) is defined as the solution of 


(3) min[C(x) + p(x, e) D(x, e)] 


for which the first-order condition is 


(4 CG4 SEDE ui 


assuming, an interior minimum. Equation 
(4) implicitly defines X(e). Since ¢ is a ran- 
dom variable, x(e) induces a distribution on 
x, which we term q; or more simply, q. We 
assume q is continuous. 

It is at this point that the notion of liabil- 
ity enters. Under a negligence rule, the in- 
jurer is found liable for all damages if, and 
only if, his level of precaution was less than 
the legal standard of precaution. Mathemat- 
ically, the injurer's total expected costs are 
given by 


(5)  TC(x)- E[C(x) * L(x,&) 


Xp(x,e) D(x,se)]. 


where L(x,e&), the liability rule, is defined 
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by 
(6) Negligence: L(x, «) 


» l 1 if x<ž(e) 
0 otherwise f 


Let X be the level of precaution that mini- 
mizes (5). Ideally € should equal x*, in 
which case the liability rule is ex ante effi- 
cient. 

There is some potential controversy about 
what we mean by the legal standard's being 
parameterized by the court's view. We mean 
simply that at the moment at which the 
potential injurer makes her decision about 
the appropriate level of precaution, she 
faces some unavoidable uncertainty about 
how a fact finder will subsequently evaluate 
her decisions. She does not know, for in- 
stance, whether the court will evaluate her 
actions according to an objective standard 
(for example, what precaution would a rea- 
sonable person have taken in these circum- 
stances?) or a subjective standard (for ex- 
ample, will the court attempt to place itself 
in the decision environment that the injurer 
was in when she decided how much precau- 
tion to take?) (Calabresi and Alvin 
Klevorick, 1985). In addition, even if there 
is no uncertainty about how the court will 
look at the injurer's precaution, there may 
be uncertainty about the court's ability to 
evaluate the evidence of the injurer's pre- 
caution. The injurer and the court may sam- 
ple from the same population, but their 
sample means need not be identical. 


These forms of uncertainty make our fundamental 
point that an injurer,. ex. ante, will be unsure about a 
court's interpretation or view of the evidence. Consider 
further the uncertainties that arise in an injurer's mind 
about whether she will be held to some form of the 
negligence standard (and what form?) or to a strict 
liability standard (under which the victim's precaution- 
ary behavior is not evaluated), about what jurisdiction 
and under which judge litigation may occur, and about 
whether she might be held liable for punitive damages 
or just for compensatory damages. Finally, as Judge 
Posner (1988) has noted, there is an inherent uncer- 
tainty about cases litigated to judgment because, if the 
matter were clear, the litigants would have settled out 
of court. Diamond (1974) views this uncertainty from a 
somewhat different perspective. He assumes the firm 
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A basic result of Brown (1973) is that 
when the legal standard is defined as in (4) 
and the firm knows that standard with cer- 
tainty, then the negligence rule is efficient. 
This conclusion in the case of negligence is 
qualified by Calfree and Craswell (1984). 
Their argument hinges on ex ante uncer- 
tainty on the part of the firm regarding the 
legal-standard, x(e). Unfortunately, for the 
most part Calfee and Craswell are unable to 
prove their conjectures and must rely on 
cogent argument and numerical examples.’ 

Shavell (1984b, 1987). provides the only 
thorough treatment of correcting the inef- 
ficiencies of tort liability by supplementing it 
with ex ante safety regulation. But instead 
of relying on uncertainty, Shavell argues 
that negligence is inefficient because L(x) 
<1. And this, he suggests, is due to (a) a 
positive probability that suit will never. be 
brought against an injurer, and (b) an in- 
jurer. not needing to plan for accidents 
whose cost (D) exceeds the injurer's assets. 

For symmetry, Shavell also suggests that 
ex ante regulation by itself is inefficient be- 
cause D is not known with certainty to the 
regulator (but is known by the injurer) and 
pure ex ante regulation requires one level of 
care for all injurers. This means that firms 
that cause small accidents are overregulated 
and firms that cause large accidents are 
underregulated. A mixed regulatory system 


knows the legal standard of care with certainty but the 
firm is uncertain about how its precautionary measures 
translate into safety levels and it is these safety levels 
that are measured by the court. However, the effect is 
the same: for a given level of precaution the firm is 
uncertain as to whether it is above or below the legal 
standard. Cooter and Tom Ulen (1986) examine evi- 
dentiary uncertainty, or uncertainty in exactly how a 
court will interpret evidence in deciding whether the 
firm's level of precaution was above or below the “legal 
standard." 

7Craswell and Calfee (1986) prove that for a legal 
standard symmetrically distributed about x*, small lev- 
els of uncertainty lead to oversupply of precaution, 
provided density is concentrated at x* for low levels of 
uncertainty. At the optimal level of care, the marginal 
costs of precaution just offset the marginal accident 
costs from precaution. But the injurer also sees a 
marginal savings in liability due to the unpredictability 
of the legal standard. Thus, the potential injurer may 
take precaution £ * x*. 
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results in firms that cause little damage be- 
ing regulated by the ex ante regulation and 
firms that cause great damage being regu- 
lated by the threat of liability. Given the 
inefficiencies built into ex ante safety regu- 
lation and ex post liability for harm, it is 
easy to show that a hybrid does no worse 
and frequently does better than either ap- 
proach alone.® Key to the Shavell analysis is 
that L(+) is strictly less than 1. If bankruptcy 
is not a possibility and suit is always brought, 
then L(x)=1 and there are no inefficien- 
cies associated with liability. 

We take a different approach in this pa- 
per. Similarly to Craswell and Calfee (1986), 
we suggest that it is uncertainty over the 
legal standard that leads to inefficiencies 
with negligence. In fact, in the next section 
we prove all of their conjectures as well as 
others regarding the efficiency of negligence 
liability. In the subsequent section of the 
paper, we show how ex ante regulation can 
be used to correct some of these ineffi- 
ciencies. , 


II. The Inefficiency of Negligence 


Our basic model of negligence was devel- 
oped in the previous section. The legal stan- 
dard of precaution, x(e), is defined implic- 
itly by (4). Should an accident occur, litiga- 
tion will reveal the true view-of-the-court, e, 
and its realization, x, on q. If a court finds 
that the firm's level of precaution was less 
than x, then the firm will be liable for all 


accident costs; if greater than x, no liability 


will apply. 

The firm does not know the view-of-the- 
court, e, when it chooses X. The firm must 
choose an X based on an uncertain legal 
standard, x(e). As defined above, we let 
q(x) be the injurer's subjective probability 
distribution around the legal standard, the 


"This result is strikingly similar to that of Marc 
Roberts and Michael Spence (1976). They argue, in an 
entirely different context, for a hybrid system of price 
and quantity controls to optimally control an external- 
ity. The analogy to our problem is that quantity con- 
trols are akin to ex ante regulations and price controls 
are similar to tort liability (in that liability induces the 
firm to equate marginal damage and marginal precau- 
tion costs). 
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level of precaution that the firm must pro- 
vide to avoid being held liable for accident 
costs. We assume that q(x) is a continuous 
probability density with support (—~,).? 
The probability that the injurer’s level of 
precaution x will end up being below the 
legal standard of care applied in the case of 
an accident is thus given by 


(7) R(x) = f a(x) de. 


That is, R(x) is the probability when all is 
said and done, after the court has passed 
judgment, that the injurer will pay damages 
E[ p(x, e)D(x, e)) = A(x). We have already 
assumed that C(x) and A(x) are convex. 
We now make the slightly stronger assump- 
tion that [C(x)+ A(x)R(x)] is strictly con- 
vex. 

The essence of our model is presented in 
Figure 1. The expected legal standard, which 
is defined to be the socially optimal level of 
precaution, is where the marginal precau- 
tion costs just equal the negative of the 
marginal expected accident costs, as indi- 
cated in equation (2). With uncertainty, the 
injurer does not know x precisely. The in- 
jurer’s uncertainty about the legal standard 
to which it will be held accountable is em- 
bodied in q(x). If the injurer takes € amount 
of precaution, then the probability that it 
will be held liable is the area under the 
density function from € to o, R(X), the 
cross-hatched area in the figure. This is the 
probability that should an accident occur, 
the firm will be found to be taking an inade- 
quate amount of precaution and, thus, be 
liable for the victim's harm. 

When there is this type of uncertainty, 
the injurer's objective function (in the pres- 
ence of potential liability) is defined by (5) 
except that L(x) is replaced by R(x): 


(8 TC(x) = E[C(x) 
+ R(x)p(x,e) D(x,&)] 
EGO FAQ ROX), 


“Alternatively, one could argue that the support 
should be [0,5]. ` 
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Figure 1. THE SOCIAL PROBLEM WITH 
EVIDENTIARY UNCERTAINTY FOR THE INJURER 


which the firm attempts to minimize. TC(x) 
is strictly convex, by assumption, and thus 
has a unique minimum. Let € be the level 
of precaution that minimizes (8). The 
first-order condition for the minimization is 


(9) TC'(%) =C'(¥) + A'(X) RCX) 
— A(x)a(X) =0, 


provided € is greater than zero. 

Equation (9) is basic to much of our anal- 
ysis and thus deserves some interpretation. 
The first term on the right-hand side of 
equation (9) is the marginal cost of provid- 
ing the fth unit of precaution, and we as- 
sume that this marginal cost is rising. The 
second and third terms, which together we 
might call "marginal (expected) liability 
costs," sum to the expected marginal liabil- 
ity costs of the fth unit of precaution and 
consist of two effects. The first of these 
terms [ A'(Z)R(Z)] is the marginal (reduc- 
tion in) accident cost times the probability 
of being held liable for the accident if the 
firm has taken precaution equal to x. This 
term, which might be called the "injury 
effect," is negative because A'(X) is nega- 
tive and R(X) is always positive. The injury 
effect represents a savings to the injurer 
from the application of greater precaution 
because expected accident costs are re- 
duced. But there is also a savings from 
providing slightly higher precaution in that 
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the probability of being held liable is re- 
duced. The monetary savings is the product 
of the change in the probability of liability 
and total expected accident costs. This sav- 
ings is captured in the term [— ACOq()]. 
This term, which might be called the “liabil- 
ity effect," is negative because both A(X) 
and q(X) are positive. Thus, the marginal 
(expected) liability costs can be decomposed 
into an injury effect and a liability effect, 
both of which have the same sign and both 
of which indicate that marginal (expected) 
liability costs decline in precaution. 

The question that arises is whether the 
level of precaution, X, chosen by the firm to 
minimize its expected costs is greater than, 
less than, or equal to the socially optimal 
level of precaution, x*. An evaluation of the 
relationship between X and x* can be made 
by evaluating the sign of TC'(x*) in equa- 
tion (9). Since by assumption X minimizes 
TC(x) and TC(x) is strictly convex, TC'(x) 
«0 for x « and TC(x)>0 for x» x. 
Thus, if TC'(x*) «O0, then x*« £; and if 
TC'(x*) > 0, then x* > x. Evaluating TC'(x) 
(n equation (9) at x*, using equation (2), 
gives 


(0) TC'(x*) = C'(*)[1- R(x*)] 
- A(x*)a(*). 


Since C'(x) > 0 by assumption and R(x) <1, 
then C’(x*)[1— R(x*)] in equation (10) is 
nonnegative. Also, since by definition A(x) 
and q(x) are greater than or equal to zero, 
the term — A(x*)q(x*) is . nonpositive. 
Therefore, the sign of equation (10) is inde- 
terminate and the relationship between x 
and x* cannot be discovered without know- 
ing the magnitude of the various terms. Any 
further evaluation of equation (10) will re- 
quire further assumptions regarding the na- 
ture of the distribution q(x) and the size of 
the marginal cost of precaution. First let us 
turn to assumptions about the nature of the 
distribution q(x). 


10The negative sign in front of the term [ A(£)q(£)] 
is due to the fact that R'(Z)=— q(<). 
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A. The Effect of the Level of Uncertainty 
About the Legal Standard 


We are concerned here with the extent of 
the uncertainty regarding x; for example, 
the size of the variance of X, which is dis- 
tributed as g(x). Let us consider two cases, 
one where there is a great deal of uncer- 
tainty with regard to the legal standard, and 
one where there is little uncertainty with 
regard to the standard. An example of the 
first case is the great uncertainty regarding 
the appropriate standard of care when deal- 
ing with a new. technology, for example, 
genetic engineering. The level of scientific 
knowledge regarding the potential for acci- 
dents and the extent of the damages may be 
low, and it may therefore be difficult to 
determine in the first instances of accidents 
what the socially optimal level of precaution 
in testing, production, warnings, and dis- 
posal, is for genetically engineered output. 
An example of the second case, where there 
is very little uncertainty about the appropri- 
ate legal standard, might be the case for a 
well-recognized harm where the costs and 
benefits of accident precaution are well 
known and legal precedent is well estab- 
lished, for example, an automobile accident. 

We are concerned with the effect of the 
extent of uncertainty in X, embodied in 
q(x), on the sign of TC'(x*) in (10). Before 
addressing this concern, we must be some- 
what more precise about what we mean by 
more or less uncertainty. The conventional 
notion is that of second-order stochastic 
dominance (Steven Lippman and John Mc- 
Call, 1981). But just because one distribu- 
tion dominates another in this sense does 
not assure us that the density function will 
be any different at x*, which is important in 
the present analysis. To facilitate our com- 
parative statics analysis, we will introduce a 
class of distributions, members of which 
differ in terms of location and scale parame- 
ters (for example, Jack Meyer, 1986); that 
is, two distributions, with continuous density 
functions f and g, belong to the same class 
if there exists an a > 0 and f such that 


(11) f(x) = ag(ax * B). 
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Consider the class of distributions differing 
only in location and scale (a) that contains 
q(x), and (b) whose members have identical 
means, x*. This defines a family of mean- 
preserving spreads and implies that B= 
(1— a)x*. In particular, parameterize the 
set of distributions by 


(12) qalx) - ag[ax € (17 a)x*], 


where (by assumption) the legal standard is 
distributed as g(x), defined over the non- 
negative reals, with expected value x*. It 
can readily be seen that g,(x) & q(x). Fur- 
thermore, q, is a well-behaved density func- 
tion for all values of a>0, and random 
variables distributed according to q and q, 
have the same mean. As «a decreases, the 
spread of q, increases; as o increases, the 
probability mass becomes concentrated at 
the mean. Thus, in this instance uncertainty 
in the legal standard is inversely related to 
the size of a. 

As uncertainty becomes larger, that. is, as 
‘a becomes smaller, q(x*) becomes smaller, 
and TC'(x*) in equation (10) eventually be- 
comes positive. This implies that i is less 
than x*. That is, the greater the uncertainty 
in the legal standard, the more likely it is 
that a potential injurer will take /ess than 
the socially optimal amount of. precaution. 
As uncertainty becomes less, that is, as « 
becomes larger, the probability mass be- 
comes concentrated at x* and TC'(x*) be- 
comes negative. This implies that i is 
greater than x*. That is, the less the uncer- 
tainty in the legal standard, the more likely 
that the firm will take more precaution than 
the social optimum. 

. We can now state our first result regard- 
ing the effect of uncertainty on the use of 
negligence as a liability rule." 


PROPOSITION 1: Assume that to an in- 
jurer, the legat standard is uncertain (distrib- 


"This result for overprotection was proved, for the 
special case of symmetric distributions, by Craswell and 
Calfee (1986). Shavell (1987) provides an elegant proof 
tha: sufficiently small uncertainty leads to overprotec- 
tion. . 
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uted as q(x)) with an expected value of x*; 
assume q(x*) > 0. If uncertainty regarding the 
legal standard (in the sense of equation (11)) 
is sufficiently large (small), then the injurer 
subject to a negligence rule will underprotect 
(overprotect). 


PROOF: 

To prove that with sufficiently large un- 
certainty a firm will underprotect, we need 
to show that there exists a more spread-out 
version of q(x) such that TC'(x*) in equa- 
tion (10) becomes positive. Equation (10) 
can be rewritten using q, from equatio 
(12) as - 


(13) TC'(x*) =C'(x*)[1- R,(2*)] 
- A(x*)q,(x*) 
- C'(x*)[1- R,(2*)] 
- A(x*)aq(x*). 


Obviously, there exists an & >0 such that 
equation (13) is positive (since C'(x*) » 0 
and regardless of œ, R,(x*)«1) Con- 
versely, since A(x*) > 0 (because A'(x*) « 0 
and A(x*)- 0) and q(x*)» 0, there exists 
an «7 0 such that equation 13 is negative. 

o 


Thus, even though the injurer’s expected 
value of the legal standard is equal to the 
social optimum, uncertainty is sufficient to 
result in over- or underprotection.'? This re- 


"an alternative explanation of this result can be 
made with reference to equation (9), The last two 
terms of equation (9), the marginal liability costs, are 
the savings from increased precaution. Equation (9) 
can be rewritten as 


C'(X) = — A(X) R(X) + A(X)a(X). 


An increase (decrease) in the injurer's uncertainty at a 
point X decreases (increases) the right-hand side of 
this equation. This comes about because both R(x) 
and q(x) become smaller (larger) as uncertainty in- 
creases (decreases) At the equilibrium this implies 
that a lower (higher) level of precaution is taken by the 
injurer. ] 
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sult flows from the definition of negligence. 
Under that liability standard, if one slightly 
undercomplies with the legal standard, one 
is just as liable as if one greatly undercom- 
piled.” In deciding whether to increase or 
decrease precaution from x*, the injurer 
must trade off the marginal cost of precau- 
tion against the expected marginal benefits 
(in the form of the sum of expected marginal 
accident costs and the change in the likeli- 
hood of being found liable). When there is 
very little uncertainty in the determination 
of the legal standard, overcompliance re- 
sults in only slightly higher expenditures for 
precaution but in a greatly reduced likeli- 
hood of being held liable and, therefore, 
paying for the victim's. losses. Similarly, 
where there is a great deal of uncertainty 
surrounding the legal standard of care, un- 
dercompliance greatly reduces precaution- 
ary costs while only slightly increasing ex- 
pected liability costs. 


B. The Effect of the Marginal Cost of 
Precaution at the Social Optimum 


We now consider the effect of C'(x) on 
over- or underprotection, holding g(x) con- 
stant. Of course, changes in C'(x) do change 
x*. Nevertheless, in equation (10), if mar- 
ginal costs. of protection are sufficiently 
large, then TC'(x*) can be driven positive. 
In that case there can be underprotection, 
that is, € < x*. By a similar argument it is 
clear that as C'(x*) goes to zero, then 
TC'(x*) becomes negative, implying. over- 
protection, that is, € > x*. 


PROPOSITION 2: Under the: assumptions 
of Proposition 1, if q(x*)>0, then for a 
sufficiently small (large) marginal cost of pre- 
caution at the social optimum, x*, the injurer 
will employ too much (little) precaution to 
prevent an accident when faced with a negli- 
gence rule. 


P Save, of course, for the possibility that undercom- 
pliance that is "gross, malicious, willful and wanton, 
reckless, or fraudulent" may trigger punitive damages. 


896 THE AMERICAN ECONOMIC REVIEW 


The intuition of this proposition is 
straightforward. If the marginal cost of pre- 
caution around the legal standard: is very 
low, then by overcomplying the potential 
injurer raises his precautionary costs only 
slightly but greatly reduces his expected lia- 
bility costs. On the other hand, if the 
marginal cost of precaution at the legal 
standard is very large, then undercompli- 
ance greatly reduces precautionary costs 
while increasing expected liability costs rela- 
tively slightly. 


C. The Effect of Biased Perceptions of 
the Legal Standard 


In the previous section we focused on the 
effect of uncertainty with respect to the 
legal standard on over-or underprotection. 
By assumption, the mean of the distribution 
was the social optimum; what we examined 
was the effect of changing the variance or 
spread of the distribution. We now intro- 
duce a bias in the firm’s perception of the 
legal standard, x(e). We are now concerned 
less with the spread of the distribution than 
with the extent to which the firm views the 
distribution of x as biased to one side or 
the other of the social optimum. In particu- 
lar, we consider the case where the bulk of 
the probability mass is either to the left or 
to the right of x* (E[x(e)] $ x*).^ Note 
that we are considering the effect of bias in 
the perception of x* only on the part of 
potential injurers. It is difficult to imagine 
why this should occur. But imagine that 
firms believe, whether rightly or wrongly, 
that juries consistently make biased assess- 
ments of firms’ precautionary efforts and of 
the size of victims' losses (Cooter and Ulen, 
1986). 

Consider the family of distributions, con- 
taining q(x), that differ only in location and 
scale, as in equation (11), and that share the 
same variance. This implies that the dis- 
tributions are shifted with respect to one 


“This is a generalization and extension of the case 
considered by Craswell and Calfee (1986) of the whole 
distribution shifting to the right or left, although our 
results support their conjectures. 
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another: 
(14) 


Obviously, go — q(x). Thus for 8 «0, the 
distribution is biased to the right; and for 
B » 0, it is biased to the left. Note that if 
q(x) can be made arbitrarily close to zero 
for sufficiently small or large x, then by 
choosing B, ¢,(x*) can be made arbitrarily 
small with R, arbitrarily close to 0 or 1, 
depending on whether B is small or large. 
Clearly if q is a continuous density (as we 
have assumed), with support (— 9,95), then 
q(x) must become arbitrarily small for small 
or large x. 

The case where qg is biased to the right 
might be the case of a work-related harm in 
which it is difficult to show causality, for 
example, an increased incidence of lung dis- 
ease as a result of a firm's precautionary 
behavior two decades earlier. Conversely, 
suppose the firm significantly underesti- 
mates the expected legal standard. Then 
R(x*) «1. This might be the case in emo- 
tionally charged accidents where juries may 
have sympathy for victims. 


qg(x) 7 a(x + B). 


PROPOSITION 3: If the distribution q(x) is 
sufficiently biased to the left (right) of x* in 
the above sense, then the injurer will underpro- 
tect (overprotect) against an accident when 
faced with a negligence rule. : 


PROOF: 

To prove that if the distribution is suffi- 
ciently biased one gets over-or under pro- 
tection, we parameterize q(x) as in equa- 
tion (14). Assume x distributed as q(x) has 
mean x*. Equation (10) then becomes 


(15) TC'(x*) 2 C'(x*)[1- R&(x*)] 
— A(x*)ag( x*) 
= C'(x*)[1— R&G?*)] 
— A(*)a(x* + B). 
Since R(x) 0 as x —, clearly R,(x*)= 


R(x*-- B)-0 ad Bo. By continuity of 
q(x), this implies that g(x*-- 8) 50 as B 
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 «, Thus there exists a sufficiently large B 
for which TC'(x*) in equation (15) becomes 
positive, implying that £< x*. Therefore, 
for uncertainty sufficiently biased to the left, 
injurers will underprotect under a negli- 
gence rule. Trivially, as 8  —o, then 
TC'(x*) becomes negative: if uncertainty is 
sufficiently biased to the right, injurers will 
overprotect. o 


This proposition has a straightforward in- 
terpretation: if the firm perceives the ex- 
pected legal standard of precaution to be 
sufficiently less than the social optimum, 
then the injurer will underprotect; con- 
versely, if the potential injurer believes the 
legal standard to be significantly to the right 
of the optimum, it will overprotect. This 
interpretation highlights the importance, 
noted elsewhere, of setting the legal stan- 
dard equal to the social optimum in order 
for negligence to induce efficient precaution 
(Cooter and Ulen, 1988). 


III. Negligence and Ex Ante Regulation 


We come now to the important public 
policy issue of whether efficiency is better 
served by joint use of a negligence rule and 
ex ante regulation (rather than negligence 
alone). We proceed by introducing an 
ex ante safety regulation into the. model just 
developed. ; 

Safety regulation typically specifies a min- 
imally acceptable level of precaution. There 
is no uncertainty with regard to the regula- 
tory constraint; that is, the firm and the 
regulatory agency know the level of the con- 
straint? and that it is enforced with cer- 
tainty. Let the safety regulation specify 
that precaution must be at least s. How 


No uncertainty with respect to the regulation may 
seem inconsistent with the assumption of uncertainty 
for liability. However, the asymmetry is real. Regula- 
tions are promulgated ex ante. The legal standard of 
care on the other hand is rarely known with the same 
degree of precision. 

‘6Certainty of enforcement implies that the sanction 
for noncompliance is sufficiently high that the firm 
never adopts a level of precaution below the minimally 
acceptable level that is the safety regulation. 
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does information about the safety regula- 
tion influence the firm's. perception about 
the (uncertain) legal standard of care? The 
firm knows that the legal standard of pre- 
caution cannot be less than s. But the firm 
may also perceive the legal standard to be 
significantly greater than s. This seems most 
clearly to approximate the prevailing rela- 
tionship between ex ante and ex post regula- 
tion where they are jointly used. For exam- 
ple, no court today accepts compliance with 
a regulatory agency standard as a complete 
defense against a complaint of negligence. 

We first examine the impact from intro- 
ducing the safety regulation on the injurer's 
level of precaution in our previous model. 
We represent the introduction of ex ante 
regulation by changing the injurer's distri- 
bution around the legal standard of precau- 
tion. With a safety regulation, s, the injurer 
will not consider precaution below s. In 
effect, the firm's probability distribution on 
the legal standard is truncated at s. (By 
assumption, there is zero probability that 
the legal standard will be below s.) There 
are a number of assumptions that could be 
made about the firm's new truncated sub- 
jective distribution on the legal standard, 
q(x). And the reader should note that our 
remaining results hinge on the relationship 
between g(x) and q(x)" We take a 
Bayesian approach, making a simple and 
direct assumption that q(x) has a condi- 
tional distribution 2 


(16) q(x) 7 a(xlx 2 s). 


In other words, the probability mass that 
did lie below s is now distributed above s. 
Thus, we can write the conditional probabil- 
ity RGx) that the injurer will pay damages if 


'7An alternative assumption would be that all of the 
probability mass, 1— R(s), is deposited at s with q(x) 
7 q(x) for x » s. In this case Propositions 4 and 5 do 
not hold. Alternatively, as a referee has suggested, a 
regulation may decrease the probability that a court 
will find a higher legal standard. An example of this is 
the tobacco companies' successfully arguing that their 
compliance with rules on warning labels insulates them 
from tort liability. 
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its level of precaution is x as 


` R(x); 


R(x) = ROT 


(17) 


The objective function of the injurer who is 
subject to both a safety regulation and neg- 
ligence liability becomes 


(18) minTC(x)=C(x)+ A(x) R(x). 


Let £ be the level of precaution that satis- 
fies this minimization problem. Then can 
be viewed as a function of s, (s). For a 
given s, the first-order condition for £ can 
be written as 


(19) -TC(4) = RG)C(£)  A($) R() 
| -A(q(f)-0  — 


where £ is understood to mean £(s). 

The question that needs to be addressed 
is how the injurer's choice of ê changes 
with a change of the ex ante safety regula- 
tion s; that is, what is the sign of d£ / ds? 
The answer to this question requires the 
total differentiation of the first-order condi- 
tion given in (19). The result of this total 
differentiation, upon rearrangement of 
terms, is 


LN a(s)C (4) 
ds  R(s)C"2) + A'($) R(4) 
-2A4'(£)a(4) - A(£)a'(4) 


By assumption, C(x)-- A(x)R(x) is convex. 
Thus, the denominator of (20) is positive. 
The numerator is also positive, which im- 
plies that d? / ds is greater than zero. Thus; 
increasing the minimally acceptable safety 
regulation has the effect of increasing the 
precaution taken. 

The above result would imply that if X < 
x* (that is, £(0) « x*) prior to the imposi- 
tion of the ex ante regulation, then the 
introduction of the regulation will promote 
efficiency. If, on the contrary, €(0)» x*, 
then the ex ante regulation will exacerbate 
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the inefficiency that exists with the negli- 
gence rule. 


PROPOSITION 4: Imposition of an ex ante 
regulation, given the existence of a negligence 
rule, will promote efficiency if the injarer were 
to be underprotective regarding an accident 
without ex ante regulation. and will exacerbate 
inefficiency if the injurer were to employ too 
high a level cf precaution without ex ante 
regulation. 


The gist of our result is that the imposi- 
tion of an ex ante regulatory standard, on 
top of.a negligence standard, induces poten- 
tial injurers to revise their perceptions of 
the legal standard of care. Specifically, the 
higher the level of the ex ante regulatory 
standard, the higher the legal standard is 
likely to be, at least in the eyes of the 
injurer. Thus, the ex ante regulation can. 
correct cases of underprecaution resulting 
from exposure to liability alone, but it can 
also exacerbate overprecaution. 

This proposition may now be related to 
the conclusions of the previous section re- 
garding the injurer's likely response to a 
negligence rule with uncertain enforcement 
of the legal standard. Recall that Proposi- 
tions 1 through 3 established that injurers, 
when faced with only a negligence rule, may 
choose suboptimal precaution when (1) un- 
certainty about the legal standard is suffi- 
ciently large; (2) the marginal cost of pre- 
caution at x* is large; or (3) the distribution 
about the legal standard is sufficiently bi- 
ased to the left of x* 

It follows that when any of these condi- 
tions holds, injurers can be induced to 
increase their level of precaution by estab- 
lishing a miniraum safety regulation, s. Ad- 
ditionally, it follows from our discussion that 
because d$ / ds is always positive, the impo- 
sition of an ex ante minimum level of pre- 
caution in circumstances other than those 
noted above "will cause injurers to take too 
much precaution. 

Given that the introduction of an ex ante 
safety regulation can reduce inefficiencies 
associated with the use of liability alone, the 
obvious next question is what level of the 
ex ante regulation, s*, will induce firms to 
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choose £(s)- x*? From Proposition 4, we 
know that s*=0 if and only if 2(0)> x*. 
Furthermore, if £(0)« x*, then s*>0 will 
promote efficiency. The question now is, 
what level of s will make <= x*? The an- 
swer can be found by substituting x* for £ 
in equation (19) and solving for s*. Stopping 
short of actually solving equation (19) for 
s*, we can rewrite it, using equation (2), as 


(21) C'(x*)[R(s*)— R(x*)] 
— A(x*)q(x*) =0. 


For this to hold, the bracketed term must 
be nonnegative. This implies. that the opti- 
mum level of the ex ante regulation is less 
than or equal to the optimal level of precau- 
tion, that is, s* x x*. Furthermore, gener- 
ally s* « x*. Consider the implications of 
5* — x*. In this case R(x*)= R(s*), which 
implies, using equation (21), that the proba- 
bility density at the social optimum (g(x*)) 
must equal zero (because A(x*)> 0). In 
other words, the only way s* can equal x* is 
if there is no chance that the legal standard 
will be at x*. That is unlikely.!$ 


PROPOSITION 5: The optimum level of an 
ex ante safety regulation, s*,. given that a 
negligence rule exists, will be less than the 
socially optimal level of precaution, x*, pro- 
vided q(x*)> 0. If q(x*) — 0, then s* = x* is 
optimal. 


The implication of this result is that where 
optimal precaution calls for the joint use of 
ex ante regulation and a negligence rule, the 
optimal ex ante regulatory constraint should 
be set below the socially optimal level of 
care unless there is no uncertainty concern- 
ing the legal standard of care. . 

This result is illustrated in Figure 2 for 
two cases. Case I is the situation where 


I51n (20), the term C'(x*) is positive as is the term 
A(x*), given our earlier definitions of these functions. 
In order for equation (20) to equal zero, R(s*) (the 
probability that s* is less than the ex post standard) 
must be greater than that same probability at x*. This 
relationship between R(s*) and R(x*) can only be true 
if s* «x*, 
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Level of Precaution 


Precluded 
Region 





x* 
Ex Anfe Regulation (s) q^-259 


Ficure 2. THE EFFECT oF Ex ANTE REGULATION 
WITH Ex Post LIABILITY 


negligence on its own overprovides precau- 
tion, x*. An ex ante regulation cannot in- 
crease efficiency. In fact, for any s>0 in 
case I, the level of precaution, x,(s), in- 
creases and deviates even further from the 
social optimum. Case II involves underpro- 
vision of precaution when the firm is subject 
only to liability regulation, x^. We see in 
both cases that as the ex ante regulation is 
raised, precaution increases. In case II, the 
optimal ex ante regulation is where s* re- 
sults in precaution of x*. Also shown in the 
figure is what we might call the “conven- 
tional wisdom" along the kinked line: liabil- 
ity alone induces optimal behavior for s « 
x*; as soon as s reaches x*, then the ex ante 
regulation becomes binding and precaution 
is provided at level s. 


IV. Conclusions 


The propositions presented above have 
profound implications in a wide range of 
public policies for dealing with external costs 
with regard to the conditions where ex ante 
regulation should be used alone or both 
ex ante regulations and ex post liability rules 
should be used jointly. We introduced un- 
certainty into a defendant's assessment óf 
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the legal standard of care and deduced the 
consequences of this uncertainty on the de- 
fendant’s choice of precaution under a neg- 
ligence rule. 

Propositions 1 through 3 indicate the ef- 
fect of uncertainty about the legal standard 
of care and the injurer’s marginal cost of 
precaution on under- or over precaution. We 
next demonstrated that the introduction of 
an ex ante constraint specifying a minimally 
acceptable level of precaution (a safety reg- 
ulation) will always cause the injurer to in- 
crease precautionary levels. We concluded 
that the joint use of ex ante and ex post 
regulation will enhance efficiency under the 
following conditions: if there is great uncer- 
tainty in the determination of the legal stan- 
dard of care, or if the distribution is highly 
biased to the left of the socially optimal 
level of care, or if the injurer's marginal cost 
of precaution is large at the social optimum. 
` Otherwise, ex ante and ex post regulation 
should be used separately. 

We used our model to show the relation- 
ship between the optimal ex ante constraint 
and the socially optimal level of care. 
Proposition 5 clearly indicates that if it is 
efficient to use both policies, then the level 
of the ex ante regulation should not be set 
at the social optimum but rather at a lower 
level. That proposition might further be 
taken to indicate that ex ante regulations 
should be used alone when the probability 
of a successful suit against the injurer is 
zero. This might be the case when a great 
deal of uncertainty is associated with a harm, 
as might occur when the harm is so new 
that those it affects and the consequences of 
the harm are unclear but suspected of being 
catastrophic, or when the-level of accident 
costs borne out by the injured party is so 
small that he or she might not even recog- 
nize it, even though many individuals are 
affected. 

Further implications for the optimal mix 
of regulatory policies arise from a compari- 
son of the assumptions of Propositions 1 
through 4 with actual circumstances. This 
comparison may reveal both positive and 
normative insights. For example, it might be 
possible, using the model discussed here, to 
explain why new harms—for example, the 
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escape of toxins into the environment—are 
typically regulated through ex ante com- 
mand and control policies, while harms aris- 
ing from more familiar sources—for exam- 
ple, automobiles—are typically regulated by 
exposing the injurer to ex post liability. 
There are several refinements to the 
model that seem appropriate. First, our 
model includes only the injurer's costs of 
avoiding the harm. A complete analysis 
would include an explicit treatment of the 
victim's precautionary behavior under un- 
certainty. Second, a.comparison of the ad- 
ministrative costs of the tort liability system 
and of the ex ante system should be made. 
Third, uncertainty surrounding the legal 
standard could be further broken down into 
its different components—for example, evi- 
dentiary uncertainty, uncertainty regarding 
the technology of precaution, uncertainty 
regarding the level of accident costs, and 
uncertainty about the victim's willingness to 
bring a tort action—in order to allow the 
examination of the conditions under which 
alternative ex post liability rules or a differ- 
ent mix of ex ante and ex post regulation 
might be efficient. Fourth, the possibility of 
bankruptcy could be introduced. Finally, 
uncertainty regarding the.ex ante regulation 
could be introduced into the model. We 
have assumed that there is no error in the 
determination or enforcement of the ex ante 
standard. To :he extent that such uncer- 
tainty exists, then the case for complemen- 
tary use of ex ante and ex post regulation 
becomes more complex. While it is clear 
that such uncertainty would not affect our 
prohibition of the use of ex ante regulation 
when injurers tend to oversupply precaution 
because of uncertainty regarding the en- 
forcement of the ex post liability rule, it 
should affect the level of regulation when 
injurers undersupply precaution. It could be 
argued that if regulators have enough infor- 
mation to set a lower bound on precaution 
ex ante, then the liability system should also 
have enough information to set the same 
lower bounds. But this reform cannot be 
affected within the tort liability system as it 
presently exists. It can only be achieved by 
supplementing exposure to tort liability with 
exposure to ex ante regulation. 
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The Subsidence of Preference Reversals in Simplified 
and Marketlike Experimental Settings: A Note 


By YuN-PENG Cuu AND Rugey-Linc CHu* 


A behavior pattern called the “prefer- 
ence reversal phenomenon” has received a 
lot of attention in recent years. This phe- 
nomenon refers to the experimental finding 
that, when individuals are presented with 
two gambles—one having a high probability 
of winning a modest sum of money (the P 
bet), and the other. having a low probability 
of winning a large amount of money (the $ 
bet)— they often prefer the P bet but assign 
a larger monetary value to the $ bet. Such 
behavior is astonishing because, as Paul 
Slovic and Sarah Lichtenstein (1983) noted, 
“it violates almost all theories of prefer- 
ence, including expected utility theory.” ` 

Considerable efforts have been made in 
the literature to better understand this puz- 
zle on both the theoretical and experimen- 
tal fronts. So far theoretical efforts to ratio- 
nalize such behavior appear to be quite 
fruitful; notable examples are Graham 
Loomes and Robert Sugden (1983), Chew 
Soo Hong (1983), Charles A. Holt (1986), 
Ronald A. Heiner (1983, 1985), Vernon L. 
Smith (1982, 1985), Paul Slovic and Sarah 
Lichtenstein (1983), William M. Goldstein 
and Hillel J. Einhorn (1987), Edi Karni and 
Zvi Safra (1987), and Uzi Segal (1988). The 
results from experimental studies have, 
however, been much less encouraging, in 
the sense that the behavior pattern remains 
robust in spite of the many different kinds 
of improvements in experimental design.! It 
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See Joyce E. Berg, John W. Dickhaut, and John R. 
O’Brien (1985) and David M. Grether and Charles R. 
Plott (1979) for a good review of the experiments that 
have been performed. 
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is apparently still an open question whether 
this phenomerion will ever disappear or at 
least subside under still newer experimental 
settings. The current study is an effort to 
answer this question. It consists of two ex- 
periments. The features of these experi- 
ments and the reasons for adopting them 
are explained below. 

(i) Most of the recently performed experi- 
ments have their subjects work with several 


pairs of gambles. In the experiments of _ 


Grether and Plott and Robert J. -Reilly 
(1982), subjects deal with a total of twelve 
(six pairs of) gambles and go through three 
steps. First, they are shown three of the six 
pairs and asked to indicate a preference for 
each pair. Second, they are given all twelve 
gambles, presented in no particular ar- 
rangement, and asked to assign a monetary 
value to each. Finally, they are presented 
with three pairs of gambles, which are the 
ones left over from step one, and are re- 
quired to indicate a preference for each 
pair. In Werner W. Pommerehne, Friedrich 
Schneider, and Peter Zweifel's (1982) exper- 
iment, which has a similar format, the total 
number of gambles is eight, arranged into 
four pairs in steps one and three. In Berg 
et al.’s experiment, subjects go through two 
sections, each of which duplicates the for- 
mat of Grether and Plott (and Reilly) de- 
Scribed above, with twelve gambles and 
three steps. One question naturally arises: 
Will the reversal phenomenon be reduced 
in less complex experiments? The current 
study examines this issue by confining the 
subjects’ choice to a single pair of gambles 
in the main parts of the experiments. 

(ii) As Smith (1985) correctly points out, a 
preference reverser is vulnerable to a con 
game (money pump) in which a sequence of 
decisions will cause a loss in assets. It would 
then be interesting to know whether the 
phenomenon would persist in a marketlike 
environment where reversers would see ac- 
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tual losses in their assets as the result of 
arbitrage by the experimenter. Berg et al. 


. perform precisely such an experiment in : 


` which they find that although the value (dol- 
lar magnitude) of reversals is reduced, the 
frequencies of reversals are not. One may, 
however, question whether their experiment 
is a serious disproof of the usefulness of a 
marketlike environment on two grounds. 
One is the problem of complexity as noted 
above. The other is that their experiment 
exposes the subjects to arbitrage only once, 
and it is questionable whether one exposure 
is enough to reduce reversals. The experi- 
ments in the current study are designed to 
deal with both problems. In particular, in 
the second part of our first experiment, 
subjects are presented with only two gam- 
bles. Each subject is asked to indicate a 
preference between, and the monetary val- 
ues of, the gambles, and one arbitrage is 
undertaken if there is a reversal. The sub- 
ject is then asked to state the preference 
and the monetary values again; arbitrage is 
again undertaken if there is reversal, and 
this process is repeated several times. In 
our second experiment, to see whether the 
subjects are able to carry their experiences 
from one pair of gambles to another, we 
perform three consecutive con games, each 
of which consists of a series of arbitrages. 
These experiments represent, we think, a 
fairer test of the robustness of the reversal 
phenomenon in the face of a marketlike 
environment. . 

(iii) Since the current study was carried 
out in Taiwan, while most of the other 
experiments so far performed took place in 
Western industrialized nations, it is neces- 
sary to ensure the cross-cultural applicabil- 
ity of the reversal phenomenon. The first 
part of our first experiment does this by 
completely replicating Reilly’s process. 

To summarize, the current study contains 
two experiments, with the first one further 
divided into two parts, I and II. Part I of the 
first experiment replicates Reilly's. design, 
and Part II is a.con game consisting of a 
series of arbitrages, played for a single pair 
of gambles. The second experiment goes 
one step further by performing three con- 
secutive con games for three different pairs 
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of gambles. In what follows, Section I will 


- explain in detail how the two experiments 


were carried out; Section II will report the 
results; and, finally, Section III will offer 
some concluding remarks. 


I. Experiment Design 
A. First Experiment 


There were two groups of subjects. One 
group consisted of 39 freshmen in applied 
psychology, and the other consisted of 55 
juniors in economics, both at Fu-Jen 
Catholic University in Taiwan. They will be 
referred to as Group A and Group B, re- 
spectively. 

The first part, or Part I, of our experi- 
ment essentially copies Reilly's Group 1 in 
Stage 2. As noted briefly earlier, it involved 
three steps. First, subjects were shown three 
pairs of gambles and asked to indicate their 
preference within each pair. Second, the 
subjects were shown twelve gambles, includ- 
ing the six from the previous step, unpaired 
and randomly presented. For each gamble, 
the subjects were asked to record the lowest 
price for which they would part with the 
right to play the gamble. Finally, subjects 
were shown three more pairs of gambles, 
made of the six gambles that appeared new 
in step two, and were asked to select the 
gamble they preferred in each pair. 

Most of the details of the above steps can 
be found in Reilly, so they are not repeated 
here. We will only note those arrangements 
of ours that are not explained in detail in 
Reily or are different from his arrange- 
ments, as follows: 

() The whole group went through this 
part together in a classroom. They were not 
divided into small sections. 

(ii) At the beginning of Part I, subjects in 
Group A were given a cash payment of 
NT$100 (US$3.50 at the average exchange 
rate of NT$28.59:US$1 in 1988) each for 
participating in the experiment, and those 
in Group B were paid NT$50 (US$1.75). 
The difference between the two sums was 
due to administrative reasons. Then each of 
the subjects, in groups A and B, was staked 
NT$90 (US$3.15) in token money that could 
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be converted into cash at the end of the 
experiment. Given that Taiwan’s per capita 
income is less than one-third of that of the 
United States, these amounts are certainly 
comparable to if not more attractive than 
those in Reilly’s experiment. 

(iii) The six pairs of gambles used in 
this Part I are (33/36; 70, —80) and 
(21/36; 140, —60); (33/36; 40, —40) and 
(18/36; 100, —30); (29/36; 60, —50) and 
(14/36; 210, —70); (31/36; 80, —30) and 
(12/36; 270, —40); (32/36; 60, —40) and 
(14/36; 170, —30); and (34/36; 60, —40) 
and (18/36; 130, —20); where the fractions 
indicate the probabilities of winning, and 
the winning and losing amounts are all in 
NT dollars. They will be called P1, 
P2,..., P6, respectively. These pairs are the 
same as Reilly's except for the payoffs, which 
we think are attractive enough, as far as our 
subjects are concerned, for the reasons 
stated above. 

Gv) At the end of Part I, one of the six 
pairs of gambles was chosen by the random 
process of drawing numbered Ping-Pong 
balls from a jar. Each subject was assigned 
one of the gambles in that pair according to 
his stated preference. Then the experi- 
menter disclosed to the subjects a “transac- 
tion price" for each of the two gambles— 
that is, a price that the experimenter could 
compare with the subject's recorded. selling 
price (which could be negative) to deter- 
mine whether he would buy the gamble 
from the subject or let the subject play it.? 
The subjects had already been told at the 
beginning of the game that this transaction 
price was an amount between NT$270 and 
—NT$30 (the maximum possible amounts 
of winning and losing for all gambles) that 
had been selected by a random process prior 
to the game. Now, if the gamble assigned to 
a subject should be bought, the experi- 
menter bought it and paid the subject the 
purchasing price in token money. If the 
gamble should be played, it was played, and 
the appropriate sums were paid to or col- 
fected from the winners and losers. in the 


"This is what is usually called the Becker-DeGroot- 
Marschak procedure. See Reilly for details. ` 
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form of token money. So at the end of Part 
I, each subject had either sold or played a 
gamble. Each was paid the actual value of 
his token money in cash at the end of the 
experiment. | 

After Part I was completed, all subjects in 
the. group were asked to stay in the class- 
room and no conversation was allowed. Part 


: TI began in a separate classroom with a total 


of five experimenter’s assistants ("traders") 
sitting in five separate parts of the room. 
Each trader “handled” only one subject at a 
time. No conversation was allowed between 
subjects in this room. Also, the subjects 
were not allawed to return to the original 
classroom, which was used as a waiting 
room, after the completion of Part I. 

So Part II began as the experimenter 
randomly chose students from the waiting 
room and assigned each of them to one of 
the traders, also randomly. It took an aver- 
age of.one minute and a half for.a trader to 
handle one subject. Once the process was 
completed, the subject was asked to leave 
the room, and a new subject was summoned 
from the waiting room. After all the sub- 
jects had completed their transactions with 
the traders, Part II was completed. 

The sequence of interactions between the 
subject and the trader was arranged as fol- 
lows. 

1. The subject was. given a new type of 
token money (colored differently from the 
money used in the first part) worth NT$300 
(US$10.49) and was told, “This is the initial 
stake for this part of the game. It is not real 
money because it cannot be cashed. But 
whether you win or. lose in this part of the 
game still matters, because at the end, Y will 
give you a present, a real gamble with a 
large positive gain and a small loss, which 
you can choose to play or not to play. If you 
decide to play and win, I will pay you the 
same type of token money used in the last 
part, and it will be cashed. If you lose, I will 
take the losing amount from the token 
money you are keeping. So the gamble will 
be real. There are many gambles in my 
drawer, some of them are very attractive 
(with higher winning amounts and lower 
risks) and some are less attractive. Which of 
them I will give you as a present depends on 
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the total value of your new type of token 
money at the end of this part of the game. 
The better you do, the more valuable a 
present you will receive.? 

2. The subject was asked to study two 
gambles, a and 8, which are the pair of 
gambles that invited the highest frequency 
of (positive) reversals in Part I, for either 
Group A or Group B.^ This pair turned out 
to be P4; that is, a = (31/36; 80, —30) and 
B — (12/36; 270, — 40). 

3. The subject was asked to state his 
preference between a and f and was told 
that if later he were holding a gamble that 
was not the one he preferred, or if he had 
no preference between the two gambles, 
then the trader would have the right to 
exchange the other gamble for the one he 
was holding. 

4. The subject was asked to indicate a 
"fair" price for each a and B and was told 
that whatever the price (positive or nega- 
tive) might be, if later the trader decided to 
buy from or sell to him gamble a or gamble 
B at the indicated fair price,? he must com- 
ply. 

The subject's response to steps 3 and 4 
above must fall under one of the following 
categories: 

(i) a> (is preferred to) £ and $(a) (the 
fair price of a) > $(8), or a < B and $(a) < 
$(8), or a~ (is as preferred to as) B and 
$(a) = $(8). In these cases no transaction 
took place; the subject was told that the 
game was over, his new token money 
(NT$300) was taken back, and he was given 


Admittedly, the use of an unknown gamble as a 
reward instead of real money constitutes a deficiency in 
our experimental design, as pointed out to us by an 
anonymous referee. However, this shortcoming is 
remedied in the second experiment, which is simply a 
more complicated version of Part II of the first experi- 
ment. 

The meaning of "positive" 
plained later. 

If the price is negative (positive), the trader will 
take back from (give to) the subject some token money 
in order to obtain a game from the subject. In reality 
none of the subjects stated a negative price for any 


reversals will be ex- 


gambles in this part of the game or in the second 


experiment (to be discussed later). 
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a gamble: as a present. If the subject de- 
cided not to play this gamble, he was asked 
to leave; if he decided to play, the gamble 
was played, the winning or losing amount 
was settled, and he was asked to leave. Part 
II was completed. 

Gi) ezB and $(a)«$(B). The trader 
now sold B to the subject, exchanged a for 
B, and bought æ back. The subject found 
that he was not holding any gambles but his 
(new type of) token money had decreased 
by $(8)—$(a). Then the subject was asked 
to go back to steps 3 and 4 above. 

(iii) a < B and $(a) > $(8). Analogously, 
one arbitrage transaction took place and the 
game went back to steps 3 and 4.° 

For Group A, a maximum of two arbi- 
trage transactions were undertaken for any 
one subject, no matter how many times he 
reached situation (ii) or Gii). For Group B, 
no maximum was set and arbitrage ney 
ued unless situation (i) was reached.’ 
either case, once no more arbitrage us 
tion was to take place, the game was over 
and the subject went through the same pro- 
cedure as that in situation (i). This com- 
pleted Part II. 

On the next day, the token money held by 
the subjects was cashed, and the experiment 
was completed. 


.B. Second Experiment 


Again, there were two groups of subjects; 
they will be referred to as groups C and D. 
Group C consisted of 46 freshmen in Chi- 
nese literature at Fu-Jen Catholic Univer- 
sity, and Group D consisted of 37 freshmen 
with the same major at National Cheng-Chi 
University. 


51f in the process of arbitrage, the value of the new 
type of token money held by the subject fell below 
zero, he would be floated a loan. This situation did not, 
however, happen in reality. 

For Group A, which was given the experiment first, 
we had no idea how rapidly or slowly subjects would 
change their reversal behavior under arbitrage, so we 
set the maximum at two to control time. No maximum 
was set for Group B because we knew from our experi- 
ence with Group A that reversers changed their behav- 
ior rapidly. See the later text for details. 
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TABLE 1—FREQUENCIES OF REVERSALS, ALL Six PAIRS 


Bet Choices? 
Group A P 110. 
n=39 $ 124 
Group B P 132 
n=55 $ 195 
Groups A and B P 242 
n=94 $ 319 
Reilly’s Results? P :106 
nz45 $ 159 





SEPTEMBER 1990 
Reservation Prices (Percent)? 
Consistent Inconsistent Equal  Total* 
45.5 47.3 73 100.0 
67.7 31.5 0.8 100.0 
45.5 48.5 6.1 100.0 
70.8 23.6 5.6 100.0 
45.5 47.9 6.6 100.0 
69.6 25.6 3.8 100.0 
46.2 49.1 4.7 100.0 
67.9 264 5.7 100.0 





*The total number of choices may not be equal to the total number of possible 
choices, namely 6 X n, because some subjects are indifferent between the P and $ bet. 
Frequencies of consistent, inconsistent or equal prices divided by the total number 


of choices in the same row. 


“Actual sum may not be exactly 100.0 because of rounding. 


*This is his Group 1 in Stage 2. 


This experiment skipped Part I of our 
first experiment and went directly to Part II, 
with the following added features: 

(i) The gambles used in this experiment 
were the three pairs that invited the highest 
numbers of positive reversals in Part I of 
our first experiment from groups A and B 
combined. It is already known that P4 is 
one, and the other two pairs turned out to 
be P1 and P5. 

(ii) Each subject was first given NT$50 
(US$1.75) in cash for attendance. The ex- 
perimenters then spent about 15 minutes 
explaining the gambles and the rules of 
transactions, which were the same as those 
in Part II of the first experiment. 

(iii) Three different sequences of con 
games were designed, they were, respec- 
tively, P1-P4-P5, P4-P5-P1 and P5-P1-P4. 
Each subject was randomly assigned to one 
of the three sequences. 

(iv) Using P1-P4-P5 as an example: a sub- 
ject was first staked NT$300 (US$10.49) in 
token money. The trader then played P1 
with him, following the same steps as those 
in Part II of the first experiment, without 
any limit on the maximum number of arbi- 
trage transactions. When these steps were 
completed, the subject kept his token 
money, however much it was worth at that 
point, and was again given NT$300 
(US$10.49) in token money. Pair P4 was 
then played. This process was repeated un- 
til pair P5 was also played. The subject was 


told that the total amount of token money 
he had at the end of the three con games 
would be cashed at 2/9 of its face value. 
Thus if a subject lost no money in these con 
games, he would receive NT$200 (US$7.00) 
in the end, in addition to the NT$50 
(US$1.75) attendance reward. 

This completes the description of our sec- 
ond experiment.? 


II. Results of the Experiments 


Table 1 gives our Part I results and those 
of Reilly. It is clear that our subjects in 
Group A or Group B or both together were 
remarkably similar to Reilly's in their pref- 
erence reversal behavior. The occurrence of 
“positive” reversals (P bet is preferred but 
assigned a lower monetary value) was 47.3 
percent for Group A, 48.5 percent for Group 
B, and 47.9 percent for both groups, while it 
was 49.1 percent for Reilly's Group 1 in his 
Stage 2. As for the "negative" reversals ($ 
bet is preferred but assigned a lower value), 
the rate was 31.5 percent for our Group A, 
23.6 percent for Group B, and 26.6 for both 
groups, while the rate for Reilly's group was 


®The whole idea of this second experiment was 
brought to our attention by an anonymous referee, to 
whom we are grateful. This explains why the two 
experiments in this paper could have been combined 
but were not. j 
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TABLE 2—PREFERENCE REVERSALS UNDER ARBITRAGE: ONE PAIR OF GAMBLES 
Group A: A Maximum of Two Arbitrage Transactions (n = 39) 
Situation (i) Situation (ii) Situation (iii) 
a> B and $(a) > $(8) ar exp. 
a < B and $(a) x $(8) and and 
or o ^ B and $a) = $(B) $(a) « $(8) $(a) > $(B) Sum 
Before Arbitrage 22° 16 1 39 
(56.4)? (41.0) (2.6) (100.0) 
After 1 Arbitrage 12 5 0 17 
Transaction : (30.8) (12.8) (0.0) (43.6) 
After 2 Arbitrage 3 2 0 5 
Transactions (7.7) (5.1) (0.0) (12.8) 
Group B: No Limit on Number of Arbitrage Transactions ( = 55) 
Situation (i) Situation (ii) Situation (iii) 
a> B and $(a) > $(B) ax axB 
a < B and $(a) < $(8) and and 
or æ ~ B and $(a) = $(8) $(a) < $(8) $a) > $(8) Sum 
Before Arbitrage 43 11 1 55 
(78.2) (20.0) (1.8) (100.0) 
After 1 Arbitrage 3 9 0 12 
Transaction (5.4) (16.4) (0.0) (21.8) 
After 2 Arbitrage 8 : 1 0 9 
Transactions (14.6) (1.8) (0.0) (16.4) 
After 3 Arbitrage 1 0 0 1 
Transactions (1.8) - (0.0) (0.0) (1.8) 


*Number of people. 


Percentage of total number of people in the group. 


26.4 percent. These results indicate that our 
subjects are certainly comparable to Reilly’s 
with regard to their preference reversal be- 
havior, so our Part II results should be 
meaningful. 

Data on the subjects’ preferences be- 
tween individual pairs of gambles are not 
given here but are available upon request. 
Suffice it to say that P4 invited the highest 
frequency of positive and negative reversals 
combined, with 35 positive and 9 negative 
ones; P1 ranked second with 25 positive and 
13 negative reversals; and P5 ranked third 
with 18 positive and 17 negative reversals, 


for groups A and B combined. In addition, | 


as stated earlier, when only the frequencies 
of positive reversals are counted, these same 
pairs also invited the highest numbers of 
reversals. 


For Part II, the results of which are re- 
ported in Table 2, two of the most impor- 
tant findings are as follows: 

(i) The frequency of reversals before arbi- 
trage took place was reduced among the 
Group B subjects, but it remained exactly 
the same for Group A subjects. In Part I, 17 
out of the 39 people in Group A made 
inconsistent choices with regard to P4. In 
Table 2, the numbers in the corresponding 
columns, namely situations (ii) and (iii) be- 
fore arbitrage took place, add up to the 
same amount. For Group B, while in Part I 
the number of reversals is 27 out of 55 
people for P4, it is reduced to 12 in Table 2. 
These results indicate that in the absence of 
arbitrage, preference reversal remained an 
important phenomenon even when the ex- 
periment was so simplified as to consist of a 
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TABLE 3—PREFERENCE REVERSALS UNDER ARBITRAGE: THREE PAIRS OF GAMBLES 





Round 1 Round 2 Round 3 


~ 
a 


Group C (n = 46) 


Positive Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 
Negative Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 


Group D (n = 37) 


Positive Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 
Negative Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 


Groups C and D 
Combined (7 = 83) 


Positive Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 
Negative Reversal 
First Appears at 
the Beginning of 
Round 1 
Round 2 
Round 3 


People 


18 37 


Transactions? People 





Round 1 


People "Transactions? 


Round 1 


People — Transactions? 


Transactions 


Round 2 


People X Transactions? 


Round 2 


People Transactions? 





24 45 


Urb. 
Aw 


People 


NOS 


People 


ooo 


NOg 


People 


0 
0 
4 


Transactions? 


NWO 


0 
0 
2 


Round 3 


'Transactions? 


coo 


0 

0 

3 
Round 3 


"Transactions? 


Nw oS 


0 
0 
5 


“Total number of arbitrage transactions performed to convert all of the reversers (people) at the beginning of the 
indicated round to nonreversers. \ 
These people are subsets of the 10 people who are positive reversers at the beginning of Round 1, reported in 
the same row. The numbers of people in all other rows are defined analogously. In the next row for example, the 1 
person under “Round 3” is a subset of the 7 people under "Round 2.” 
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single pair of gambles, although simplifica- 
tion was effective in reducing reversal fre- 
quency for one group of the subjects. 

(ii) The frequency of reversals decreased 
rapidly under arbitrage for both groups. 
From Table 2, for Group A, 12 out of the 
17 people who were reversers before arbi- 
trage adjusted their behavior to become 
nonreversers after one arbitrage transac- 
tion. Out of the remaining 5 people, 3 ad- 
justed their behavior toward consistency af- 
ter the second arbitrage transaction; and 
only 2 remained unconverted. Reversers in 
Group B, for which no limit was set on the 
number of arbitrage transactions, adjusted 
their behavior equally rapidly. Out of the 12 
reversers before arbitrage, 3 became nonre- 
versers after one transaction; out of the 
~ remaining 9, 8 became consistent players 
after the second transaction; and the re- 
maining 1 person was also converted follow- 
ing the third arbitrage transaction. So for 
Group B, three transactions were all that was 
needed to wipe out preference reversals com- 
pletely. 

The results from our second experiment 
are presented in Table 3. In this table the 
three con game sequences referred to ear- 
lier—namely, P1-P4-P5, P4-P5-P1 and P5- 
P1-P4—have been combined. These three 
cón games in any sequence are referred to 
as rounds 1, 2, and 3, respectively. That is, 
the first con game any subject encounters is 
called round 1, no matter to which of the 
three sequences the subject belongs; the 
second con game the same subject encoun- 
ters is called Round 2, and so on. 

What does one read from Table 3? Be- 
ginning with the positive reversers in Group 
C, one observes from the first row in Table 
3 that of the 46 subjects in Group C, 18 
displayed positive reversal behavior in the 
first round before arbitrage took place, and 
it took a total of 37 arbitrage transactions to 
convert all of them to nonreversers in this 
Round 1, an average of 2.06 transactions 
per subject. Then, given what the subjects 
learned in Round 1, only 1 out of these 18 
people again displayed reversal behavior in 
the second round before arbitrage took 
place; and one arbitrage transaction was all 
that was needed to convert this person to a 
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nonreverser in this round. Finally, none of 
these 18 people displayed reversal behavior 
anymore in Round 3, before arbitrage took 
place. f 

The second row in Table 3 shows that, 7 
out of the 46 subjects in Group C displayed 
no reversal behavior in Round 1, but were 
positive reversers at the beginning of Round 
2, before arbitrage took place. In this round, 
following a total of 11 arbitrage transac- 
tions, all 7 became nonreversers. Then, out 
of the 7 people who had been exposed to 
the arbitrage environment, only 1 again dis- 
played reversal behavior in Round 3, and it 
took 3 arbitrage transactions to convert him 
to a nonreverser in that round. i 

Finally, according to the third row in 
Table 3, 2 out of the 46 people in Group C 
were nonreversers in rounds 1 and 2 but 
displayed positive reversal behavior -in 
Round 3, before arbitrage took place. It 
took 2 arbitrage transactions to wipe out the 
behavior between these two subjects in that 
round. 

Similar patterns were observed for the 
negative reversers in Group C and for the 
positive and negative reversers in Group D 
in Table 3 and need not be repeated. The 
overall picture Table 3 presents can be sum- 
marized as follows. 

(i) For groups C and D combined, 29 
(24 + 5) out of the 83 subjects reversed at 
the beginning of Round 1, and a total of 46 
(24 - 8-2 5 3 +4), or more than half 
of them, reversed at the beginning of one of 
the three rounds, prior to exposure to the 
money pump. These results indicate that in 
the absence of arbitrage, preference rever- 
sal remained an important phenomenon 
even when subjects worked with only a sin- 
gle pair of gambles at a time. Such a finding 
echoes that of Part JI of the first experi- 
ment.? 


?It is worth noting here that while it is meaningful 
to make a direct comparison between the results of the 
two parts of the first experiment to see whether simpli- 
fication in design reduced reversal frequency, a similar 
direct comparison between the results of Part I of the 
first experiment and those obtained here in the second 
experiment would not be justifiable because subjects in 
this second experiment differed from those in the first 
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(ii) The within-round behavior of the re- 
versers in Table 3 is similar to that of the 
subjects in Part II of the first experiment, as 
both groups of subjects were quick to adjust 
their behavior from that of reversers to non- 


reversers following a series of arbitrage’ 


transactions desigried to make reversers lose 
money. The reversal phenomenon is vulner- 
able to arbitrage transactions. 

(iii) For the 29 (24 4- 5) subjects who first 
reversed at the beginning of Round 1 and 
for the 11 (8 + 3) subjects who first reversed 
at the beginning of Round 2, the between- 
round behavior clearly indicates that both 
sets of subjects were able to carry their 
experiences of arbitrage transactions from 
one round to subsequent rounds. The rever- 
sal phenomenon is vulnerable to the experi- 
ences the subjects accumulated from the 
previous rounds of marketlike games. 


III. Concluding Remarks 


The experiments reported in this paper 
are designed mainly to test three hypothe- 
ses: (i) incidence of reversals will be re- 
duced if the choices are simplified; (ii) in- 
cidence of reversals will be reduced in a 
marketlike environment where reversers are 
subject to repeated arbitrage transactions 
that cause them to lose money; and (iii) 
incidence of reversals will be reduced for 
subjects who have had the experience of a 
marketlike environment. 

From a comparison between the results 
of parts I and II of the first experiment, 
hypothesis (i) above is only weakly con- 
firmed at best. Although reversal frequency 
was reduced for one group of subjects, it 
stayed at exactly the same level for the 
other group. Furthermore, an examination 
of the absolute numbers of reversal fre- 
quencies in Part II of the first experiment 
and in the second experiment reveals that 
preference reversal remained an important 
phenomenon, even when the choices con- 


in that they were not exposed to the Reilly's procedure 
(Part I of the first experiment) prior to making sharers 
between a single pair of gambles. 
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fronting the subjects were reduced to a sin- 
gle pair of gambles. Simplification alone did 
not seem to be able to put the ‘reversal 
phenomenon to rest. 

But what simplification alone could not 
achieve became achievable once arbitrage 
was introduced. In Part II of the first exper- 
iment, only 5.1 percent of the subjects in 
Group A remained reversers after two 
transactions, and no one in Group B re- 
versed after three transactions. In the sec- 
ond experiment, all reversers (in all rounds) 
were successfully converted to nonreversers 
after an average of 1.71 transactions per 
subject.!° So hypothesis (ii) above is strongly 
confirmed. 

Hypothesis (iii) is also strongly confirmed. 
Results of the second experiment show that 
subjects displaved substantially fewer rever- 
sals after they were exposed to a marketlike 
environment in previous rounds of games. 
They were able to carry their experiences 
from one game to subsequent games. 

It seems, then, that while the preference 
reversal phenomenon is important, it is 
vulnerable to a marketlike environment 
coupled with a simplification in the experi- 
mental design. This is no doubt the most 
important finding of the current study. 

Of course, the study as it is still leaves 
many important questions unanswered. We 
do not know to what extent or how long the 
"educational" effect of the con games will 
last. Will the subjects remember the “les- 
sons" after six months or a year? Will they 
be able to apply their experiences to games 
that are of the same nature as the ones 
played here but have very different formats? 
Furthermore, in real life situations, if the 
payoffs at stake are very large, will they 
actively seek tests of their decision-making 
procedure in marketlike environments be- 
fore they actually enter the markets, or they 
will still react to information transmitted to 
them from the market passively? That is, 


This average number of transactions is obtained 
by adding all of the numbers of transactions in Table 3 
together, and dividing the sum by the total number of 
reversers in all categories, for groups C and D com- 
bined. 
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will they take precautionary measures to 
prevent mistakes from being made, or they 
* will adjust their behavior only after the mis- 
takes have already been made? A large 
body of literature already exists on these 
issues, but more experimental findings would 
still be helpful in scrutinizing the relative 
usefulness of alternative hypotheses. We 
look forward to future efforts in these direc- 
tions. 
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Tests of "Fanning Out" of Indifference Curves: 
Results from Animal and Human Experiments 


By Joun H. KAcEL, Don N. MACDONALD, AND RAvMoND C. BaTTALio* 


In an earlier paper (Raymond C. Bat- 
talio, John H. Kagel, and Don N. Mac 
Donald, 1985), we reported Allais-type 
. violations of the independence axiom of ex- 

pected utility theory with rats choosing over 
positively valued payoffs (food rewards). 
This note extends this research, examining 
animals’ choices over losses, testing for (1) 
standard Allais-type common ratio effect vi- 
olations of expected utility theory and (2) 
fanning out of indifference curves for ran- 
dom prospects, tests of Mark J. Machina's 
(1982, 1987) hypothesis II (hereafter H2), 
over previously unexplored areas of the unit 
probability triangle. Results from a parallel 
series of experiments using human subjects 
choosing over real losses are also reported. 
For both rats and people, we find standard 
Allais-type violations of expected utility the- 
ory and a systematic failure of the fanning 
out hypothesis in the southeast corner of 
the unit probability triangle, in the case of 
losses. Thus, the fanning out hypothesis 
(Machina 1982, 1987) cannot provide a sat- 
isfactory explanation for behavioral devia- 
tions from expected utility theory. 


I. Allais-Type Common Ratio Effects and 
New Tests of Fanning Out 


A key element of all descriptive alterna- 
tives to expected utility theory is to explain 


*The authors' affiliations are, respectively, Depart- 
ment of Economics, University of Pittsburgh; Depart- 
ment of Finance, University of North Texas; Depart- 
ment of Economics, Texas A&M University. Research 
support from the Economics Division of the National 
Science Foundation is gratefully acknowledged. The 
research reported has benefited from discussions with, 
and the comments of, Colin Camerer, Mark Machina, 
and Amos Tversky and the comments of the referees. 
We alone are responsible for any errors. 


Allais-type common ratio effects and a 
number of other systematic violations of 
expected utilitv theory. Common ratio effect 
violations of expected utility theory include 
the “certainty effect" of Daniel Kahneman 
and Amos Tversky (1979) and the "Bergen 
Paradox" of Ole Hagen (1979), as special 
cases. Allais-tvpe common ratio violations 
involve rankings over pairs of prospects of 
the form: 


Example 1 
A: x, with probability p 
x, with probability 1— p 
x, with probability q 
x, with probability 1— q 


B: 


Example 2 
C: x, with probability rp 
x3 with probability 1— rp 
x, with probability rq 
x4 with probability 1— rg 


D: 


where p >q, 02 x4» x,» x, andOÜ«r«1 
(the term “common ratio” derives from the 
equality of prob(x,)/prob(x,) in A vs. B 
and C vs. D). An expected utility maximizer 
would prefer either A and C (if p[u(x;)— 
u(x3)] > qluGx;) - u(x4)], or else B and D 
(if the opposite were true). However, using 
primarily hypothetical choice alternatives, 
researchers find a systematic tendency for 
subjects to prefer B and C when the out- 
comes involve losses (Kahneman and Tver- 
sky, 1979; Kenneth R. MacCrimmon and 
Stig Larson, 1979; and the references cited 
in Machina, 1983). 

Machina (1982, 1987) identifies three sys- 
tematic violations of the independence ax- 
iom in addition to the common ratio effect 
and shows that all four follow from a single 


912- 


VOL. 80 NO. 4 


assumption, H2.! More recently, Zvi Safra, 
Uzi Segal, and Avia Spivak (1988) have used 
H2 to explain the preference reversal phe- 
nomena (Sarah Lichtenstein and Paul Slovic, 
1971; David W. Grether and Charles R. 
Plott, 1979; Machina, 1987). H2 states that 
in moving from one probability distribution 
to another, which (first order) stochastically 
dominates it, the local (linear in the proba- 


bilities) utility function retains the same de- : 


gree of concavity, or becomes more concave 
at each point. In other words, preferences 
tend to vary systematically so that a first- 
order stochastic dominating shift in wealth 
(i.e., a nonnegative random addition to 
wealth) results in the same or more risk 
averse choices. 2 

In terms of the unit probability triangle, 
this yields indifference curves that are paral- 


lel (as expected utility theory predicts) or - 


that fan out relative to the origin. Figure 1 
illustrates how fanning out can explain a 
typical Allais-type common ratio effect over 
losses. The set of all prospects over the 
fixed outcome levels 02 x,» x,» x, can 
be represented by the set of all proba- 
bility triples of the form P=(p,, P2 P3) 
where p; = prob(x,) and Xp, = 1. Since p; = 
1— p, — Pa, we can represent these lotteries 
by points in the unit triangle in the (p,, p3) 
plane. Figure 1 illustrates this for examples 
1 and 2 above when p= 1.0, q = 0.75, and 
r —1/3, (so that prospect A is represented 
by (p, p4) - (0,0, prospect B is repre- 
sented by (p, p3) = (0.75, 0.25), etc.). 
Prospects C and D stochastically dominate 
A and B, as they are derived from A and B 
by reducing the probability of x, and x, by 
a common proportion, 0 <r « 1, and adding 
the displaced probability mass to x4 in both 
cases. 

In Figure 1, the dashed lines represent 


linear combinations of the prospects chosen 


over. Solid lines are indifference curves over 


the random prospects. Since, in expected. 


‘These involve (1) the common consequence effect, 
the most famous specific example of which is the so 
called *Allais Paradox," (2) over sensitivity to changes 
in small probability-outlying events, and (3) the utility 
evaluation effect (see Machina, 1982, 1987). 
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FIGURE 1. PROSPECTS EMPLOYED IN TESTS OF 
FANNING Our. DASHED Lines CONNECT 
Prospects’ SUBJECTS CHOSE BETWEEN. 

STRAIGHT LINES ARE INDIFFERENCE CURVES 

THAT Display “FANNING Our." PROSPECTS 
CoRRESPOND TO THOSE SPECIFIED IN THE TEXT 
AND TABLE 1 


utility theory, the individual's indifference 
curves in the unit probability triangle are 
given by solutions to the linear equation 


U= Xu(xj)p; 
= u(x,)p,+u(x,)(1- pi— ps) 
+ u( x3) p4 = constant, 


they will consist of parallel straight lines 
with slope [u(x,)—u(x,)]/[u(x;) — u(x), 
with more preferred curves lying to the 
northwest. Since, by construction, the 
dashed lines-are parallel, the parallel linear 
indifference curves of expected utility the- 
ory require preference for prospects A and 
C or B and D. However, indifference curves 


that “fan out" relative to the origin can, as 


shown in Figure 1, explain the preference 


. for prospects B and C. 


Fanning out of indifference curves in the 
unit probability triangle corresponds to the 
“light hypothesis" of weighted utility theory 
(Soo Hong Chew and Kenneth R. Mac- 
Crimmon, 1979) and is a characteristic of 
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TABLE 1—TREATMENT CONDITIONS FOR LOSSES: ANIMALS 

















Condition Prospects 

1 A: 5 sec delay, prob. 1.0 or B: 13 sec delay, prob. 3/4 
(5.0) 1 sec delay, prob. 1/4 

(10.0) 
2 C: 5 sec delay, prob. 1/3 or D: 13 sec delay, prob. 1/4 
1 sec delay, prob. 2/3 1 sec delay, prob. 3/4 

(2.34) (4.0) 
3 E: 13 sec delay, prob. 1/2 or F: 13 sec delay, prob. 7/8 
5 sec delay, prob. 1/2 1 sec delay, prob. 1/8 

(90) . (11.5) 
4 G: 13 sec delay, prob. 1.0 or F: 13 sec delay, prob. 7/8 
3.0) 1 sec delay, prob. 1/8 


(11.5) 


Sequencing of Conditions: 1,2,1,3,1,3,1,4 





several other generalizations of expected 
utility theory as well (see Machina, 1983). In 
the case of losses, the obvious question is 
whether choices over pairs of prospects that 
are dominated by A and B, prospects, which 
lie in the southeast corner of the unit prob- 
ability triangle and are parallel to A and B, 
such as E and F in Figure 1, satisfy the 
requirements of fanning out (preference for 
F on the part of those choosing B and C). 


II. Experiment 1: Rats’ Choices Over Losses 


We define losses generically as any out- 
come for which less is better. In our experi- 
ments, the random variable was the time 
delay to reinforcement. Given the relatively 
steep temporal discount functions that rats 
have, time delay is a potent reinforcer, 
where it is well established that less is bet- 
ter (see Leonard Green, 1982, and Kagel 
and Green, 1987, for reviews of experiments 
most relevant to economists). 

Historically, psychologists have studied 
choices between constant and variable de- 
lays to reinforcement, finding strong prefer- 
ence for the variable outcome alternative 
(risk loving), given equal average delays to 
reinforcement (Peter Killeen, 1968; see 
James E. Mazur, 1986, for a review of this 
literature).? In these experiments, there are 


?Risk loving over losses here appears to be charac- 
teristic of a general preference for variable as com- 
pared to certain losses, consistent with predictions 


no auxiliary decisions to be made between 
the choice point and the time that all uncer- 
tainty is resolved. This is important, since it 
is well established that in choosing over 
risky prospects whose resolution is not im- 
mediate, and where auxiliary decisions are 
required prior to the resolution of the un- 
certainty, the independence axiom of ex- 
pected utility theory will not generally hold 
(Harry M. Markowitz, 1959, ch. 11; Jan 
Mossin, 1969). 

Table 1 shows the variable outcome alter- 
natives the rats chose between. Each out- 
come provided 0.1cc of water at the end of 
the delay period. Mean time delay to rein- 
forcement is shown under each pair of 
prospects. Conditions 1 and 2 involve stan- 
dard Allais-type common ratio manipula- 
tions as prospects C and D stochastically 
dominate A and B. Condition 3 checks for 
fanning out in the southeast corner of the 
unit probability triangle as prospects A and 
B stochastically dominate E and F and the 
dashed lines connecting the two sets of 
prospects are parallel. Prospects E and F 
are derived from A and B by multiplying 





from prospect theory (Kahneman and Tversky, 1979). 
See Kagel et al. (1988, experiment 1), where we test 
this prediction by examining rats' preferences over 
variable versus certain shock with the same mean. The 
rats prefer the variable shock. Note that unlike humans 
choosing over monetary losses, there is no opportunity 
for “asset integration” over these choices, with its 
implication for inducing risk aversion. 


VOL. 80 NO. 4 


the 5 and 1 sec delay probabilities by 0.5 
and adding the displaced probability to the 
13 sec delay probability in both cases. Con- 
dition 4 is a control condition that tests for 
the rats’ sensitivity to low probability out- 
come events, since it is generally assumed 
that the decision maker knows the relevant 
probabilities of the different payoffs. Payoff 
frequencies of 1/8 may be sufficiently low 
that the rats are not cognizant of them. 
Condition 4 tests for this. Figure 1 shows 
these prospects in terms of the unit proba- 
bility triangle. 

A discrete trials choice procedure was 
employed where the rats chose between a 
single pair of prospects at a time. Choices 
were recorded and payoffs delivered in re- 
sponse to a single press on one of two 
choice levers. Each choice trial lasted 39 
sec, irrespective of the alternative chosen or 
the length of the delay to reinforcement. 
This established a minimum of a 20 sec 
blackout period between choice trials (a 
maximum delay to reinforcement of 13 sec, 
with 6 sec to consume the water). Given the 
steep time discount rates the rats have, these 
20 sec blackout periods create relatively 
strong independence between trials. 

Each experimental session began with 32 
forced choice trials where only one lever 
was operative, followed by 48 free choice 
trials when both levers were operative. The 
forced choice trials served to familiarize 
subjects with the alternatives. During these 
trials, the empirical distribution function was 
forced to match the programmed distribu- 
tion function over each prospect's trial set.? 
Free choice trials served to measure prefer- 
ences. The random number generator was 
given free reign during these trials, so that 
the probability of obtaining a given outcome 
on any trial was independent of outcomes 
on other trials. 

Experimental sessions were conducted 
once a day, 7 days a week, at approximately 
the same time each day. Each experimental 


Sequences of choices across levers and outcomes 
were fixed; however, the start point was determined 
randomly on a daily basis. Details of experimental 
procedures are reported in Kagel et al., 1988, experi- 
ment 2. 
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condition lasted 38 days, divided into two 
19-day, periods. The levers delivering the 
certain and variable delay alternatives re- 
mained constant within each 19-day period, 
but they were switched between the two 
19-day periods. Switching alternatives across 
levers and averaging choices controls for 
lever bias, which may be severe at times for 
rats. Preferences were measured in terms of 
choices of the certain alternative during the 
free choice trials for the last 5 days of each 
19-day period and averaged across side 
switches. 

Table 2 shows average percentage of free 
choice trials allocated to the more certain 
alternative under each condition. A choice 
frequency of 50 percent indicates that the 
rats chose the more certain alternative half 
the time, averaged across side switches, and 
is interpreted as indifference. 

Under baseline pair 1 choices, the rats 
generally preferred the more variable (B) 
alternative. On average, the rats chose the 
A alternative on 38.9 percent of the trials 
(t = —2.68, p < 0.05, 2-tailed t-test). Using 
choice frequencies to measure intensity of 
preference, six of the eight subjects had 
relatively strong preference for the B alter- 
native, the two exceptions being subjects 
612 and 622. Given the strong preference 
for variable over fixed delay probabilities 
reported in the psychology literature, the 
preference for the B alternative here is not 
surprising, even though the average delay to 
reinforcement was twice that of the certain 
delay probability. Further, a necessary con- 
dition for generating Allais-type common 
ratio effect violations of expected utility the- 
ory, similar to those reported with human 
subjects, is to establish preference for the 
variable alternative under the high probabil- 
ity loss condition. The parameter values em- 
ployed were designed to achieve this. _ 

Under pair 2 choices, mean percentage 
choice for the more certain alternative, C, 
was 54.5 percent (t = 1.63). As indicated by 
the individual subject t-statistics, five rats 
were indifferent between C and D, and three 
had a clear preference for the C alternative. 
Upon return to pair 1 choices, mean per- 
centage choice for the A alternative fell to 
44.8 percent (t = —2.01, p « 0.10, 2-tailed 
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TABLE 2—CHOICES Over LOSSES: ANIMAL SUBJECTS 
Percentage Choice of More Certain Alternative 
(Absolute Value of t-Statistics in Parentheses) 
Standard Test of Test of Sensitivity 
Baseline Allais Baseline H2 Baseline H2 Baseline Test 
Subj. (1) (2) a) (3) (1) (3) (1) (4) 
611 21.7 723 38.9 64.6 400 644 483 31.5 
(7260 (618) (3.585  (3.40* (5.18^ (9.305 (0.60 (5.56 
612 50.0 502 55.6 498 | 492 498 494 27.7 
(0.0) (0.14) (8.30 (0.54) (1.00 (028 (1.34) (10.2)? 
613 44.6 49.6 50.4 677 490 59.0 46.0 31.5 
(1.94) (0.18) (0.16) (145) (034) (492^ (2.12) (5.04)? 
614 20.6 48.1 32.5 44.6 475 60.0 446 26.9 
(15.1) (074) (5.00 (168 (122 (320? (3.36 (6.02? 
621 45.0 51.7 44.2 63.8 502 617 494 41.3 
(3.20)* (0.52 (3.36) (8.90) (0.08) (3.825 (0.20) (4.56) 
622 49.8 52.7 50.4 681 496 610 52.9 16.7 
(0.44) (4.34) (0.22 (16.2) (0.58) (400^ (226 (166? 
623 36.9 58.3 41.9 513 475 579 463 40.6 
; (16^ 41.3)” (09^ (214) (2.14) (1025 (3.50)? (11.4) 
624 42.3 53.1 44.6 54.6 456 75.8 2 i 
(2.70)? (42  (Q74^ 2.92)" (160 (750  - = 
AV. 38.9 54.5 44.8 581 473 612 481 30.9 
(2.68? (1.63) (201  (Q.52? (2.31) (435* (1.81) (5.97) 





“Significantly different from 50 percent at the 5 percent significance level, 2-Tailed 


t-Test. 


t-Test. 


t-test). Six of the eight rats were under clear 
control of the contingencies as their choice 
frequencies decreased back toward their 
original levels, the two exceptions being rats 
612 and 613. Thus, six of the eight subjects 
and the aggregate choice data show stan- 
dard Allais-type violations of expected util- 
ity theory. 

Tests of H2 in the southeast corner of the 
unit probability triangle show that it fails to 
hold. Rather, fanning in characterizes the 
data here. Under the initial implementation 
of condition 3, average choice of the more 
certain (E) alternative jumped to 58.1 per- 
cent (t= 2.52, p<0.05, 2-tailed t-test). 
Equally as important, with the return to 
pair 1 choices, mean percentage choice of 
the certain alternative, A, fell to 47.3 per- 
cent (t= —2.31, p «0.05, 2-tailed t-test). 
Choice was under reversible control, with a 
pattern of fanning in for six of the eight 
rats, 611, 613, and 621—624. The two re- 
maining rats, 612 and 614, were unrespon- 
sive to the changes in experimental condi- 
tions, rather than displaying any hint of 


>Significantly different from 50 percent at the 1 percent significance level, 2-Tailed 


fanning out. A second implementation of 
condition 3 confirmed and strengthened 
these results, as. subject 614 now showed 
clear fanning in, as well. 

The results of the sensitivity test are 
shown in the last column of Table 2. On 
average, the rats chose the more certain, 
dominated alternative (G) 30.9 percent of 
the time, well below the 50 percent mark. 
All the rats were responsive to the 1/8 
chance of a 1 sec delay probability, as the 
mean choice of the dominated alternative 
was below 50 percent in all cases, and one 
can reject indifference between prospects 
for all subjects at the 1 percent significance 
level or better, including rat 612, who was 
largely unresponsive to the other experi- 
mental manipulations. These results show 
the rats to be sensitive to delay probabilities 


*MacDonald, Kagel, and Battalio (1989) report 
comparable failures of fanning out for rats over gains 
(food rewards) in previously unexplored areas of the 
unit probability triangle (the northwest corner, in this 
case). 
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as small as 1/8. Hence, we cannot attribute 
the failure of H2 here to any inherent in- 
ability to recognize such low probability out- 
come events. 


IIl. Experiment 2: Humans’ Choices 
Over Losses 


Studies of human choices over losses in- 
volved responses to a series of questions 
requiring subjects to indicate which of two 
gambles they preferred. All questions were 
of the following form: 


Example 3 


A: Losing $14 if 1-100 
B: Losing $20 if 1-70 
Losing $0 if 71-100 
Answer: (1) I prefer A. (2) I prefer B. (3) 
Indifference 


The numbers following each dollar amount 
referred to 100 numbered poker chips in a 
bowl sitting in front of the room, one of 
which would be drawn to determine payoffs. 
Thus, if a subject chose A, he or she would 
lose $14 with certainty as any numbered 
poker chip drawn gave this result. If, how- 
ever, the subject chose B, and a chip num- 
bered 1-70 was drawn, he or she would lose 
$20, but if a chip numbered 71-100 was 
drawn, the subject would lose $0. 

Subjects were paid off on a single ques- 
tion, determined at random, through the 
draw of a poker chip, after all the questions 
were answered. They then drew a poker 
chip from the bowl of 100 chips to deter- 
mine the actual payoff for that question. 
Immediately prior to answering these ques- 
tions, subjects answered three hypothetical 
questions and went through the two-stage 
chip drawing procedure, to ensure their un- 
derstanding of the payoff process. Subjects 
were given $30 cash balances against which 
to cover the possibility of losses. Payoffs 
were set so that all subjects would walk 
away with positive net earnings. 


5See Battalio, Kagel, and Jiranyakul (in press) for 
details of the experimental procedures employed. 
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Paying off subjects on a single question 
eliminates wealth effects from directly dis- 
torting preferences. However, papers by 
Charles Holt (1986) and Adi Karni and Zvi 
Safra (1987) show conditions under which 
answers to questions in a series of ques- 
tions, only one of which will result in a 
payoff, will not be independent of the set of 
questions posed. This criticism applies 
to both the Gordon M. Becker, Morris H. 
DeGroot, and Jacob Marschak (1964) pro- 
cedure for determining certainty equivalent 
values of gambles, and to simple elicitation 
of preferences over a series of paired alter- 
natives, such as employed here. Responses 
to each question are independent of the set 
of questions posed if the independence ax- 
iom of expected utility theory holds, or sub- 
jects choose as if each gamble will actually 
be selected (as in the "isolation effect" in 
prospect theory). To test for independence, 
Colin Camerer (1989) has permitted sub- 
jects to change their choices once the ques- 
tion to be played out had been selected. 
Few subjects changed their choices. An- 
swers reported here come from two sep- 
arate series of questions, with several iden- 
tical questions across the two series 
producing strong repeat reliability in the 
answers (Raymond C. Battalio, John H. 
Kagel, and Komain Jiranyakul, in press). 
This, too, is consistent with the indepen- 
dence assumption, as there were substantial 
differences in the other questions included 
in the two series. Since the data below show 
frequent violations of the independence ax- 
iom, we conclude that subjects treated each 
gamble separately, either in response to our 
urging them to do so or out of a natural 
isolation effect of the sort promoting the 
absence of asset integration reported ‘in a 
number of choice experiments (Kahneman 
and Tversky, 1979; Camerer, 1989; Battalio, 
Kagel, and Jiranyakul, in press). 


SIn the experiment using rats, subjects were paid off 
following each response. However, the payoffs, in terms 
of water reinforcement, were independent of the alter- 
native chosen. In this case, there were no wealth ef- 
fects in the sense that the outcomes of the prospects 
had no effect on the physical state of the rats. 
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TABLE 3—Cuolces Over Losses: HUMAN SUBIECTS 








Set 1: Allais-Type Common Ratio Changes 
A,(- $14, 1.0) or B,(— $20, 0.70) 
A,(—$14, 0.20) or B2(— $20, 0.14) 


Patterns 
Possible Choice Consistent Choice Frequencies 
Patterns with Set 1 
AA EU 10 
BB EU 5 
BA Fanning Out 10 
AB Fanning In 4 





Set 2: New Tests of Fanning Out 
2.1 A3(—$20, 0.74; — $14, 0.20) or B4( — $20, 0.88) 
A,(— $14, 0.90) or B,{— $20, 0.63) 
2.2 As(.—$20, 0.60; — $14, .40) or B,(—$20, 0.88) 
A4 $14, 0.90) or B,(— $20, 0.63) 


Patterns 
Possible Choice Consistent Choice Frequencies 
Patterns? with Set 2.1 Set 2.2 
AA EU 7 3 
BB EU 11 17 
BA Fanning out 1 5 
AB Fanning in 15 7 


*Stochastically dominated alternatives listed first. 


Table 3, set 1, shows one set of Allais-type 
common ratio alternatives employed (these 
are illustrated in Figure 2). The gambles 
employed are shown first, with the first 
number in brackets showing the dollar 
amount of the loss, followed by its probabil- 
ity (as a convention, we omit displaying zero 
loss probabilities). Below this are shown in- 
dividual subject responses to the questions. 
The four possible choice patterns in moving 
from higher to lower probability losses are 
specified, along with their relationship to 
expected utility theory and the fanning out 
hypothesis. H2, as specified in Machina 
(1982, 1987), is a weak inequality that in- 
cludes expected utility theory as a special 
case. Consequently, the fanning out hypoth- 
esis has power here to the extent that it 
organizes deviations from expected utility 
theory. It does this nicely, as a little under 
half the responses violate expected utility 
theory, and of these, 71.4 percent of the 
deviations correspond to fanning out rather 
than fanning in, far more than one would 
expect on the basis of chance factors alone.’ 


"Fanning out constitutes one of two possible devia- 
tions from expected utility theory. A binomial test 


Choice sets 2.1 and 2.2. report tests of 
fanning out over losses in the southeast 
corner of the unit probability triangle (see 
Figure 2). In set 2.1, expected utility theory 
organizes 52.9 percent of the data com- 
pared to 50 percent on the basis of chance 
factors alone. Fanning out breaks down 
completely here, as fanning in organizes 
44.1 percent of the data, well beyond ex- 
pectations based on chance factors alone 
(p « 0.01). 

Choice set 2.2 replaces prospect A, in set 
2.1, with its 6 percent chance to escape 
without losses, with prospect A;, which 
guarantees losses, should it be chosen. The 
certainty of losses reduces the attractiveness 
of the A alternative, thereby producing 
greater conformity with expected utility the- 
ory and increasing the frequency of clear 
fanning out, although fanning out still fails 
to explain a mzjority of the deviations from 
expected utility theory. 





statistic comparing observed frequencies with expected 
frequencies and a 10 percent significance level is em- 
ployed in making these statistical evaluations, 


VOL. 80 NO. 4 


1.0 X,» X, > X, 
$0 -$14 -$20 





P: 


FiGURE 2. Prospects EMPLOYED IN TESTS OF 
FANNING Our wiTH HUMAN SUBJECTS. 
Prospects CORRESPOND TO THOSE SPECIFIED IN 
TABLE 3 


The relatively superior performance of 
fanning out in the southeast portion of the 
unit probability triangle when the prospects 
are located on the edges of the triangle, 
compared to the interior, is characteristic of 
responses to other loss questions as well 
(Battalio, Kagel, and Jiranyakul, in press). It 
indicates that certain losses are much less 
attractive than uncertain losses with equal 
expected value when they provide some 
chance of escaping losses completely. Intro- 
ducing a chance to escape losses on both 
alternatives promotes choice on the basis of 
the lowest loss alternative. This produces 
fanning out and classic Allais-type com- 
mon ratio effects as in choice set 1, but 
it can also produce fanning in, as in choice 
set 2.1.8 


3Battalió, Kagel, and Jiranyakul (in press) report 
mirror image responses for tests of fanning out over 
gains in the northwest corner of the unit probability 
triangle as well: Prospects located on the edges of the 
unit probability triangle are much more likely to satisfy 
weak fanning out compared to cases where both alter- 
natives involve some possibility of zero gains. John 
Conlisk (1989) reports similar results as well. 
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IV. Summary and Conclusions 


Results from two sets of experiments are 
reported, testing for fanning out of indif- 
ference curves over random prospects in- 
volving losses. For both rats and humans, 
classic Allais-type violations of expected 
utility theory are reported. However, new 
tests -of fanning out in the southeast corner 
of the unit probability triangle show system- 
atic fanning in over losses, contrary to 
Machina’s (1982, 1987) H2. Lotteries in this 
corner of the loss triangle consist of highly 
positively skewed distributions over losses 
(i.e., most of the probability mass is on the 
left of the distribution, toward the maxi- 
mum negative value), so that there. is a 
“long left tail.” 

Other tests, over gains, for both human 
and animal subjects (Battalio, Kagel, and 
Jiranyakul, in press; MacDonald, Kagel, and 
Battalio,.1989) show comparable results: Al- 
lais-type violations of expected utility theory 
for standard manipulations, fanning in in 
previously unexplored areas of the unit 
probability triangle (in the northwest corner 
of the triangle in this case). These are largely 
negative results, as we do not offer an alter- 
native model to explain the data. There are 
models that can do the job, however; in 
particular, prospect theory (Kahneman and 
Tversky, 1979) can organize these patterns 
of behavior (although it does not necessarily 
imply them). But more broad-based tests of 
these alternative theories show important 
deficiencies as well. (See Camerer, 1989; 
Battalio, Kagel, and Jiranyakul, in press; 
David W. Harless, 1987; and Chris Starmer 
and Robert Sugden, 1987, for comprehen- 
sive tests of a number of nonexpected utility 
models.) Hence, we hesitate to declare a 
winner in terms of which theory best orga- 
nizes the data. Nevertheless, the results here 
are important, given the surprising power of 
H2 in organizing a number of established 
violations of expected utility theory (Mac- 
hina, 1982, 1987) and the preference rever- - 
sal phenomena (Safra, Segal, and Spivak, 
1988). The results also establish some new 
stylized facts for theorists to organize in 
their efforts to develop alternatives. to ex- 
pected utility theory as a descriptive model 
of choice under uncertainty. 
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Preference Reversal and Nonexpected Utility Behavior 


By Zvi SAFRA, Uzi SEGAL, AND Avia SPIvAK* 


The preference reversal phenomenon, 
first discovered by Sarah Lichtenstein and 
Paul Slovic (1971), puzzled economists for a 
long time, as it seemed to indicate nontran- 
sitive preferences (see David Grether and 
Charles Plott, 1979). This phenomenon 
emerges when a decision maker prefers lot- 
tery A to lottery B, but sets a higher selling 
price on B than on A. Three recent works 
(Charles Holt, 1986; Edi Karni and Zvi 
Safra, 1987; and Uzi Segal, 1988) proved 
that this apparent nontransitive behavior 
strongly depends on the expected utility hy- 
pothesis and on the special mechanism used 
to find decision makers’ selling prices of 
lotteries. When interpreted as a two-stage 
lottery with either the independence axiom 
or the reduction-of-compound-lotteries ax- 
iom violated, this mechanism does not elicit 
decision makers’ true certainty equivalents 
of lotteries.* 

Holt raised the question of a possible 
connection between the preference reversal 
phenomenon and other types of nonex- 
pected utility behavior. This paper tries to 
answer that question. We show below that 
the same conditions that imply preference 
reversals also imply the Allais paradox 
(Maurice Allais, 1953) and the common- 
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‘The main point of Holt's explanation is slightly 
different, since he is more concerned with the random 
lottery incentive mechanism that was used to control 
for wealth effects. In this paper we rather put the 
emphasize on the price selection mechanism, See Sec- 
tion I. 

?AmoS Tversky, Paul Slovic, and Daniel Kahneman 
(1990) have recently proposed a different explanation 
for the preference reversal phenomenon. They argue 
that what appear to be violations of transitivity may be 
a result of a bias introduced by the mode of reporting 
the prices. 
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ratio effect (Kenneth MacCrimmon and Stig 
Larsson, 1979). In other words, we can ex- 
plain the preference reversal phenomenon 
not only as a transitive nonexpected utility 
behavior, but also as consistent with otaer 
violations of this theory. 


I. The Preference Reversal Phenomenon 


Let A=(x,p;y,1—- p) be a lottery. This 
lottery yields x dollars with probability p 
and y dollars with probability 1— p. We 
assume throughout that all prizes are 
bounded within a [— M, M] segment. Let 
> be the decision maker's preference rela- 
tion over lotteries. The functional V repre- 
sents this relation if V/A) z V(B) = A 2 B. 
The decision maker's certainty equivalent of 
A (CECA)) is defined implicitly as that num- 
ber z for which (z,1) ~ A. To find its value, 
Gordon Becker, Morris DeGroot, and 
Jacob Marschak (1964) suggested the fol- 
lowing monetary incentive: Let the decision 
maker announce a selling price of A. Next, 
select at random an offer price from an 
[a, b] segment that includes this announced 
price. The decision maker will win the offer 
price if it exceeds the selling price. On the 
other hand, if the offer price is less than the 
selling price, the decision maker will keep 
the lottery and play it. 

Intuitively, decision makers’ optimal 
strategy would be to announce their true 
certainty equivalents of A. Otherwise, they 
might be forced to sell the lottery while 
willing to keep it, for example, when CE(.4) 
=3, the selling price is 1, and the offer 
price is 2. Alternatively, they might be forced 
to keep the lottery while willing to sell, for 
example, when CE(A) = 3, the selling price 
is 5, and the offer price is 4. Nevertheless, 
carefully conducted experiments proved that 
subjects have a strong tendency to reverse 
their preferences, that is, to prefer a lottery 
A to B, but to set a higher selling price on 
B than on A (Lichtenstein and Slovic, 1971; 
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Grether and Plott, 1979; Werner Pom- 
merehne, Friedrich Schneider, and Peter 
Zweifel, 1982; and Robert Reilly, 1982). 
Such behavior violates transitivity, one of 
the most fundamental assumptions in eco- 
nomics. 

To solve this paradox, Karni and Safra 
(1987) suggested a different interpretation 
of the Becker-DeGroot-Marschak mecha- 
nism. Rather than a simple lottery, they 
analyzed this mechanism as a two-stage lot- 
tery without the independence axiom. Let 7 
be the decision maker’s announced selling 
price of the lottery A. Lottery A is won with 
a probability of (m —a)/(b — a), while a 
(b —«)/(b — a) mass of probability is 
spread uniformly over the [7, b] segment. 

While accepting the reduction-of-com- 
pound-lotteries axiom, Karni and Safra 
proved that the decision maker's optimal 
announcement, (A), always equals the true 
certainty equivalent of A if and only if the 
subject is an expected utility maximizer. The 
preference reversal phenomenon thus be- 
comes just more evidence against expected 
utility theory, but does not prove intransi- 
tivity. (For other transitive interpretations 
of this phenomenon, see Holt, 1986; and 
Segal, 1988). ` 

The question that naturally follows is 
: whether the preference reversal phe- 
nomenon is consistent with other evidence 
against expected utility theory? In other 
words, does the preference reversal phe- 
nomenon belong to the large family of non- 
expected utility behavioral patterns that can 
be uniformly explained within the standard 
nonexpected utility models (Mark Machina, 
1982; John Quiggin, 1982; Chew Soo Hong, 
1983; Eddie Dekel, 1986; and Menahem 
Yaari, 1987), or does it present us with an 
entirely different problem, not yet ana- 
lyzed? 

. Our main conclusion here is that there 
does exist a connection between preference 
reversals and other nonexpected utility phe- 
nomena like the Allais paradox and the 


*For a discussion of the connection between the 
preference reversal phenomenon and the common-ratio 
effect within a different approach, see Holt (1986). 
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common-ratio effect. Moreover, almost the 
same condition as Machina's Hypothesis II, 
which he proved explained these paradoxes, 
also explains the preference reversal phe- 
nomenon. A similar result for a specific 
functional form (that of Quiggin and Yaari) 
that does not satisfy either Machina's Hy- 
pothesis II or our coming assumptions ap- 
pears in Karni and Safra (1989). 

To explain this hypothesis and its connec- 
tion to the Allais paradox, let x < y <z be 
fixed prizes and consider the set A of lotter- 
ies over these prizes, A ={(x, p;y,1— p— 
q; z, q)). Each such lottery is represented by 
a point in the unit triangle in Figure 1 (see 
Machina, 1987). It follows by monotonicity 
that, for a given value of p, a higher q 
benefits the decision maker because a prob- 
ability mass shifts from the middle outcome 
to the high outcome. Similarly, for a given 
value of q, a lower p is preferred. It thus 
follows that the indifference curves in this 
triangle have a positive slope, with higher 
curves indicating a higher utility level. The 
expected utility of the lottery (x, p;y,l— p 
—4;z,Q) is plu(x) — uCy)]-- alu(z) — uCy)] 
t uCy); hence the slope of the indifference 
curves is [u(y) - u(x)l/Iu(z)— u(y), and 
the indifference curves are parallel straight 
lines (Figure 1a). 

Let x = 0, y — 1000000, and z = 5000000, 
and consider the four lotteries A = 
(x, 0.9; z,0.1), B = (x, 0.89; y,0.11), C= 
(x, 0.01; y, 0.89; z,0.1), and D=(y,1) (Fig- 
ure 1b). For A to be preferred to B and D 
to C, which is the Allais paradox, indiffer- 


ence curve f must be steeper than a. This 


is the “fanning out” effect, also known as 
Machina’s Hypothesis II, which means that 
for p' « p and q'> q, the slope of the indif- 
ference curve at (p',q?) is higher than the 
slope at (p,q). (This refers, of course, to 
two different indifference curves.) 

This hypothesis also explains the com- 
mon-ratio effect, where for x «y, (0,1— 
p;x, p) - (0,1— q; y, q), but for A <1, (0,1— 
Aq; y,Àq) » (011— Ap; x, Ap). For example, 
as Daniel Kahneman and Amos Tversky 
(1979) found (Gn a hypothetical exper- 
iment), (3000, 1) > (0, 0.2; 4000, 0.8), but 
(0, 0.8; 4000, 0.2) > (0, 0.75; 3000, 0.25). This 
occurs when the indifference curve through 
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FIGURE la 


(0,0) and above (0.2,0.8) is steeper than the 
one that goes through (0.75,0) and below 
(0.8, 0.2). 

This graphical representation holds only 
for triple-outcome lotteries, but the results 
hold for general lotteries as well. Let F and 
G be two cumulative distribution functions. 
We say that F dominates G by first-order 
stochastic dominance and denote it by 
F > G if, for every x, F(x) < G(x). A pref- 
erence relation satisfies the first-order 
stochastic dominance axiom if F is pre- 
ferred to G whenever F = ,G. For a general 
distribution function F, let Up be the deci- 
sion maker's local utility at F (see Machina 
1982, 1987). For small changes in the distri- 
bution, the decision maker's behavior can 
be approximated by the expected utility of 
U,. Formally, 


Y(G)- VCF) = fu«G) dG — [U,(x) dF 


+ o(lG — FI), 


where, as before, V represents. the prefer- 
ence relation >. 

Let R*(x) = —UgGO/Ug(x). Machina 
made the following assumption concerning 
this expression: 


ASSUMPTION H2 (Hypothesis ID: If F 
dominates G by first-order stochastic domi- 
nance, then for every x, R*(x)> R(x). 
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Note that in Figure 1(b) C dominates B 
by first-order stochastic dominance. We later 
show that H2 implies that the slope of the 
indifference curve at C is higher than the 
slope of the indifference curve at B. We will 
use a similar assumption tó analyze the 
preference reversal phenomenon. 


IL. Necessary and Sufficient Conditions for 
Preference Reversals 


A preference reversal is established 
whenever a decision maker prefers lottery 
A to B, but sets a higher selling price on B 
than on A. In most of the experiments that 
found such reversals, B was (almost) a 
mean-preserving spread of A. To relate 
preference reversals to mean-preserving 
spreads, we must hypothesize about the 
manner in which the local utility function 
changes when the decision maker moves 
from one distribution to a mean-preserving 
spread of that distribution. Let F 2 4G if G 
is a mean-preserving spread of F and make 
the following assumption: 


ASSUMPTION A2: If G is a mean-preserv- 
ing spread of F, then for allx, RF(x) > R9(x). 


A2 is closely related to Machina's As- 
sumption H2, where F was assumed to 
dominate G by first-order rather than sec- 
ond-order stochastic dominance. These two 
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assumptions relate to changes in absolute 
risk aversion across distributions and should 
not be confused with the change in R^ 
induced by a change in x (Machina, 1982; 
Hypothesis I refers to such changes). In- 
stead of Assumptions A2 and H2, one can 
make the following two assumptions: 


ASSUMPTION A3: If G is a mean-preserv- 
ing spread of F, then for all x, R(x) < R(x). 


ASSUMPTION H3: If F dominates G by 
first-order stochastic dominance, then for all 
x, R*(x) x R8(x). 


Our analysis requires a few more defini- 
tions. The function V represents risk aver- 
sion if F >G implies V(F) > V(G). It rep- 
resents risk loving if Fz.,G implies V(F) 
<V(G). It is quasi-concave (convex) if for 
a€[0,1, V(F)-V(G) implies V(aF + 
(-a@)G)2V(F) WF) z VaF +- 
a)G)). If it is both quasi-concave and 
quasi-convex, then it satisfies the between- 
ness axiom (Chew, 1983; Dekel, 1986). If V 
is quasi-concave (convex), then indifference 
curves in the triangle are convex (concave). 
Betweenness implies straight (but not nec- 
essarily parallel) indifference curves. 


PROPOSITION 1: Let the function V satisfy 
the betweenness axiom. If it represents risk 
aversion, then Assumption H; holds if and 
only if Assumption A; holds as well, i= 2,3. 
If V represents risk loving, then Assumption 
H; holds if and only if Assumption A; holds, 
i+ j, i,j € {2,3}. 


The various assumptions and Proposition 
1 are easily interpreted in Machina’s trian- 
gle. If R^» RS, then the slope of the indif- 
ference curve through F is greater than its 
counterpart through G (see Figure 2). Com- 
paring the movement from F to F' with 
that from G to G', we see that, in the first, 
a greater probability of the high outcome 
must compensate for an increased probabil- 
ity of the low outcome. This is so because 
an increase in the probability of the highest 
outcome z contributes less to total utility, 
because of the higher risk aversion at F. 
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FIGURE 2 


More formally, 


Aq m U5( y) - Us(x) 
Ap V(G) = const. Ug(z) - UcCy) i 





This expression increases if G is replaced 
by F such that RF > RC, or equivalently, 
when U,(x) = y[U5(x)), where y is an in- 
creasing concave function. 

Proposition 1 is illustrated in Figure 3 for 
H, = A,. Betweenness implies straight in- 
difference curves. Unlike expected utility 
theory, they are not necessarily parallel. The 
expected value of the lottery (x, p; y, 
1—p-4;z,q) is given by w= p(x—y)+ 
a(z — y) * y. The locus of lotteries with a 
given mean yp is a straight line with a slope 
of Cy — x)/(z — y). It follows from (1) that, 
if the decision maker is risk averse (in which 
case the local utility function is concave), 
then for a general distribution F, 


Uk (y)-Ug(x) v»-x 
U-(z) - Uc(y) z=y` 


Therefore, the slope of the indifference 
curves is higher than the slope of the line 
along which u is a constant (see Figure 3). 
Because higher indifference curves repre- 
sent higher utility levels, risk aversion is 
expressed by the fact that moving in the 
northeast direction on the line along which 
p. equals a constant, that is, affecting mean 
preserving spread, reduces the decision 
maker's utility. 
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FIGURE 3 


By Assumption H2, risk aversion (hence 
the slope of the indifference curves) at 
points above and to the left of G (for exam- 
ple, F) is higher than at G. By Assumption 
A2, risk aversion becomes higher as we move 
to the left on the line along which u equals 
a constant (for example, to H). Obviously, 
if indifference curves are straight lines, then 
Assumption H2 holds if and only if As- 
sumption A2 is satisfied. Using the same 
diagram for quasi-concave or quasi-convex 
preferences, we can show that, for lotteries 
with three outcomes, under risk aversion 
and quasi-concavity Assumption A2 implies 
Assumption H2. Also, under risk aversion 
and quasi-convexity H2 implies A2. 

Having clarified the meaning of the as- 
sumptions, we can now state the major te- 
‘sult of this paper. For each lottery A, let 
v CA) be the decision maker's optimal value 
of m, the announced selling price of A. We 
assume henceforth that this maximum is 
unique (see the appendix). Let 


TT 





-a sep 
L(A,«) = boat bag okt) 


where Q(7,b) is the uniform distribution 
over [7,5]. LCA, 7) is the (compound) lot- 
tery obtained from the lottery A by the 
Becker-DeGroot-Marschak mechanism 
when the decision maker declares the 
selling price m. If 4A-(x,p;yl-p) 
then L(A,T) yields x with probability 
p(w — a)/(b — a), y with probability (1— p) 
Qr — a)/(b — a), and a (b — m)/(b — a) 
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probability mass, uniformly spread over the 
Ur, b] segment. Let U^ —U;, , meag and U^ 
= Urg, (4) be the local utilities of the lot- 
teries obtained from A and B with A's 
optimal selling price. 


THEOREM 2: Let A» B. 

a. If.V represents risk aversion and satisfies 
Assumption A3, then preference reversals 
will not occur. A sufficient condition for a 
preference reversal is that [U? dB » 
UP(z CAP. 

b. If V represents risk loving and satisfies 
Assumption A2, then preference reversals 
will not occur. A sufficient condition for a 
preference reversal is that [UP dB < 
UP?(vCAD. i 


The sufficient conditions indicate that 
preference reversals between A and B re- 
quire a sizable change in the absolute mea- 
sure of risk aversion between L(A,7(A)) 
and L(B,7(4)). To illustrate the inequality 
condition in Theorem 2(a), consider Figure 
4. Here A=(x, p; y,1— p). Note that, be- 
cause in this example U^ (x) - U?(x) and 
U^(y) = UY(y), [JUP dA = fU^dA. The 
first-order condition for maximization is 


ð 
—V(L(A,m)) =0. 


oT 7 —mCA) 


As in the proof of Theorem 2 (see Ap- 
pendix), this condition implies that (U^ dA 
— U^(s (A). Let A = JUPa(A — B) (A > 0). 
The condition now becomes U^(z(A)) » 
UP(zCA)) A. This implies that the local 
utility U^ is more concave than U^, and the 
value it assigns to 74) is much larger than 
U?(z( A). 


The conditions for existence or nonexis- 
tence of preference reversals somewhat re- 
semble those leading to a Giffen good. 
Assume first that V represents risk aver- 
sion. The movement from A to B has two 
effects. Under Assumption A3 they work in 
the same direction, to reduce sr, while un- 
der Assumption A2 they work in opposite 
directions. First, because B is worse than 
A, it will have a smaller y for the same 
local utility. Second, the local utility changes 
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uA(ar(a)) 
UPUT(A)) +4 
UBP(T(A)) 


X TT (A) y 


FIGURE 4 


from L(A,7(A)) to L(B,7(A)). The sign 
of the change depends on whether Assump- 
tion A2 or A3 is satisfied. Under Assump- 
tion A3, since B is a mean-preserving spread 
of A, the consumer is more risk averse at 
L(B,7(A)) than at L(A,7(A)) and hence 
gives the risky prospect B an even lower 
certainty equivalent. Under Assumption A2, 
the certainty equivalent of B increases. 
Thus, under Assumption A3 risk averters 
never show preference reversals, while un- 
der Assumption A2 they might. A similar 
argument explains the case of risk loving. 

Two opposing effects also appeared in all 
the experiments such as Allais’, the com- 
mon-ratio effect, and others. Machina (1982) 
proved that a sufficient, deviation from ex- 
pected utility, provided by the H2 assump- 
tion, implies these behavioral patterns. 
Strikingly, the same Assumption H2 also 
relates to the preference reversal phe- 
nomenon. Consider functionals satisfying 
betweenness. By Theorem 2, preference re- 
versals do.not occur under Assumption H3 
but might occur if Assumption H2 is satis- 
fied. A sufficient condition for its occur- 
rence is the fast fanout of indifference 
curves. 

This observation agrees with consistent 
experimental results displaying a fairly high 
percentage of preference reversals (50-60 
percent) among those who preferred the 
safer option and a fairly low percentage of 
preference reversals (10-15 percent) among 
those who preferred the riskier one. Our 
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explanation suggests that Assumption H2 is 
much more frequent among risk averters 
than among risk lovers. This conclusion is 
supported by experiments done by Marc 
McCord and Richard de Neufville (1984). 
In their experiments they tried to elicit util- 
ity functions using different approaches and 
found that their results contradicted the 
expected utility analysis. As shown by 
Machina (1987), their results indicate that 
Assumption H2 prevails for risk averters. A 
further look at their results indicates that, 
for risk lovers, Assumption H2 does not 
prevail. Our analysis thus agrees with these 
results as well. 

We conclude this section by showing how 
for any given pair of lotteries A and B that 
satisfy A > ,B, it is possible to build a func- 
tional displaying preference reversal be- 
tween them. The functional represents risk 
aversion, satisfies Assumption A2, and can 
be smoothed to be Frechet differentiable. 


Example. Let A and B be two given lotter- 
ies satisfying A>.B. Consider the ex- 
pected value functional u(F)= f[xdF(x). 
Clearly, for u we have (A) = (B) = uCA) 
= u(B). Define m = p(L(A, r(A))) = 
p(L(B,7(B))) and note that L(A,7(A)) 
> ,L(B,m(B)). Let u: R ^ R be a concave 
function such that N, defined by N(F)— 
fu(x) dF(x), satisfies (1) NCA) > N(B) (this 
follows by risk aversion), 


(2) N(3L(A,-(B)) 
+2L(B,7(B)))=m, 

and 

(3) N(L(B,m)) <m for all r. 


Conditions (2) and (3) are possible since u 
can be affinely transformed. Finally, let 


V(F) = max(u(F), N(F)}. 


Indifference curves of V are (informally) 
depicted in Figure 5, where the sets 
(LGB, m): m ER} and (LCA, 7): m €R} also 
appear and Q is the uniform distribution 
over [a, b]. 
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By construction we still have 7(B)- 
iB); therefore the local utility function at 
L(B,7r(B)) is linear. Furthermore, the local 
utility function at LCA,7(B)) is u and, by 
its concavity, u((B)) » fudA. This condi- 
tion is equivalent to the inequality condition 
in Theorem 2(a) (the roles of A and B 
were changed), which gives the existence of 
preference reversals between A and B 
(VCA) » VCB) and (A) « 7(B)). Figure 5 
clearly shows that the optimal point of 
{L(A,7): « ER} is to the left of 
L(A,7(B)), so a (A) « (B). Of course V 
does not represent strict risk aversion, but 
this can be achieved by replacing » with an 
expected utility functional having a concave 
and almost linear utility function. Frechet 
differentiability of V results from smoothing 
out the kinks of V without affecting the 
existence of the preference reversal. Finally, 
V satisfies Assumption A2. Indeed, let F 
> G such that V(G)= N(G) (this is the 
only interesting case). But then N(F)> 
N(G) x &(G) (by risk aversion and the def- 
inition ol of V), thus VCF) = NCF), and clearly 
RF = RY, 


JH. Summary and Conclusion 


Preference reversals seem to violate tran- 
sitivity, one of the most fundamental as- 
sumptions in economics. They occur when a 
decision maker prefers lottery A over B, 
but puts a higher selling price on B than on 
A. From the economist's point of view, these 
results are even more troublesome when 
subjects are offered a clear monetary incen- 
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tive to reveal their true preferences. It was 
assumed that the Becker-DeGroot- 
Marschak mechanism creates this incentive. 
Karni and Safra (1987) and Segal (1988) 
showed that this is not the case if either the 
independence axiom or the reduction of 
compound lotteries axiom is violated. In this 
paper we have shown that, given Karni and 
Safra's interpretation, similar conditions to 
those implying the Allais paradox and the 
common-ratio effect also imply preference 
reversals. Holt (1986) also discussed such a 
connection, but his explanation is slightly 
different (see fn. 1). Furthermore, we have 
shown that this analysis also explains why 
the preference reversal phenomenon is more 
common among risk averters than among 
risk lovers. 


This paper predicts that if risk-averse de- ^ 


cision makers display preference reversals, ' 
they will also behave according to the Allais 
paradox and the common-ratio effect. The 
opposite, however, is not necessarily true. 
Further experiments can examine these pre- 
dictions. 


APPENDIX 
PROOF OF PROPOSITION 1 


We only prove that if V represents risk aversion, 
then H2 = A2. Assume that Assumption H2 is satis- 
fied and take F z,G; thus, V(F)z V(G). Let 5, = 
(x,1) be the distribution that assigns all its mass to 
point x. ôm —(M,1) dominates G by first-order 
Stochastic dominance and is preferred to F; hence 
there exists H 2.,G such that V(H) — V(F). By be- 
tweenness, R” = R^ and by Assumption H2, R” = R9; 
hence RF> RC, which is Assumption A2. 

Assume A2, let F > ,G, and let 4(G) be the mean 
of G. If Vôo) z VCF), then let G(8) 8G - (1— 
B)8,(G) with 0 < 8 <1 such that V(G(B)) = VIF). Since 
G(B) = 2G, it follows by t betweenness and Assumption 
A2 that R^ = RC? > RY, 

Assume now that VS) « VCF). Without loss of 
generality, G ¥ 5,46). (Otherwise, take K#G such 
that V(K) — V(G).) By risk aversion, V(8,(5:) > V(G). 
Define G, = max!G + t}, Ôn} such ‘that VG) = 
V(, (oy) CF ^ t is a t-length rightward shift of F). Such 
ty exists because M > (G). Clearly, V(845,)) > V(G). 
Continue to define G, = max(G, .. , + tns Ông} such that 
VG.) = Vy, D) while VEG) 2 VG, ). The se- 
quence u(G,) is increasing and ‘converges to Syy. 
(Otherwise, G, > K and V(ê e) - V(K) while K # 
Su(K),a contradiction. ) Thus, for a sufficiently large n, 
Vrap) > VLC) = V(G,) and, by. Assumption A2, 
RO» = R5uc, 02 ROn- > --- > RG, Define G XB) = 
BG, * (1 — B)5,¢g,) and continue as before. 


VOL. 80 NO. 4 


Uniqueness of the Local Maximum in the Case of Quasi- 
Concavity. We show that if V is quasi-concave, then 
d/dwV(L(A,w)|,.,--— 0 implies that z*'is the 
global maximum; that is, m* = «(A). 

Consider the expected utility functional given by 
EU(F) = fUj 4. ,- dF. By first-order conditions, 
EU(A) = EU(à,.); thus * = CECA) for the local util- 
ity function Uzta, 7* is a global maximum of 
this function; hence [Uj(4 ne) dL(A, T*) > 
[Ur 4,9) ALA, 7) for all a. If m* is not a maximum 
of V, then there is m x 7* such that V(L(A,7))> 
V(L(A, 7*)). But then, by quasi-concavity, 
fUr 4,9) d CA m) L(A, 7*)]> 0, which is a contra- 
diction. 


PROOF OF THEOREM 2 

a. Necessity. Let V satisfy Assumption A3. Because of 
risk aversion, V(4) > V(B) and preference reversals do 
not occur if +(A)>7(B). Following Karni and Safra 
(1987), let 2L(B,m) be the derivative of the path 
L(B,7) with respect to its second argument, evaluated 
at m. By definition 


àL(A,m)(x) im TILA,” *te)- L(A,m)](x) 


li l[m-tse-a m-a ? 

lim =| bt+a -— (x) 
ie (oo peg 
im Bc (T 9 e, b) 

T 
-E| 


For x<, the last element is zero, and the value 
is A(x)/(b—a). For x», the second element is 
—1/(b-—a) (Since Elr + e, b)(x)= (x — Gr + 607 
(b — (Gr -- £)), and the value is (A(x)—1)/(b — a). It 
thus follows that 3LCA,7) — CA — 8,)/(b — a); hence 


à 
a, UA T)) 





are GARD) 


i [Uic scs LCA, 0 CA] 


1 
TES [Uca sca dL 4 7 ênan]: 


First-order conditions for maximum imply that this last 
expression equals zero. By Assumption A3 and because 
by the definition of L, LCA,T (A) » 4L(B,n CAD, it 
follows that fU,(s neay dL A — 84,4] «0. We thus ob- 
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tain 


d 
3; (4,7) 





manl) 


1 

=z [Ut may TB - Seca] 
1 

“b-a Im -4] 


+ f Uus say dL A — iu]. 


By risk aversion, the first term is negative, and by 
Assumption A3, the second one is negative as well. 
Hence d/dwV(L(B,m)|,.,(4) «0, which implies 
ar(A)> w(B). 

Sufficiency. As we proved above, the sufficient condi- 
tion is equivalent to 


à 
—V(L(B,7)) > 0; 
On 


ar = (A) 


hence it implies that (B) > (A). 

b. Let V satisfy. Assumption A2. By risk loving, 
V(B)» V(A) and fUr Ay dla- na> 0. This 
implies that d / dr (LB, m), s ay? Ó; thus «(B) 
> T ( A). The sufficiency is proved as in part (a). in| 
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Economic and Political Foundations of Tax Structure: 
Comment 


By HERBERT J. Kresyrnc* 


The paper by Walter Hettich and Stanley 
Winer (1988) on the economic and political 
structure of tax analysis is an important 
contribution, introducing a refreshingly pos- 
itive approach into an analytical tradition 
where normative judgments have long been 
overly important. But Hettich and Winer 
pursue the logic of their approach only part 
of the way. In this comment, I would like to 
extend the Hettich-Winer (HW) positive ap- 
proach to what I consider to be its logical 
conclusion, which should allow for a more 
complete analysis that avoids some possible 
methodological and ethical biases. 

The HW model features voter support of 
a government where support is based posi- 
tively upon the services received from a 
pure public good and negatively by tax costs 
including deadweight loss. Tax costs are af- 
fected by the choice of tax base, and differ- 
ent tax bases may involve different behavior 
responses (to avoid taxes) by different vot- 
ers. The government wishes to maximize 
expected support, written as 


(1) [b(G)-e(V)], . 


where G is the level of expenditures on all 
public goods, b; is the increase in probabil- 
ity of a yes vote by voter i because of G, vj 
is the cost in terms both of the tax payment 
t; and the deadweight loss d;(v; = t; + d;), 
and (— cj) is the decrease in the probability 
of a yes vote because of the cost. The 
amount of tax revenue the government can 
raise is in part dependent upon how much 
taxpayers change their behavior to avoid 
taxation. Such behavior is affected by the 
tax rate plus those taxpayer characteristics 


*Professor of Economics and Public/Environmental 
Affairs, Indiana University, Bloomington, IN 47405. I 
wish to thank Dave Wildasin and the referees for 
useful comments. 
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that affect how much taxes will affect the 
amount of tax-base activity he or she will 
supply. Given the existence of numerous 
possible tax-base activities, when the gov- 
ernment maximizes its expected support 
subject to available tax revenue and the 
behavioral characteristics of voters in sup- 
plying tax-base activity, it is found that “the 
politically optimal tax structure requires 
marginal political opposition per dollar of 
tax revenue to be equalized across taxable 
activities for each taxpayer, as well as to 
be equalized across taxpayers for each activ- 
ity.”! In the absence of administrative costs, 
the optimal tax structure would be to tailor 
taxes specifically to the characteristics of 
each individual voting unit. With adminis- 
trative costs this is not possible, but the key 
idea remains that optimally the tax system 
should accommodate to individual voter cir- 
cumstances when the benefits of doing so 
outweigh the administrative costs. 

The extension I suggest involves introduc- 
tion of public goods more directly. With 
their G variable, HW only allow their model 
to consider the total expenditure level on 
public goods and not its composition, a pro- 
cedure that can lead to analytical and ethi- 
cal distortion.* In being concerned about 
the level of support for its tax system, the 
government will need to consider more de- 
tailed features of public expenditures so 
long as voter tax-base activity responses 
might be related to the public good mix that 
is provided, as is likely. Not only are there 
relations of complementarity and substi- 
tutability between public goods and differ- 
ent tax base activities, but voters will have 


!Hettich and Winer, 1988, p. 705. 

2To be fair, HW briefly consider the possibility of 
bringing in the benefit side in their fn. 16. On the other 
hand, in presenting their model, they present justifica- 
tions for separating taxes from expenditures (p. 703). 
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differing amounts of “tax morale” according 
to how they perceive public good benefits. 
Economists would never consider abandon- 
ing the marginal utility to price ratio as a 
key criterion for private good purchases, 
and there is no reason why in principle the 
public sector should not be seen as working 
in the same way. 

When the variability of government ser- 
vice mix and amount is added to the HW 
model, we have the government maximizing 
a support function, 


N 
(2) 2. D [ECG eM], 


where the G, are public goods and the 
other variables are as above. 
The government faces the constraint 


(3) LG,- Lt,B; = 0 


where the t; are tax rates and B, is the 
amount of taxable activity of the voter (tax 
base). With addition of multiple public 
goods into the model, and following HW 
otherwise, the B; are influenced as 


(4) B; = B,(t;, X;,G,)- 


Voters adjust their tax bases because of the 
tax rates, their personal characteristics con- 
cerning tax-base supply behavior (the xj), 
and the amount and kind of public good 
they perceive their taxes to be buying. 

Maximizing (3) subject to (4) with respect 
to the G, and f; gives 


B cdit 
(5) 75, 8 
ac, àV, 
av, OT, 
= B, 3B; ` 





The politically optimal tax structure has 
tax rates that equalize marginal political 
costs per dollar of additional revenue across 
taxpayers, where this includes effects of the 
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popularity or unpopularity of the public 
goods being provided. 

Finally, the above does not take into ac- 
count the fact that each voter has several 
tax bases (indexed by j) against which taxes 
are levied. Accounting for this gives 


TAII; mE 


Following HW, the second term in the de- 
nominator of the RHS of (6) is meant to 
capture interaction effects between taxes 
levied on one tax base and behavior adjust- 
ments in other tax bases. The second part of 
the expression allows for the possibility that 
these interactions might be affected by the 
nature of public expenditures.” 

If the HW approach is correct, politicians 
pay attention to these tax bases to benefit 
interactions as they seek to win elections. 
Such interactions fall into at least two cate- 
gories. First, there is a large variety of tech- 
nical interactions between public services 


?An interesting alternative way to illustrate the im- 
portance of this relationship is shown by Wildasin 
(1984) in discussing the implications of the following 
relationship between taxed goods or factors (tax bases) 
and government services: 
àBt ôB; 


1 Li 





where the first term on the RHS is the income-com- 
pensated effect of G, on the tax base and the second 
term is the marginal benefit of public good & times the 
income derivative of the supply function for B;. But it 
is not possible for both 6Bf/dG,; and 8B;/3B, to 
equal zero if MRS, is significantly positive (as it must 
be if the public good is not a waste) and if 3B; /a/ is 
very much different from zero (as it is for labor supply, 
for example). In a world full of existing taxes, all of 
them to some extent distortionary, changes in govern- 
ment expenditure cannot help but change tax base 
behavior in ways that affect welfare gains and losses, 
effects that may be far from negligible (welfare changes 
plausibly in excess of $0.80 per dollar of expenditure), 
Wildasin, 1984, p. 240. 
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and tax-base activities. These are well de- 
scribed in recent papers by David Wildasin 
(1984) and Assar Lindbeck (1982). Wildasin 
shows how a theoretical welfare maximiza- 
tion for public goods includes interactions 
of tax bases with public goods and presents 
simulations that show that for some activi- 
ties, labor in particular, the public service to 
tax-base interaction can be quantitatively 
important.* Lindbeck presents a detailed 
exploration of the interaction of four types 
of public expenditures (transfers, subsidies 
to goods, goods with externalities, pure pub- 
lic goods) with labor supply, where he traces 
income and substitution effects from both 
the taxing activities and the publically pro- 
vided goods. He argues convincingly that 
many of these activities have been of con- 
siderable historical importance.? 

The second reason for considering tax- 
expenditure interactions involves the fact 
that tax-base activity depends importantly 
upon perceived benefits or public goods and 
services. While it is true that citizens with 
the sense of public spiritedness possessed 
by angels would gladly cooperate fully. in 
their tax-base and compliance activities even 
when paying for goods and services they 
voted against, this is too much to expect 
from ordinary humans. There was a highly 


^ Wildasin shows that the proper Samuelsonian solu- 
tion to optimal public good levels, instead of the tradi- 
tional MRS = MRT equality, is 


0S; 

MRT -tiz 
MRS.= ———= 
14+L—e; 

Qi 


where z is the publicly provided good, the x; are 
tax-base activities and 1;; q;, and e; are tax rates, gross 
of tax prices, and demand elasticities, respectively. For 
purposes of the HW model, these relationships are 
important if taken into account by the voters. 

5For example, a publicly provided or subsidized 
good can be either complementary to leisure or a 
substitute for leisure according to the degree to which 
the leisure time is being used for home production of a 
competing good to the publicly provided good. For 
another example, if a pure public good is considered 
beneficial, a family might view its provision as raising 
its income, with associated income effects upon labor 
supply, lowering it if the worker is on the backward- 
bending portion of his supply curve. 
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respected tradition among tax experts in the 
nineteenth and early twentieth centuries 
concerning the value of using “differentia- 
tion” of tax bases for this reason, where 
“objects of taxation are judiciously diversi- 
fied in such a manner as to realize the ends 
desired,” as C. F. Bastable (1895) put it.° If 
HW are correct, political decision makers 
sensitive to voter reactions will look care- 
fully at perceptions of benefits on the part 
of the public as they design tax policies. 
This need not require that benefits be mea- 
sured in some objective fashion but only 
that decision makers perceive benefit pat- 
terns and are able to impress the public 
with the reasonableness of their percep- 
tions. 

The accuracy of the HW view of the 
world, as amended here, is an empirical 
matter, of course, although the idea that 
voters take both sides of the fisc into ac- 
count in considering the records of their 
elected representatives in tax matters seems 
a reasonable enough proposition. Nor is it 
an approach that has gone. unnoticed in 
contemporary policy discussions. Thus the 
director of the Congressional Budget Office, 
Robert Reischauer (1988), in a recent paper 
widely cited in the press, has proposed as a 
way for the federal government to deal with 
current budget problems that taxes be in- 
stalled that are earmarked for specific types 
of federal activities known to command 
widespread public support, doing this by 
shaping the taxes to fall on those getting 
benefits. Proposals specifically suggested by 
Reischauer include a broad-based energy 
tax or combination of gasoline taxes and oil 
import fees to support environmental pro- 
grams such as acid-rain relief, a new value 
added tax to support extended Medicare 
and Medicaid programs, and the taxing of 
Social Security income to support the Sup- 
plemental Security Income program.’ 


SBastable, 1895, p. 334. 

TA drawback of using this amended HW approach is 
the complexity that it adds. While important, lack of 
complexity is only one goodness criterion; there are 
many others. Many proposals such as those of Reis- 
chauer (1988) do not add undue complexity to the tax 
code. 
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- More generally, policies suggested by the 
amended HW approach would include ac- 
tions where the Congress would adjust tax 
changes to current emphases in expenditure 
changes: In periods such as the Reagan 
years when the defense establishment was 
getting most of the emphasis, these costs 
could be financed relatively more by taxes 
falling upon those citizens who are “hawks”; 
when the emphasis changes to increases in 
services of compassion, taxes could be de- 
signed to fall relatively more on those who 
consider such social goals important. Wel- 
fare recipients, who cannot be taxed di- 
rectly, might be easily persuaded to pay 
taxes in the form of in-kind payments of 
labor time. Public opinion polls, if carefully 
constructed, would be quite useful in facili- 
tating this process by helping politicians 
learn voter preferences, and thus potential 
votes, with more precision.® 


*If the amended HW model turns out to be a 
reasonably correct view of the world, then it follows 
that the analytical work of the “expert” students of the 
taxation process should reflect this, as was suggested in 
a paper by Henry Aaron a number of years ago (1969). 
Among other things, this would require that- tax 
economists abandon their long exposition of the in- 
come tax against a comprehensive tax base as the 
“optimal” tax policy. For recent statements of this 
supposedly ‘orthodox’ position, see Charles McClure 
and G. Zodrow (1987) and Joseph Pechman (1987). 
‘Also see Richard Musgrave (1967). 
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Technological Stagnation, Tenurial Laws, and Adverse 
Selection: Comment 


By NADEEM Naovi* 


In a recent article in this Review, Kaushik 
Basu (1989) has made an important contri- 
bution to the literature on technological 
stagnation resulting from underinvestment 
or reduced adoption of new technology as 
an outcome of certain tenurial laws in many 
less developed countries. Tenurial laws in 
several less developed countries mandate a 
tenant's right to continue occupancy of a 
property (land or apartment) indefinitely ex- 
cept under a very limited set of circum- 
stances;' the tenant, on the other hand, is 
prohibited by the same law from selling his 
tenancy rights. Under the law, the tenancy 
rights with fixed rental agreement are prior 
to, and nullify, any subsequent contract be- 
tween the tenant and the landlord should 
the tenant decide to renege. In effect, the 
land tenure laws in these countries prohibit 
landlords from writing quit-contingent 
tenure contracts.? 

As possible explanations of the typical 
lack of innovation in these countries, argu- 
ments have been put forward as to why it is 
not in the interest of the landlord (Gale 
Johnson, 1950) or why it is not profitable for 
the tenant (Michal Kalecki, 1976) to adopt 
an innovation These arguments only 
demonstrate why it is not in the interest of 
one of the parties to innovate, and, it can be 
shown that they inadvertently end up pro- 


*Department of Economics, University of Georgia, 
Athens, GA 30602. I am grateful to a referee for 
encouragement and valuable suggestions. Any remain- 
ing errors are my sole responsibility. 

In India, for instance, if a landlord owns only one 
house and if he can "prove" that he requires it for his 
own use, then he can usually have an eviction order 
served to his tenant. 

?1 am indebted to the referee for bringing my atten- 
tion to this feature of the land tenure laws. 

"Also see Pranab Bardhan (1984), Amit Bhaduri 
(1973), Avishay Braverman and Joseph Stiglitz (1986), 
and David Newbery (1975), among others. 
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viding an explanation of why the other party 
would, in fact, adopt the innovation. Such 
arguments are clearly inadequate for ex- 
plaining stagnation. Kaushik. Basu (1989), 
on the other hand, has provided an explana- 
tion as to why both the landlord and the 
tenant would not find it worth their while to 
adopt the innovation under such tenurial 
laws. While this is' an important step toward 
providing a complete theory of technologi- 
cal stagnation, Basu’s argument is adverse- 
selection based, and it applies only to a 
landlord who owns several tenant-occupied 
rental properties. It does not explain, how- 
ever, why a landlord with a single tenant- 
occupied rental property and his tenant do 
not innovate (p. 252). Without detracting 
from the importance of Basu’s explanation, 
which I believe is compelling in the context 
of a multiple property holdings case, it may 
be pointed out that the inability of his the- 
ory to explain stagnation among single- 
property owners and their tenants is a sig- 
nificant shortcoming, for a large proportion 
of the landlords in many less developed 
countries own only a single rental property, 
and these properties again are character- 
ized by a similar lack of innovation. It re- 
mains to be explained, therefore, why both 
a landlord with a single tenant-occupied 
rental property and the tenant do not find it 
worth their while to innovate, despite the 
innovation being such as to add more to 
property value than its cost. The purpose of 
this paper is to do just this. I provide a 
simple alternative and, I believe, a com- 
pelling explanation for the stylized fact. 

I demonstrate how an attitude of aversion 
to risk on the part of the landlord, or the 
tenant, or possibly both can eliminate all 
innovation-cost-sharing contracts that the 
two parties could otherwise have entered 
into. The risk-aversion-based explanation of 
technological stagnation applies equally well 
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to both single- and multiple-property own- 
ers and their respective tenants. This alter- 
native explanation also points to the fact 
that there are several reinforcing reasons 
for the typical lack of innovation in less 
developed countries; included in this list of 
reasons are the adverse selection problem 
and the problem arising from the agents’ 
attitude of aversion to risk, which together 
with the property rights laws in these coun- 
tries, contrive to produce a suboptimal out- 
come in terms of investment and innovative 
activity, and hence technological stagnation. 


I, Risk Aversion and Technological 
Stagnation 


Following Basu, I shall take an innovation 
to be an ordered pair (X,C), where X is 
the output consequent upon the innovation, 
and C is its cost. Considering only those 
innovations that add more to the value of 
the property than their cost, I impose the 
restriction that X—C>0; that is, I con- 
sider only those innovations that are socially 
profitable. 

Let q be the exogenously given probabil- 
ity that the tenant quits after the adoption 
of the innovation. This probability is usually 
greater than zero? If the tenant quits, the 
landlord appropriates the entire output X, 
or the additional rent of X from a new 
tenant. If the tenant continues to occupy 
the property, he appropriates the entire 
output X. The landlord is not permitted to 
write quit-contingent tenure contracts. Such 
is the law.® 


^Basu (1989) also considers only those innovations 
for which X — C > Q0. 

Tenant quit probability is greater than zero for 
various reasons. Job change or transfer is one reason. 
Another common reason in the case of India is tenant 
harassment by the landlord. 

This is certainly a facet of the Delhi Rent Control 
Act of 1958, as noted by Basu. The tenancy laws in 
other cities in India are roughly the same. Further, 
instead of the landlord's output distribution (X,0; 4, 
1— q), one could have (X,, X5;9,1— q), with X;> 0, 
and the argument developed here will go through as 
long as, given an output distribution, no further output 
sharing between the landlord and the tenant is possible 
under the law. 
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Assume that the landlord (L) offers a 
contract to the tenant (T) such that the 
landlord pays a fraction / of the cost of the 
innovation, with the remainder (1—J)C to 
be paid by the tenant.’ Also assume for the 
moment that both the landlord and the ten- 
ant are risk neutral. Then the value of the 
innovation to the landlord is 


(1) V1.4) = aX —IC, 


and the value of the innovation to the ten- 
ant is 


(2) Vi(La)-(1-a)X -(1- C, 


where V, and V are equal to the expected 
profits of the landlord and the tenant, re- 
spectively. 

Let X/C=1+r, so that r-(X —C)/C 
> 0 under the assumption that the innova- 
tion is sociallv profitable. For a given quit 
probability g, the landlord will find all cost- 
sharing contracts acceptable as long as 
V, (I, q) = 0, and the set of all such contracts 
is 


(3 Le(Lklz(1*r)q, 
r»0, 0xqxl1). 


The tenant, on the other hand, will accept 
all cost sharing contracts such that V;(/,q) 
> 0, and these are 


(4) T={l:l>-r+(1+r)q, 
r»0, 0xqx1). 


For a quit probability q, the equality in (3) 
determines the maximum cost share that 
the landlord is willing to bear, /, = (1+ r)q. 
Similarly, the equality in (4) determines the _ 


"1f whoever uadertakes the innovation has to bear 
its full cost, and ic both the landlord (L) and the tenant 
(T) are risk neutral, then neither may innovate. For 
V, = qX — C, the value of the innovation to the land- 
lord, which is his expected profit, may be negative even 
if X — C > 0. Similarly, V7 =(1— 9) X — C may be neg- 
ative even if . Y — C0. On this see Besu (1989, 
p. 252). 
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minimum landlord’s cost share that the ten- 
ant is willing to accept, lp — — r -(1- r)q. 
Further, since 





X 
(5) lj,-ip-2r- >0, 


C 


because the innovation is socially profitable, 
it follows that the maximum cost share that 
the landlord is willing to bear is greater 
than the minimum landlord's cost share ac- 
ceptable to the tenant. Therefore the set of 
all cost-share contracts acceptable to. all 
parties 


(6) | A-EnT-[h,] 


is unambiguously nonempty. Hence the in- 
novation will definitely be undertaken. In 
fact, the innovative activity in this economy 
will be socially optimal, despite the tenurial 
law that seemingly discourages innovation. 

I show next that if at least one of the 
agents—the landlord or the tenant—is 
sufficiently risk averse, then the set A of 
cost-sharing contracts is empty. In order to 
develop this argument, it is instructive to 
present the above argument for the risk- 
neutral agents graphically. In Figure 1, OC 
(or line L) represents for each quit proba- 
bility q the landlord's maximum acceptable 
cost share /,; all cost shares below OC. are 
acceptable to the landlord. For instance, for 
q q^, the landlord is willing to accept all 
costs shares /</?. The segment AB (or 
line T), on the other hand, represents for 
each q the minimum landlord's cost share 
lr acceptable to the tenant; all cost shares 
above AB are acceptable to the tenant. In 
particular, for g=q° , Since the tenant finds 
all /z/2 acceptable, the set of mutually 
acceptable contracts is A(q9) - DE. And 
this set A(q) of cost-sharing contracts ac- 
ceptable to both agents is clearly non-empty, 
because lines L and T are parallel, with a 


5Of course, it is to be understood that for a given 
quit probability q — q^, the only mutually acceptable 
contracts are A(g?) = [i 49), I5 (q9)]. 
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Ti-t+(142)q 
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Tenant Quit 
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common slope of (1- r), and L is higher 
than T by a distance of r > 0? 

To see that with at least one risk-averse 
agent the set of mutually acceptable cost- 
sharing contracts may be empty, suppose 
the tenant is risk neutral but the landlord is 
risk averse. Then the tenant's bebavior is 
still described by (2), (4), and line T in 
Figure 1. However, the value of the innova- 
tion to the landlord is now given by 


(1) (ha,r) = Eu(W) 
= qu(W, + X -1C) | 
*(1- a)u(W, - IC), 


where Wọ is the landlord's initial wealth, 
W —W,- x — IC is his terminal wealth, x = 
[X,0;q,1— q] is the random variable cap- 
turing the output that accrues to the land- 
lord, E is the expectation operator, and 
u(-) is the landlord's von Neuman Morgen- 
stern utility function with u'(-) ^ 0. Assume 
that u"(-) « 0 in order to capture the land- 


91 = q? is one such contract. For it can be seen from 
(0) dnd (2) that for 1-49, V, =q X- pu 0 and 
V, =(1— a9XX - c)» 0. 
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lord's risk aversion. Further, since it is my 
purpose to demonstrate that there are situa- 
tions where desirable innovations get re- 
jected, I shall assume that the landlord's 
utility function is a constant absolute risk- 
aversion utility function, 
u(W)--—ke7", |. k,z»0. 
Since my purpose is to prove an existence 
result, restricting the analysis to a special 
class of utility functions is a harmless as- 
sumption.!? 

The landlord will accept all cost-sharing 
contracts as long as his expected utility with 
the innovation exceeds the utility of his ini- 
tial wealth, 


(7) Eu(W) x u(W;). 


From the definition of risk premium, we 
have 


(8 | u[W-P(a)|] = Eu(W), 


where W=W,+qX—IC is the expected 
terminal wealth, and P(q) is the risk pre- 
mium associated with the gamble. In gen- 
eral the risk premium is a function of W,, 1, 
c, z, X, and q. However, as shown in the 
Appendix, under the constant absolute 
risk-aversion preferences, the risk premium 
is a function only of z, X, and q. Since I 
never consider a change in the output X, or 
in the magnitude of risk aversion z, it is 
legitimate to think of the risk premium as a 
function only of the tenant's quit probability 
q. Further, given an uncertain prospect, the 
risk premium P(q) is strictly positive for a 
risk-averse agent and is higher for a more 
risk-averse agent." From (7) and (8) it now 
follows that the landlord is willing to offer 
contracts for the adoption of the innovation 
for which u[W — Pq) > u(W,), or W- 


10This was pointed out to me by the referee, who 
also brought my attention to the fact that the constant 
absolute risk-aversion utility function has the added 
feature that it makes the risk premium independent of 
l, a feature that permits a very simple proof of the 
central result of this paper. 
USee Kenneth Arrow (1965) and John Pratt (1964). 
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P(q) z W,, which, in turn, is tantamount to 
qX — IC > P(q). That is, the set of all con- 
tracts the landlord finds desirable is 


(3) L-[r l< 7 ana 


P(qg)z0, r»0, 0<asih. 


The equality in (3) determines /, = 
—[P(q)/C]+(+r)q, which is the maxi- 
mum cost share the landlord is willing to 
bear given the quit probability q. And this 
may well be smaller than the minimum 
landlord’s cost share (see (4)) that the ten- 
ant is willing to accept [Le, /p—-—r- 
(1+ r)q]. This will indeed be the case if and 
only if P(g)/ C >r or, equivalently, P(g) > 
CX — C). Therefore the set of mutually ac- 
ceptable cost-sharing contracts is 


(9) A=LNT=$¢ if P(q)>(X-C). 


This conclusion is not at all unreasonable. 
It merely says that if the landlord is risk 
averse and if his risk premium exceeds the 
social profit of the innovation, then while he 
will offer the tenant innovation-cost-sharing 
contracts, none of these contracts will be 
good enough for the tenant, despite the 
innovation being such as to add more to 
property value than the cost of the innova- 
tion. All that remains to be proved now is 
that P(g)» (X — C) is possible. And this 
condition is met by a C sufficiently close to 
X since, as shown in the Appendix, for the 
constant absolute risk aversion class of util- 
ity functions, P is not a function of C. 

Additional insight is gained from Figure 
1. The graph of /,(9) 7 -[P(q)/C1* 1 
r)q is presented in Figure 1 as the curve L. 
The particular shape of the curve L derives 
from the properties of P(q) established in 
the Appendix: /,(0) — 0, /,()=(1+ r), Fa) 
20 for all q, and / [,(q) is strictly convex 
since P(q) is strictly concave and //(q) — 
— P'(q)/ C. For a quit probability such as 

q°, the tenant accepts all contracts / 2/7, 
while the landlord is only willing to offer 
contracts | < Í? z» SO that for this quit proba- 
bility there exists no mutually beneficial in- 
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novation-cost-sharing contract. In fact, with 
the help of Figure 1, a stronger assertion 
can be made. For every (X,C), if for some 
tenant quit probability q it is the case that 
P(q)»(X— C), then there exists an in- 
terval (g,,45) such that the innovation is 
rejected for all g €(q,, 42), while it is ac- 
cepted for all sufficiently large or suffi- 
ciently small values of g €[0,q¢,]U[qp, 1]. 
This conclusion clearly demonstrates that 
there are circumstances in which, under the 
tenurial laws, both a landlord with a single- 
tenant-occupied rental property and the 
tenant will find that it is not worth their 
while to innovate, so that suboptimal invest- 
ment and innovation, and hence technologi- 
cal stagnation, will be the outcome.'? It may 
also be noted that, since for a given gamble 
the risk premium is higher for a more risk- 
averse agent, it will be the destiny of the 
relatively less risk-averse landlords and their 
tenants to enjoy the fruits of innovation, 
while, under the same tenurial laws, the 
relatively more risk-averse landlords and 
their tenants will not reach any mutually 
beneficial agreements regarding investment 
in property improvement. 


H. Conclusion 


There are several reinforcing reasons for 
technological stagnation in less developed 
countries. Some of these reasons are associ- 
ated with the peculiar nature of the tenurial 
laws in these countries, and Kaushik Basu 
(1989) has successfully explained technolog- 
ical stagnation as an outcome of the tenurial 
laws in the case of those landlords who own 
several properties. A large proportion of 
landlords in these countries, however, own 
only a single property. I have explained 
technological stagnation as an outcome of 


While in this paper I have developed the argu- 
ment for the case of a risk-averse landlord and a 
risk-neutral tenant, an identical argument is valid if the 
landlord is risk neutral and the tenant risk averse. 

1f landlord A is more risk averse than landlord B, 
then P,(q)> Pala). If P4(q)/C > r, so that A and his 
tenant do not adopt the innovation, then for a suffi- 
ciently less risk-averse B, P5(q)/ C <r, so that B and 
his tenant decide to jointly finance the innovation. 
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the tenurial laws in the single-property- 
holding case based on the work on decision 
making under uncertainty by Kenneth 
Arrow (1965) and John Pratt (1964); the 
argument I have developed is equally valid 
for the multiple-property-holdings case and, 
in that case, further reinforces Basu’s ad- 
verse selection argument for stagnation. 

Not only do several less developed coun- 
tries suffer from technological stagnation, 
but there seem to be institutions and laws in 
place that tend to perpetuate this stagna- 
tion. Tenurial Jaw amendments in favor of 
those who have the capacity to innovate are 
called for; relatively minor amendments may 
well foster a climate of tremendous techno- 
logical progress. 


t 


APPENDIX 
In this appendix I establish those properties of the 
risk-premium function that are used in the proof of the 
central result of this paper and that underlie the con- 
struction of Figure 1. The notation here is the same as 
in the text. 


Under the constant. absolute risk-aversion utility 
function 


(A1) u(W)--—ke7", — k,z»0, 
from (8) we have 


(A2) — ke~ [Wo t4 X-IC-P] 
seus k[ ge~2(¥o+X-!C) + (1- q)e 790-10]. 
In (A2) the LHS is u(W — P) and the RHS is 
Eu(W) = qu(W, X - 1C) - (1 a)u(W, — IC). 
From (A2) we have 
(A3) e *4X-P) = qge™7X + (1+ q). 


From (A3) it can be concluded that the risk pre- 
mium P is a function only of z, X, and q and not a 
function of Wo, l, or C. (A3) also permits an explicit 
solution of P. Taking the natural logarithms of (A3), 
straightforward manipulation yields 


(A4) P(q,X,z)- Inf ge? + (1- a)]* eX. 


Since in this paper the magnitude of risk aversion z 
and the output X are treated as constants, let z= X = 
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1. Then from (A4) we have 
(A5) P(q)=in[{(i-q)+¢/elt+a. 


From (A5) it may first be noted that P(0) — In(1) - 0 
and that P(1) — In(e^!)-- 1 = 0. Further, 


(e-1)) 
(ek. . NDS eve] ^ 


and 


gie y 
(n ex«--| rus] iu 


(A7) clearly demonstrates that the risk premium is a 
strictly concave function of q. And from (A6) we have 
P'(0—1/e»0, P(1)-2—e«0, and PT[1/(e—1)]- 
0. Hence P(4) starts out equal to zero at q — 0, in- 
creases initially as q rises, then decreases, and finally 
equals zero again at q = 1. 
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Quality Distortion by a Discriminating Monopolist: 
Comment 


By SHABTAI DONNENFELD AND LAWRENCE J. WHITE” 


Assume that a monopolist makes 
men’s suits, and that he makes only 
one size of suit. This is absurd behav- 
ior, but the picture of the sadistic 
monopolist who disregards consumer 
desires has often made fugitive ap- 
pearances in the literature... . 

The general conclusion that I wish 
to draw is that a monopolist who does 
not cater to the diversities of his buy- 
ers’ desires will suffer a substantial 
decline in his profits. 

[George J. Stigler, 1964, pp. 60-61] 


A number of recent theoretical papers 
have shown Stigler’s conjecture to be incor- 
rect! A monopolist that offers multiple 
product quality levels to multiple consumer 
types may well find it worthwhile to distort 
some quality levels or not provide them at 
all. This distortion can enhance the 
monopolist's profits because it prevents or 
discourages some customer types, who have 
a high willingness to pay for some quality 
levels, from switching to other quality levels 
that yield lower profits for the monopolist. 

In a recent paper, Padmanabhan Sri- 
nagesh and Ralph M. Bradburd (1989) show 
that this quality distortion can involve 
higher-than-optimal quality levels as well as 
quality deterioration. This finding closely 
parallels the results previously demon- 
strated by Shabtai Donnenfeld and 
Lawrence J. White (1984, 1988). 

The Srinagesh-Bradburd discussion of 
their model neglects any efforts at compara- 


*Stern School of Business, New York University, 
and Graduate School of Business, Tel Aviv University; 
and Stern School of Business, New York University, 
New York, NY 10006. 

‘These papers include Lawrence White (1977); 
Michael Mussa and Sherwin Rosen (1978); Eric Maskin 
and John Riley (1984); Shabtai Donnenfeld and 
Lawrence White (1984, 1988); J. J. Gabszewicz et al. 
(1986); David Besanko, Shabtai Donnenfeld, and 
Lawrence J. White (1987, 1988); and Padmanabhan 
Srinagesh and Ralph M. Bradburd (1989). 
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tive statics. We believe that these compara- 
tive statics are interesting; they indicate, for 
example, that the comparative numbers of 
each customer type are an important influ- 
ence on the degree of quality distortion that 
occurs and on the prices that the monopolist 
charges for each quality level. The remain- 
der of this paper provides and discusses 
these comparative statics results as well as 
some policy-related issues. 


I. A Review of the Model 


The Srinagesh-Bradburd and Donnen- 


.feld-White models are structured as 


follows.” 

A consumer is interested in buying (at 
most) only one unit of a specific product 
that can be produced in a continuous array 
of qualities that are indexed by Q, with 
higher Q implying higher quality; the re- 
mainder of the consumer’s consumption is 
of a composite good, X. The consumer pays 
a price P for the unit of the good of quality 
Q. The consumer’s indirect utility function 
can be represented by U(Q,P), and the 
lowest level of utility to which he can be 
forced (at which point he will consume only 
X) is U(0,0).? 

There are two types of consumers, a and 
b, numbering N, and N,, respectively. For 
all (Q, P) we assume that U(Q, P) « 
U,(Q, P); that is, type b consumers have a 
higher absolute willingness to pay for qual- 
ity. 

The product is produced by a monopolist 
at a unit cost of c(Q), with c'(Q)» 0 and 
c"(Q) 2 0. The monopolist knows the utility 
functions of the two types of consumers, 


?To ease the comparability of our results with the 
Srinagesh-Bradburd model, we adopt their notation 
and formulation rather than that used in our earlier 
model (Donnenfeld and White, 1984, 1988). 

This specification of the utility function assumes 
away income effects. 
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and he knows their numbers; but he does 
not know their identities, and he cannot 
keep them separated through any physical 
or legal means; that is, each consumer type 
is able to buy any product quality that is 
offered on the market at any prevailing 
price. 

The monopolist’s problem, then, is to 
maximize 


(1) T= NAP,- ¢(Q,)} 


+ NP, - c(Q,)) 
subject to 
(2) U,(Q,; ps U,(Q,,P,) 0 
(3) U,(Q,,P,) -U,CQ,, P,)=0 
(4) U,(Q,,P,) 2 U, (0,0) 2 
(5) U,(Q,, P) - U,(0,0) > 0. 

If the monopolist chooses to supply goods 
to both consumer types, the only binding 
constraints are (3) and (4), that is, the in- 
centive compatibility constraint (ICC) and 
the individual rationality constraint (IRO), 


respectively. The monopolist’s profit maxi- 
mization problem can be written as . 


Max II = N,{U,(Q,) - c(Q,)) 


(Qar Qo) 
+ N {U (Q) - U,(Q,) 


* U,(Q,) = c(Q;)} s 


The necessary and sufficient conditions 
are 


(6 Ho(Q.( 


QiCN,,8,,y,, 0), 


N,,N,,0,,0,, y, Ya 0). 


Nas Nos Yas Yoh) =0 


(7) IIo( Q4 N,,0,. y, ) ? N,; $) =0. 


In (6) and (7) we explicitly depict the depen- 
dence of the endogenous quality variables 
on parameters that shift the utility and cost 
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functions; 0; provides a constant increment 
to U,, and y; affects only the marginal will- 
ingness to pay for quality. Similarly, c = 
Co + $c(Q). Note that Q, depends on all 
the exogenous parameters, while Q, de- 
pends only on the parameters directly re- 
lated to type b consumers. 

As Srinagesh and Bradburd (1989) and 
Donnenfeld and White (1984, 1988) demon- 
strate, the solution to this problem depends 
on the marginal valuation of quality (MVQ) 
of the type a and type b consumers. If, for 
all Q, MVQ, < MVQ,—that is, if absolute 
and marginal willingness to pay by the two 
consumer types are positively correlated (the 
"standard" assumption)—then the 
monopolist maximizes profits by distorting 
downward the quality level of the lower- 
quality product. If, however, MVQ,> 
MVQ,—if absolute and marginal willing- 
ness to pay for quality are negatively corre- 
lated (the “nonstandard” assumption)— 
then quality distortion occurs through an 
increase in the quality level of the higher- 
quality product.? 

One other point is worth noting. As White 
(1977) and Donnenfeld and White (1984, 
1988) demonstrate and as Srinagesh and 
Bradburd (1989, fn. 5) note, the incentive 
for the monopolist to distort quality may be 
strong enough that he reaches a boundary 
solution in which only a single quality level 
is produced for the monopolist’s dominant 
(Le., most profitable) customers; the cus- 
tomers who were potentially subject to the 
quality distortion find that they are not pro- 
vided with any suitable quality level whatso- 
ever. 


‘Quality is distorted in the sense that the quality 
level provided to group a is different from the quality 
level that a competitive market would provide to group 
a or that a monopolist that could keep the two groups 
apart would provide to group a. 

See Donnenfeld and White (1984, 1988) for some 
economic interpretations of this assumption. 

See Stephen W. Salant (1989) and Hal R. Varian 
(1989) for a unifying formal treatment of the necessary 
and sufficient conditions that induce internal or corner 
solutions in atemporal and intertemporal models of 
price discrimination via self-selection. 


VOL. 80 NO. 4 


DONNENFELD AND WHITE: QUALITY DISTORTION 943 


TABLE 1—THE COMPARATIVE STATICS OF THE “STANDARD” CASE, 
IN WuicH THE Low-Qua.ity Propuct Is DISTORTED DOWNWARD 
(POSITIVE CORRELATION OF ABSOLUTE AND MARGINAL WILLINGNESS TO Pay) 


























aN, dN, 

E dQ, + EE 

dP, * = 

dQ, 0 0 

dP, > + 

Likelihood of 

Specializing 

on the High- 

Quality Product - + 





d6, dô, dy, dy, dẹ 
0 0 + z = 
+ 0 + i - 
0 0 0 + = 
+ 0 2 + = 
= + au + - 





TABLE 2— THE COMPARATIVE STATICS OF THE "NONSTANDARD" CASE, 
IN WHICH THE HIGH-QuaLity PRopucr Is DISTORTED UPWARD 
(NEGATIVE CORRELATION OF ABSOLUTE AND MARGINAL WILLINGNESS TO PAY) 

















dN, dN, dð, dê, dy, dy dd 
dQ, - + 0 0 + + - 
dP, - + + 0 + + - 
dQ, 0 0 0 0 0 + & 
dP, E + + 0 + + - 
Likelihood of 
Specializing 
on the High- 
Quality Product - + 7 4 = + = 
II. Comparative Statics where IIo, <0 by concavity of the profit 


The comparative statics for the “stan- 
dard" (positive correlation) and “non- 
standard” (negative correlation) cases are 
provided in Tables 1 and 2, respectively. 

For illustrative purposes, we shall provide 
the explicit derivation of the effects . of 
changes in the number of consumers for the 
standard case—positive correlation be- 
tween absolute and marginal willingness to 
pay for quality. This derivation explains how 
we arrive at the first two columns in Table 
1. Differentiation of (6) and (7) with respect 
to N, and N,, respectively, yields 





ðQ, Io 
8 ee M ed) 
aN, Hoo, 
9Q, ts Ilo, 0 
ON, IIo o, 
Q, 9Q, 
(9) e 


function and Ilo,v, - U/(Q,) - C(Q,) is 
positive by the first-order condition and 
IIo 7 —lUs(Q,)- U;(Q,)] is negative by 
thé "standard assumption. 

Further, differentiation of the ICC (3) 
and IRC (4) with respect to N, and N, and 


using (8) and (9) yields 





10 oP, 5 dQ, 
(10) an, ^ CQ) any > % 
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~ [Vs (Q4) -Ui (Qa)] = 


d 
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(12) a = Us Qa) ae <0, and 


ðP ðQ 
03) ab TUCO) aN, 





8Q, 
J [Us(Q,) z U:(Q,)] aN, 


., 90, 
z [Us(Q.) m U;(Q,)| aN, 





>0. 


Some interesting results emerge from 
these comparative statics. 

(i) First, the relative numbers of the two 
consumer types matter. As the relative 
number of the high absolute willingness to 
pay consumers (N,) increases, the quality 
distortion experienced by the other group 
worsens. In the standard case—positive cor- 
relation—the quality provided to type a 
consumers deteriorates, and its associated 
price declines (see Table 1). In the nonstan- 
dard case—negative correlation—the qual- 
ity provided to type a consumers is in- 
creased, and the price they pay.rises (see 
Table 2). Furthermore, in both cases it is 
more likely that the monopolist will special- 
ize in producing products only for those in 
group b as their number increases. The 
intuition behind these results is straightfor- 
ward: As the relative number of the high 
absolute willingness to pay (high profit) cus- 
tomers (N,) increases, the monopolist can 
increase his overall profits by worsening the 
alternative (Q,). He thereby loses some 
profits in his sales to group a, but he gains 
even larger profits by being able to raise his 
price (P,) to group b. 

(ii) Second, changes in either group's ab- 
solute willingness to pay have no effects on 
quality distortion, though they do affect the 
location of the boundary solution and there- 
fore affect the likelihood that the monopolist 
will specialize on producing products only 
for the b group. Also, an increase in group 
a’s absolute willingness to pay will mean an 
increase in the prices that both groups pay. 
The intuition here is that an increase in a's 


SEPTEMBER 1990 


absolute willingness to pay for quality means 
that the monopolist can charge. a higher 
piice to this group; this higher price means, 
in turn, that the (Q,, P,) alternative is still 
less attractive to group b, so the monopolist 
can now charge a higher price (P,) to that 
group. 

(iii) Finally, an increase in the marginal 
willingness to pay by group b in the stan- 
dard case causes the quality distortion (de- 
terioration) experienced by group a to 
worsen and increases the likelihood that the 
monopolist will specialize on group b. The 
price charged to group b increases, partly 
because of the pure effect of b's increased 
marginal willingness to pay and partly be- 
cause the monopolist has worsened the al- 
ternative (Table 1). However, in the non- 
standard case, an increase in the marginal 


willingness to pay by group b results in 


further distortion, albeit a too high a quality 
is provided to these consumers. Group a 
buyers end up paying a higher price for the 
same quality (Table 2). 

With a suitable reinterpretation of the 
variables presented in Tables 1 and 2, we 
can use these comparative statics briefly to 
examine a few policy issues. For example, 
the imposition of a production tax is the 
equivalent of an increase in the cost param- 
eter @. Hence, for both the standard and 
nonstandard cases, levying this tax of d on 
a multiproduct monopolist results in an 
overall decline in quality and prices of the 
goods provided to all consumers. 

Further, the imposition of a binding im- 
port quota on goods of a specific quality 
type supplied by a multiproduct foreign 
monopolist is the equivalent of a decrease 
in the number of consumers for that quality 
type. Hence, for the standard case, a quota 
on low-quality (high-quality) imports results 
in the monopolist’s downgrading (upgrad- 
ing) of the lower-quality imports; for the 
nonstandard case, a quota on high-quality 
(low-quality) imports results in the upgrad- 
ing (downgrading) of the higher-quality im- 
ports.’ 


7See Shabtai Doanenfeld (1988) for a more elabo- 
rate discussion on trade policies. 
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II. Conclusions 


Microeconomic models involving endoge- 
nous quality continue to provide a rich vein 
for theoretical research and policy-oriented 
issues. As we have shown, the comparative 
statics analysis of the Srinagesh-Bradburd 
(1989) and Donnenfeld-White (1984, 1988) 
models adds to the insights that can be 
gained from these models. 
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Comparative Productivity: Comment 


By STEVEN ROSEFIELDE* 


Abram Bergson (1987a) has recently at- 
tempted to show that socialist economic sys- 
tems are underproductive and undereffi- 
cient.! Close evaluation, however, reveals 
that this conclusion is too sweeping. Using 
the USSR as a case in point, this comment 
demonstrates that .Soviet productivity 
growth using Bergson's data (1987b) and 
methods (Bergson, 1978) matched Western 
Europe's and exceeded the U.S. achieve- 
ment in 1960—75. Moreover the pronounced 
duality between Soviet industrial and nonin- 
dustrial productivity discovered by Bergson 
for 1960 appears to have increased there- 
after through 1975, according to the sta- 
tistics of the United States Central Intel- 
ligence Agency (CIA) similar to those em- 
ployed in Bergson (1987b). This duality is 
confirmed further by a separate calculation 
of comparative machine-building productiv- 
ity. Using CIA sector of origin data that are 
consistent with Bergson's counterpart end 
use data (Imogene Edwards et al, 1979) 
and his factor share weights (Bergson, 1978), 
Soviet machine-building productivity is cal- 
culated to be 79 percent of the American 
level in 1975 (Table 1). Revised data for the 
same year, published by the CIA in 1982, 
raise this figure to 88 percent. These find- 
ings clearly suggest that socialist economies 
need not be comprehensively underproduc- 
tive (underefficient). Although they do not 
dispose of many issues of legitimate con- 

. troversy? and pose serious questions about 


*The author is at the University of North Carolina, 
Campus Box 3305, Gardner Hall, Chapel-Hill, NC 
27599-3305. 'The author wishes to thank three anony- 
mous referees for their constructive suggestions and 
the Earhart Foundation for its support. 

‘Bergson, 19872, p. 355. (Compare Bergson, 1948, 
Pp. 235—236.) For a discussion of the distinction be- 
tween underproductivity and underefficiency, see Berg- 
son (1978, pp. 68-74). 

2They do not take full account of relative backward: 
ness. Presumably, Soviet productivity growth should 
have exceeded, rather than matched the West Euro- 
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the potential shortcomings of the adjusted 
data on which Bergson relies, they do 
demonstrate nonetheless that the positivist 
debate over the comparative productivity 
(efficiency) of socialism even in the Soviet 
case still cannot be completely laid to rest.? 


L Underproductivity 


Specialists have never doubted that the 
ageregate productivity of the American 
economy surpasses the Soviets’. A clear pic- 
ture of the dimensions of the U.S. advan- 
tage, given the inscrutability of Soviet statis- 
tics, however, did not emerge until the late 
1960’s when Bergson published Planning 
and Productivity Under Soviet Socialism. His 
principal findings, as perfected in his later 
works, are reproduced in Table 2. They are 
expressed as < series of “coefficients of fac- 
tor productivity” for 1960 that compare fac- 
tor productivity in Western Europe and the 
USSR with the United States on the explicit 
assumption that all countries possess a com- 
mon linear homogeneous, Cobb- Douglas, 
transnational production function.* Output 
is measured by gross domestic material 


pean experience. They ignore welfare losses caused by 
the unresponsiveness of supply to consumer demand. 
And it can be argued, contrary to Bergson’s judgment, 
that they seriously underestimate the effects of hidden 
inflation (Bergson, 1987c, p. 420). 

Bergson carefully elaborates the theoretical and 
empirical limitaticns of his empirical calculations, in- 
cluding issues of technology, economies of scale, devel- 
opment stage, milieu, and degrees of freedom. He 
acknowledges that his findings are not definitive, but 
argues for their significance by observing that “Such a 
qualified potentiality, however, is not what proponents 
of socialism have usually claimed for that -system” 
(Bergson, 1987a, p. 356). Differences in the quality of 
land, labor, and capital that are not captured in the 
aggregate input data should also be added to Bergson’s 
inventory of shortcomings. 

The appendix to this paper (available upon re- 
quest) discusses coefficients of factor productivity. 
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TABLE 1—CoMPARATIVE INTERNATIONAL AGGREGATE, INDUSTRIAL, AND 
MACHINE-BUILDING EFFICIENCY iN 1960 AND 1975 








Summary Estimates: Coefficients of Factor 
Productivity (Valued at U.S. Dollar Prices; 
Index U.S. = 100) 











GNP Industry MBMW 
Country 1960 1975 1960 1975 
United States 100 100 100 100 
Western Europe 60 85 65 - 
USSR 41 55 58 79 





Sources: Tables 2, 3, and A12 (in statistical appendix, available upon request). 
Method: Coefficients of factor productivity following Bergson are computed according 
to the formula 


a= Qi») [(G 7k) Qro Q7) 77^]. 


where the ith subscript refers to the Soviet Union or Europe and the jth to the 
United States; y, k, v and / are output, fixed capital, inventories, and labor, and the 
exponents are factor shares based on American dollar weights. Inventories are 
omitted in computing coefficients of MBMW productivity. @ values for Western 
Europe are simple averages of the figures for France, Germany, Italy, and the UK. 
GNP is defined as gross material product, excluding selected final services: health 
care, education, government administration, defense, and housing. Industry is defined 
broadly and includes manufacturing, mining, power, construction, transport and 
communications, and trade. Statistics in columns 1 and 3 are Bergson's; those in 
column 2 are derived directly from Bergson's data; those in column 4 are Rosefielde's 
based on the CIA's estimates of Soviet MBMW output and growth provided in CIA 
(1975) and Edwards et al. (1979, p. 273). See sources and the appendix for further 
details. 

Notes: 'The MBMW statistics reported above have not been adjusted for hidden 
inflation. For alternative estimates see the text, fn. 13, and Tables A12 and A16 in the 
statistical appendix (available upon request) Weapons constituted 36 percent of 
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Soviet MBMW at established ruble prices in 1970. 


product (except in Soviet/American com- 
parisons, where gross national material 
product is employed) exclusive of output 
originating in selected final services: health 
care, education, government administration, 
defense, and housing. Nonfarm labor is ex- 
pressed in adjusted man-hours. Both labor 
and reproducible fixed capital have the same 
coverage as output. 

The coefficients of factor productivity re- 
veal that both labor and combined factor 
productivity were significantly lower in the 
Soviet Union than in the West. Given the 
uncertainties that beset international com- 
parisons, this finding is not conclusive, but it 
is persuasive because the principal norm 


adopted favors the Soviet Union. Soviet 
factor productivity is computed in dollars, a 
procedure that, as is widely understood, 
overstates its relative position. 

In 1960 Soviet labor productivity was only 
31 percent of the U.S. level, and combined 


°The ruble data Bergson uses correspond closely 
with official Soviet statistics and are not significantly 
adjusted for hidden inflation. According to Igor Bir- 
man (1983) the CIA dollar-ruble ratios Bergson em- 
ploys overstate the dollar value of Soviet output. The 
CIA itself has revised the estimates on which Bergson's 
analysis depends, sharply downward (CIA, 1984). It 
follows directly therefore that insofar as Bergson's esti- 
mates err, they probably do so in the Soviet's favor. 
(Compare Rosefielde, 19882.) 
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TABLE 2—-COEFFICIENTS OF FACTOR PRODUCTIVITY 
(U.S. = 100) 











Gross Material Product 
per Unit of Factor 


Gross Material (Labor and 
Product per Reproducible Capital) 
Employed Worker Inputs 
Country 1960 1975 1960 1975 
United States 100 100 100 100 
France Si 94 63 96 
Germany 51 91 65 92 
United Kingdom 49 69 64 73 
Italy 34 71 47 77 
USSR 31 58 41 55 





Sources: Tables A1 and A2 (available upon request) and Bergson (1978: table 6.1, pp. 
76-77; table 7.1, p. 93; appendix tables 11, p. 236, and 18, p. 241). 

Data: Gross material product represents gross domestic product exclusive of output 
originating in selected final services: health care, education, government administration, 
defense, and housing. Dollar estimates are valued at prevailing 1960 and 1975 U.S. 
prices. U.S. and Soviet data are consistently defined. Major inputs refer to labor and 
reproducible fixed capital. Employment is adjusted for differences in nonfarm hours; 
capital is calculated as the average of two relatives, one with fixed capital including gross 
of depreciation and the other with such capital including net of depreciation. 

Method: The coefficients of factor productivity for 1960 are reproduced directly from 
Bergson (1978). The coefficients of factor productivity for 1975 are computed by the 
author, using the data provided in Bergson's unpublished statistical appendix (Bergson 
1987b), according to the formula 


= (i/y)/[ki Jk) Qu £o) Q0) 7777] 


where the ith subscript refers to the Soviet Union, or a specific European country, and 
the jth to the United States; y, k, v, and / are output, fixed capital, inventories, and 
labor, and the exponents are factor shares based on American dolar weights. 

Labor, reproducible fixed capital inputs, and inventories are aggregated logarithmi- 
cally with weights of 0.7415, 0.2278, and 0.0307, respectively, reported by Bergson for 
American factor income shares in 1960 (see Bergson, 1978, pp. 71, 95 fn. 3, and appendix 
table 18, p. 241). Bergson (1978) includes inventories in his calculations for 1960 (see 


appendix table 12, p. 238). 


labor and capital productivity was 41 per- 
cent. This poor showing may have diverse 
technical causes, including production func- 
tion peculiarities, unfavorable factor pro- 
portions, and economies/diseconomies of 
scale. But as Bergson (1978) himself notes, 
to the extent that production functions in 
the countries sampled approximate the lin- 
ear homogeneous, transnational Cobb- 
Douglas standard, the low Soviet produc- 
tivity (efficiency) appears to be primarily 
attributable to the shortcomings of its eco- 
nomic system. 

Bergson's most recent contribution is 
aimed at further illuminating this important 
issue. He applies the same conceptual appa- 


ratus and data conventions to assess com- 
parative factor productivity for a later year, 
1975, but alters his approach in two ways. 
He expands his sample to include additional 
capitalist and socialist countries and esti- 
mates the systems underproductivity (un- 
derefficiency) effect: directly, using various 
econometric techniques. His regressions in- 
dicate that socialist countries are underpro- 
ductive and/or underefficient, and that the 
Soviet Union is the least efficient socialist 
country followed by Hungary, Poland, and 
Yugoslavia. 

As with most econometric research, it is 
easy to fault these results. The assumption 
of a transnational Cobb-Douglas production 
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function is implausible.$ The sample size is 
small. With the exception of Spain, the lev- 
els of development between socialist and 
capitalist countries diverge noticeably. The 
unifying factors that distinguish Soviet, Pol- 
ish, Hungarian, and Yugoslavian socialism 
from socialist democratic arrangements in 
West Europe and presumably cause their 
underefficiency are neither clearly explained 
nor convincingly defended. And the under- 
lying conversion of Edwards’ ruble esti- 
mates with the CIA’s dollar-ruble ratios is 
highly suspect.’ Nonetheless, on the basis of 
Bergson’s prior studies, it seems unlikely 
that reasonable allowance for these factors 
would significantly alter his principal con- 
clusion. Judged by gross material product, 
the Soviet Union and its fellow socialist 
economies of the East are underproductive 
and probably underefficient as well after 
proper adjustment is made for levels of eco- 
nomic development and sundry other con- 
siderations. 

This aggregate assessment is important, 
but its scope is more restrictive than it 
seems. Bergson’s econometric calculations 
pertain exclusively to the year 1975 and fail 
to illuminate whether the socialist 
economies of the East are overcoming their 
relative international underproductivity (un- 
derefficiency). Also, they are likely to con- 
ceal sectoral asymmetries. The Russian 
economy has manifested a distinct duality 
since the late nineteenth century. Industry 
in general and heavy industry in particular 
have been markedly more productive than 
other components of gross material prod- 
uct, and it would be surprising if this has 
ceased to be the case. Although the Soviet 
system as a whole almost certainly is under- 
efficient, it might still be successful in some 
important activities. 


The close correspondence between the stage of 
Soviet and Italian development and their aggregate 
productivity in 1960 had suggested that the transna- 
tional production function was understating Soviet in- 
efficiency. This is no longer the case. Compare their 
coefficients of factor productivity for 1975 in Table 2. 

"See fn. 5, Bergson (1987a, p. 347), and Bergson 
(1987b, table 3, p. 3). 
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Evidence supporting the first hypothesis 
is provided in Table 2. It presents coeffi- 
cients of factor productivity computed with 
data provided in Bergson (1987b), according 
to Bergson's original methodology, mea- 
sured with respect to employment and other 
combined factors for the years 1960 and 
1975. The improvement in relative Soviet 
factor productivity (efficiency) during this 
period is remarkable. Relative gross mate- 
rial product per employed worker is 87 per- 
cent higher in 1975 than in 1960. The gain 
in relative gross material product per unit of 
factor (labor and reproducible capital) in- 
puts is less pronounced but still an impres- 
sive 34 percent. The magnitudes of these 
advances are in line with the European 
mean, which is less than might have been 
expected given the Soviet Union's relative 
backwardness.? This shortcoming not with- 
standing, however, the USSR did reduce its 
overall relative underproductivity (under- 
efficiency) vis-à-vis the West. 

The duality of the Soviet economy also 
manifests itself in Bergson's data. Table 3 
reproduces his estimates of gross industrial 
product per employed worker and per unit 
of factor inputs in selected countries in 1960. 
They are calculated as before but omit agri- 
culture as well as selected services. Relative 
Soviet factor productivity is clearly higher in 
industry than in agriculture. Instead of con- 
spicuously lagging the pack, Soviet indus- 
trial factor productivity was more or less on 
a par with Italy and the United Kingdom 


*Bergson's econometric estimates could not be ap- 
plied to his 1960 data set because the required land 
statistics are unavailable. 

?Bergson (1987a) shows that Soviet static factor 
productivity is less than it should be after adjustment 
for the level of economic development. The dynamic 
results are mixed, although even here it can be argued 
that Bergson's results are confirmed with respect to 
Italy and, perhaps, England as well. See Table 2. As a 
consequence they qualify Bergson's finding that the 
Soviet Union is always underefficient, adjusted for the 
level of development. Bergson (1978) addresses the 
subject of dynamic productivity in his table 9.4 (p. 158). 
His results, which cover the period 1955-70, are simi- 
lar to those reported here. Soviet productivity growth 
1960-75, however, was better relative to its Western 
rivals than in the time period Bergson studied. 
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TABLE 3—Gnoss INDUSTRIAL PRODUCT PER EMPLOYED WORKER AND PER UNIT 
OF Factor Inputs, SELECTED COUNTRIES, 1960 








Gross Industrial 





Gross Industrial Product 
per Unit of Factor (Labor 


Product per and Reproducible Capital) 
Country Employed Worker Inputs 
United States 100 100 
France 60 71 
Germany 54 69 
United Kingdom 48 61 
Italy 46 60 
USSR 50 58 





Source: Bergson (1978, table 7.5, p. 108). 


Data: Gross industrial product represents essentially the gross output originating in 
manufacturing, mining, power, construction, transport and communications, and trade. 
In the calculation of output per worker and per composite unit of factor inputs, 
reference is to employment and capital stock used in the same sector. Valuation of 
output and inputs is in 1960 U.S. dollars. Employment is also adjusted for variations in 
the length of the workweek. Combined inputs are aggregated logarithmically accord- 
ing to the Cobb-Douglas specification. The method for computing the coefficient of 
factor productivity (column 2) is specified in Table 2. 


and not glaringly below France and Ger- 
many. 
'  Bergson has not published similar statis- 
tics for 1975, but official Soviet and CIA 
data can be used to show that the sectoral 
duality of the Soviet economy persists and 
has probably intensified. Table 4 reports 
estimates of aggregate and industrial factor 
productivity growth 1965-85. The output 
series are CIA estimates, as are the statis- 
tics on employment and capital. Industry is 
defined exclusive of construction, transport, 
communications, and trade. Inputs are com- 
bined according to the Cobb-Douglas speci- 
fication with Bergson’s income share 
weights. Industrial total factor productivity 
measured in this way exceeds the aggregate 
rate for the entire period 1965-85 and never 
falls below it for each quinquennial subpe- 
riod. The same comparison computed by 
using official industrial growth data suggests 
a more pronounced differential that exceeds 
60 percent (Table A17 in statistical ap- 
pendix, available upon request). Although 
analogous estimates for the United States 
and the West are not at hand, this disparity 
suggests that the Soviets not only signifi- 
cantly diminished their relative aggregate 


underproductivity, but may have reduced 
their industrial underproductivity as well.!? 

Industry is not a homogeneous activity. 
The Soviets have long accorded higher pri- 
ority to producer durables than nondurable 
and durable consumer goods. This prefer- 
ence is reflected in the rapid growth of 
producer durables! and in the relatively 
high level of machine-building productivity 
displayed in 1975 computed with the aid of 
CIA sector of origin statistics that are con- 
sistent with the counterpart end use data 
that Bergson employs and the methods he 
utilized in his earlier studies. This figure 
reported in Table 1 is 79 percent of the 
American standard.’? An alternative esti- 
mate using revised CIA sector of origin 
statistics for the same year raises the figure 
to 88 percent, while other estimates based 


Compare Bergson, 1978, table 9.5, p. 158. 

MSee Bergson, 1987c. 

P1 the disparity between industrial and aggregate 
productivity reported by Bergson for 1960 (Table 1) is 
extrapolated to 1975, industrial productivity would be 
78 percent of the U.S. level; 


[n 


~r 
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TABLE 4—ESTIMATES OF Soviet Facror Propuctiviry GrowTH 1965-85 
CIA Statistics /Bergson’s Weights 

Measure 1965-70 1970-75 1975-80 1980-85 1965-85 
Gross 

National 

Product 4.9 3.1 2.3 2.2 3.1 
Combined Inputs 3.8 3.8 3.1 2.5 3.3 

Work Hours 2.0 1.7 13 0.7 1.4 

Capital 74 8.0 6.9 6.3 74 
Total 

Factor 

Productivity 11 —0.7 —0.8 -—0.3 —02 
Industrial 

Output 6.0 5.7 2.7 2.3 4.2 
Combined Inputs 4.9 7.8 3.4 2.6 3.6 

Work Hours 3.1 1.5 14 0.5 1.6 

Capital 8.8 8.7 7.1 7.0 8.0 
Total 

Factor 

Productivity Li 1.9 —0.7 —0.3 0.6 


Compound Annual Rates of Growth 


Sources: Joint Economic Committee of Congress (1986, table 4, p. 80, and table 5, 
p. 81); Bergson (1978, appendix table 11, p. 236). 

Method: Factors are combined, following Bergson, according to the Cobb-Douglas 
specification (see Bergson, 1978, pp. 159-60, fn. 8). The income elasticities (shares) 
inputed to capital (0.328 GNP, 0.317 industry) are provided in Bergson, (1978, 
appendix table 11). Labor is 0.672 GNP, and 0.683 industry. The capital weights 
applied measure net reproducible capital services, assuming that the CIA’s series that 
is not defined grows proportionally with the net stock. Similar results are generated 
using Bergson’s gross, enterprise capital stock weights (1978, appendix table 22, 
p. 245). Both the input and output data are taken from CIA sources. Total factor 
productivity growth for aggregate output and industry are computed with the formula 


6=y— [ak 4 (1— a)i], 


where y, k, and Í are output, capital, and labor growth, and œ and (1— o) are the 
output elasticities of these factors measured in 1955 rubles (GNP) and 1959 rubles 


(industry). 


partly on official MBMW growth series indi- 
cate that Soviet machine-building factor 
productivity may have surpassed the U.S. 
level in 1975 (Tables A12 and A16 in statis- 
tical appendix, available upon request).? 


BIn evaluating the possibility that Soviet MBMW 
productivity may exceed the American level, one must 
carefully consider the extent to which official data are 
distorted by hidden inflation. Bergson appears to be- 
lieve that hidden inflation is not too severe in some 
civilian MBMW activities, but he has riot clarified his 
views on concealed inflation in the military machine- 
building sector. As à consequence, the dégree to which 





he might be prepared to accept the official MBMW 
growth series after 1975 is unclear. 


We must also reckon perhaps with a degree of 
concealed inflation owing to overevaluation of 
new domestic machinery, though the evidence 
on that is particularly opaque. It would, I think, 
be surprising if the resultant overstatement 
overall in annual producers’ durables invest- 
ment growth was more than one percentage 
point 1971-75 and three percentage points for 
1976-80. 

In sum, that the TsSU and CIA data on real 
fixed irivestment are subject to concealed infla- 
tion has yet to be conclusively demonstrated. 
But we must, I think, reckon those measures to 
be very possibly so affected. Should they be, the 
rate of growth cited above 1971-75 might have 
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The trend thereafter is less certain. Soviet 
MBMW productivity deteriorates using the 
CIA’s output estimates, and improves ac- 
cording to official series. These statistics in 
their totality are difficult to interpret, but 
they do suggest that factor productivity in 
the Soviet machine-building sector was at, 
or above, both the aggregate and industrial 
levels in 1975 and that the margin may have 
widened thereafter. 

As previously cautioned all comparative 
assessments of Soviet: economic perfor- 
mance are clouded by the suspicious char- 
acter of Soviet statistics. It should also be 
observed that the CIA's dollars-ruble ratios 
on which Bergson and the author rely may 
overstate the quality of Soviet equipment. 
Nonetheless, to the extent that Bergson's 
methodological conventions are used as a 
norm, it follows that Soviet machinery and 
industrial productivity in 1975 were much 
closer to the American level than the econ- 
omywide average. Whatever deficiencies be- 
set the socialist economic systems of the 
East generally, they appear to operate with 
reduced force in industry and industrial 
subsectors supporting priority Soviet activi- 
ties. 


II. Sources of Inefficiency 


During the course of his various analyses, 
Bergson identifies numerous causes that 





to be discounted by a fraction of a percentage 
point. For the tempo cited 1976—80, the corre- 
sponding discount might be, say, a full percent- 
age point. [Bergson, 1987c, p. 420] 


In his fn. 16 Bergson explains further that hidden 
inflation in the MBMW producers' durables sector is 
likely to be in line with rates on fixed investment 
because TsSU's producers’ durables statistics do not 
appear to be biased by imported foreign durables 
prices. Readers wishing to explore this matter further 
are directed to the appendix tables cited, CIA (1988), 
and Rosefielde (1988a, 1988b). 

In appraising the performance of the Soviet ma- 
chine-building sector it should be borne in mind that 
the U.S. Defense Intelligence Agency considers the 
technology of deployed Soviet weapons to be on a par 
with America's. See Joint Economic Committee of 
Congress, 1987, p. 88. 
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partly explain Soviet and/or socialist eco- 
nomic underefficiency. He places the most 
importance om Soviet technological short- 
comings’ and the least on intraenterprise 
technical inefficiency. (Compare Padma De- 
sai, 1987; Yasushi Toda, 1986; and Robert 
Whitesell and H. Barreto, 1986.) The find- 
ing that the productivity structure of the 
Soviet economy parallels the duality long 
manifest in its output structure does not 
contradict this causal ranking, but it does 
require further elaboration. Apparently, the 
potency of some or all these causes varies 
across branches and sectors. Recent empiri- 
cal work on intraenterprise efficiency in So- 
viet cotton refining (Rosefielde, Knox Lovell, 
Vyachaslav Danilin, and Ivan Materov, 
1985) and the confectionary sector (Mikhail 
Afanas"ev and V. Sokov, 1985; Afanas"ev, 
1987), insofar as it is representative, indi- 
cates that technical inefficiency in light in- 
dustry may not be a major source of aggre- 
gate underproductivity. Counterpart studies 
in other sectors have not been undertaken, 
but it would hardly be surprising if technical 
efficiency in priority activities were not at 
least up to the cotton refining and confec- 
tionary industrial norm (Rosefielde and 
R. W. Pfouts, 1988; Richard Ericson, 1984). 
It may therefore be provisionally inferred 
that the superior productivity of the Soviet 
industrial sector generally, and the machine- 
building sector in particular, are at- 
tributable to the selective acquisition of 
advanced technologies and the priority . 
allocation of key factor supplies. Priority 
sectors enjoy privileged access to scarce, 
superior inputs, have some influence on the 
production of intermediate input supplies 
including special order goods, and are less 
prone to factor supply disruptions. Research 
and development, like technology transfer 
efforts, are concentrated in these activities 
and are more effectively diffused because 
of the close working arrangements among 
foreign trade organizations, specialized 
procurement agencies, design bureaus and 
enterprises. The asymmetries in measured 


Bergson, 1987a, pp. 355-56. Also see his discus- 
Sion on prices, p. 345. 
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factor productivity manifested by the Sovi- 
ets and other socialist economies thus may 
not be accidental. They may reflect endur- 
ing institutional priorities of diverse sorts 
that, other things equal, should be expected 
to persist, thwarting all efforts to prove on 
purely positive grounds that these economies 
are, or must eventually be, comprehensively 
underefficient. 


III. Conclusion 


Subject to the usual controversies regard- 
ing Soviet data and profound but in- 
tractable difficulties of measuring compara- 
tive demand satisfaction, this note supports 
Bergson's important finding that the Soviet 
economy and socialist economies of the East 
were statistically underproductive (under- 
efficient) in 1975. However, it disconfirms 
two closely related subhypotheses. First, 
contrary to the spirit of Bergson's article, 
his own data demonstrate that the Soviets 
narrowed the productivity gap with the West 
in 1960—75 and that socialism as it has 
developed in the Soviet Union and Eastern 
Europe therefore need not always be mea- 
surably underproductive in all regards. Sec- 
ond, disaggregation revealed further that 
factor productivity in the Soviet economy 
exhibits a pronounced duality that may make 
it possible for the Soviets to excel in some 
aspects of industrial production, while lag- 
ging noticeably in others. These findings 
cast some doubt on the adequacy of the 
kinds of adjusted data on which Bergson 
relies, but insofar as they serve as a valid 
norm, it must be concluded that the social- 
ist economies of the East are not unambigu- 
ously and comprehensively underproductive 
(underefficient). An impartial reading of the 
evidence thus indicates that Soviet eco- 
nomic performance evaluated sectorally 
and/or dynamically may not always be infe- 
rior to its rivals in the developed West, and 
might be improved by administrative, incen- 
tive, property right, and/or market reforms 
(Aleksandr Isaev, 1989). Although the prog- 
nosis for Soviet and Eastern European pro- 
ductivity (efficiency), given their poor per- 
formance since 1975 and the widespread 
dissatisfaction that has been expressed in 
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the East with the quality of socialist life, is 
hardly encouraging, it is still premature 
to suppose that the aggregate productivity 
of these socialist economies will continue to 
underperform the capitalist economies of 
the developed West according to the mea- 
sures Bergson prefers, or that the East 
cannot excel in some important industrial 
pursuits. 
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Comparative Productivity: Reply .- . 


By ABRAM BERGSON” 


By reprocessing comparative productivity 
data for 1975 in Bergson (1987) to conform 
methodologically to related data for 1960 in 
Bergson (1972, reprinted 1978), Steven 
Rosefielde usefully provides a basis to gauge 
relative productivity shifts over time. He 
considers the advance in Soviet status thus 
manifest, however, as “remarkable.” It is 
also held to represent one of several ways in 


which my principal finding, that Soviet fac- . 


tor productivity is depressed by Western 
standards, is misleadingly incomplete. Rose- 
fielde apparently accepts that finding but 
concludes that "contrary to the spirit" of 
Bergson (1987), *the Soviets narrowed the 
productivity gap with the West in 
1960-1975.” 

Perhaps they did, but if so, according to 
his compilation, the narrowing was hardly 
dramatic. At 41 (USA = 100), Soviet factor 
productivity in 1960 is 60.5 percent of the 
average for the five Western countries con- 


sidered. At 55, the corresponding 1975 ratio - 


is 62.8 percent. 

As recent writings of William Baumol and 
others show, the quantitative relation be- 
tween “development stage” and productiv- 
ity growth is a complex matter. According to 
a separate inquiry (Bergson, 1974, reprinted 
1978), however, factor productivity growth 
during 1955-70 varied markedly among the 
same five Western countries, plus Japan. 
The variation takes the form of a statisti- 
cally significant inverse relation to develop- 
ment stage, such as often hypothesized. The 
calculations also indicate, though with 
somewhat less statistical assurance, what 
perhaps might have been inferred from 
Rosefielde’s compilation, that considering 
development stage, the USSR here too is 
relatively depressed. 


*Department of Economics, Harvard University, 
Cambridge, MA 02138. 
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In comparing factor productivity levels in 
the USSR and the West in Bergson (1972), 
I compiled data not only for all "material 
sectors" together but for "industry" alone. 
The latter calculation, however, proved es- 
pecially laborious, and in Bergson (1987), 
with a decidedly larger sample of countries 
to deal with (including, in addition to the six 
studied for 1960, five others: Japan, Spain, 
Poland, Hungary, and Yugoslavia), I fo- 
cused on the more comprehensive sphere 
alone. In view of the importance that Rose- 
fielde attaches to this deficiency, perhaps he 
himself will feel impelled to repair it. 

Meantime, though, he properly states my 
finding that, regarding factor produc- 
tivity, the USSR in 1960 compares more 
favorably with the West in industry than in 
material sectors generally. As he seems to 
acknowledge, though, with appropriate nor- 
malization, the USSR probably underper- 
forms the West in one sphere as well as in 
the other (Bergson, 1972). 

Rosefielde must nevertheless be right to 
infer that factor productivity in industry in 
the USSR is higher than in the economy. 
more generally. Whether that margin of su- 
periority has been increasing, as he argues, 
is doubtful. In view of inevitable data limita- 
tions, the edge in favor of industry shown by 
his calculations seems rather narrow. Also, 
as he fails to consider, his measures of pro- 
ductivity growth in the economy generally 
cover services. For purposes of productivity 
measurement, Bergson (1972, 1987) rightly 
focuses on material sectors. There, factor 
productivity growth has been distinctly 
greater than in the economy generally and 
probably has been fully comparable to that 
in industry (Bergson, 1974, 1983). Contrary 
to Rosefielde's further speculation, indus- 
trial factor productivity in the USSR may 
have retrogressed relative to that in the 
West during 1955—70 (Bergson, 1974). 

Rosefielde's computations for MBMW are 
novel and interesting. He is probably right 
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that Soviet productivity compares more fa- 
vorably with that of the USA there than in 
industry as a whole. He might have stressed 
more than he does (fn. 13), though, that 
MBMW includes military machine building. 
For fairly obvious reasons, comparative data 
on output in this sphere must be subject to 
decidedly more than the usual margin of 
error attaching to such statistics. 
Rosefielde alludes more than once to the 
dubious nature of “Soviet statistics" under- 
lying my comparative productivity computa- 
tions. As the nonspecialists may not grasp, 
he is being cautioned thus not primarily 
about Soviet official statistical data, such as 
have rightly come to be discredited among 
Soviet as well as Western scholars (and that 
Rosefielde himself audaciously uses in the 
second of his two computations on MBMW). 
Rather, Rosefielde is referring in that odd 
way to widely accepted Western recalcula- 
tions of such Soviet data. Substantial and 


carefully documented as such recalculations: 


are, however, they too have their limita- 
tions. In cited studies, I have sought system- 
atically to address such limitations. My 
findings, I believe, are quite robust to resul- 
tant data errors. 

, Rosefielde speculates finally about the 
conditioning factors underlying Soviet un- 
derperformance regarding factor produc- 
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tivity. My discussions of this interesting 
question in a number of different writings 
admittedly call for collation and review, but 
that is a task for another occasion. Flawed 
as Rosefielde’s critique is, it effectively un- 
derscores the complexity of comparative 
economic performance appraisal. 
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On the Basing-Point System: Comment 


By Dav D. Happock* 


In “On the Basing-Point System,” Bruce 
Benson, Melvin Greenhut, and George 
Norman (1990)! properly correct a misun- 
derstanding by Jacques Thisse and Xavier 
Vives (1988)? concerning my 1982 paper on 
basing-point prices. But BGN then take ex- 
ception with several of the conclusions I 
had reached, or that they thought I had 
reached. Although I have insufficient space 
to deal with all the points they raise, I will 
address the most important ones. 

In 1981 most economists believed that a 
model of profit-maximizing basing-point 
pricing would have to assume collusion. My 
model, as BGN seem to agree, showed that 
it need not; atomistic competition at one 
production site accompanied by local mon- 
polies elsewhere may lead to basing-point 
pricing and freight absorption without collu- 
sion.? Conversely, TV showed that the prac- 
tice as observed empirically will not arise if 
all production sites contain noncooperative 
local monopolies. BGN establish yet an- 
other important theoretical result —nonco- 
operative oligopoly at one site could gener- 
ate the f.o.b. prices (or f.o.b. plus transport) 
that, in my model, led local monopolies 
elsewhere to adopt freight-absorbing bas- 
ing-point prices. The three models would 
seem to be complements. 


*Northwestern University School of Law, Chicago, 
IL 60611-3069. I am grateful to Ian Ayres, Marty 
DeBoer, Mark Grady, Fred McChesney, and Mena- 
hem Spiegel for helpful comments. 

;BGN henceforth. 

TV henceforth. 

IOn the other hand, the situation envisioned may 
lead to some alternative pricing pattern. Under some 
circumstances the profit-maximizing monopoly price 
for a remote firm will be less than a price based on 
those of atomistic firms elsewhere. In that event, the 
remote firm will charge a monopoly price. If it is too 
costly for the remote firm to price discriminate spa- 
tially in a more refined manner, the remote firm will 
become a base in a multiple basing-point system. See 
Section III, part B of my 1982 article. 
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I. A Paradox 


But BGN claim that their formalization 
shows that noncooperative freight-absorbing 
basing-point prices require "highly restric- 
tive conditions" (p. 584), implying to them 
that such systems would be rare. Hence, 
BGN contend that, though the logic of such 
a model is internally consistent, rivalrous 
freight-absorbing basing-point systems are 
rare or nonexistent in practice; actual in- 
stances must arise from conscious paral- 
lelism or collusion (p. 587). And even if 
such prices reflected rivalrous behavior, the 
implied spatial price discrimination might 
be welfare reducing. 

BGN's contention creates a paradox: (1) 
BGN claim that basing-point pricing with 
freight-absorption is not rivalrous in prac- 
tice; (2) economists have long suspected 
that a strong cartel will not use freight- 
absorbing basing-point prices; (3) hence, if 
(1) and (2) are correct, the practice must 
arise in some sort of weak cartel; but (4) 
neither BGN nor any earlier writer has elu- 
cidated a widely applicable and satisfacto- 
riy rigorous model that implies that 
basing-point pricing will comprise profit- 
maximizing behavior by a weak cartel. 

It seems peculiar to try to settle an essen- 
tially empirical issue through pure theory, 
particularly since BGN offer no comparably 
rigorous model that yields basing-point 
prices as a collusive practice. Lacking the 


‘In the latter regard BGN resemble TV, who re- 
mark that a single base system is "not a stable config- 
uration since it is not an equilibrium of our two-stage 
game. This suggests the hypothesis that [basing-point 
pricing] cannot be explained in the context of a nonco- 
operative model...and suggests that theoretical expla- 
nations of [basing-point pricing] should consider its 
role as a coordinating and collusive device..." (pp. 
130-31). That suggestion, however, is reached by de- 
fault, because TV have no collusive model yielding 
basing-point pricing either. As BGN properly note, TV 
were simply not working with a model capable of 
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latter model, it would seem impossible to 
guess, even on theoretical intuition, whether 
or not collusive basing-point prices require 
less restrictive conditions than do rivalrous 
ones. 

Nevertheless, for the sake of argument, 
assume BGN are correct. What would that 
imply? Freight-absorbing basing-point pric- 
ing seems to be an unusual system. Few 
industries ever use it, and then only transi- 
torily? If the circumstances that implied 
such a rare practice were not restrictive, 
then one might indeed suspect collusion. 
Hence, I find no cause in BGN to alter my 
prior belief that basing-point prices per se, 
even those involving freight-absorption, pro- 
vide no prima facie evidence about collu- 
sion one way or the other. If most firms are 
not (are) colluders, then most firms using 
basing-point prices are (are not) probably 
exhibiting rivalrous behavior. 

Clearly, that does not mean that a firm 
utilizing basing-point prices can never be 
guilty of collusive behavior, just as f.o.b. 
pricing (or any third pricing structure) does 
not prove rivalrous practice. But as I indi- 
cated in 1982 (p. 293), those alleging collu- 
sion should be required to produce other 
evidence, because basing-point prices them- 
selves seem useless as indicators.’ 





generating basing-point pricing, but that does not mean 
one cannot exist—my rivalrous one, for example, or 
BGN's oligopolized variant. 

Curiously, immediately following the quoted pas- 
sage, TV very briefly outline in rudimentary form alter- 
ations in their model that would yield conclusions 
broadly consistent with my own. 

Except, perhaps, for some European examples. But 
there the pricing structure is established and enforced 
through governmentally established bureaucracies. 

In fact, I noted in 1982 that “basing-point prices 
may arise in a competitive industry at least as readily as 
in a poorly cartelized one" (p. 293, emphasis added). 
Only the competitive model was elucidated; I had 
nothing to add to the existing literature regarding the 
poorly cartelized possibility, as formulated by George 
Stigler (1949) and John McGee (1954). Subsequent 
publication of Dennis Carlton's 1983 article, however, 
has dramatically weakened my confidence in the prior 
theory of poorly cartelized basing points. 

"One collusive organization that might opt for bas- 
ing-point prices is one that is strong locally but has 
geographically limited influence—that is, one that can- 
not achieve agreement with, detect chiselitig by, and/or 
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II. Refuting the Presumption of Collusion 
A. Limit-Entry Location 


BGN base their belief that cartels might 
adopt basing-point prices on the hypothesis 
of pure profit with freedom of entry, as 
advanced by Nicholas Kaldor (1935) and 
developed by Curtis Eaton and Richard 
Lipsey (1978)5 The Kaldor-Eaton-Lipsey 
model resembles a spatial version of limit- 
entry pricing. Plants locate closely enough 
together to enable the existing ones to earn 
some profits, though rarely short-run 
profit-maximizing ones, but there is insuf- 
ficient demanc to cover average costs in the 
event of entry? BGN argue that "more 





punish violations of the cartel agreement by at least 
some firms that are at a distance. Functionally, a 
locally strong cartel resembles a spatial monopolist, 
and so resembles the remote firm of my 1982 article. A 
simple extension of my 1982 model shows that under 
the appropriate conditions maximizing cartel profits 
will require basing cartel members' prices on those of 
nonmembers elsewhere, and then absorbing freight. 
See Section VI, Part B of my 1982 article for a discus- 
sion of behavior that may have reflected the use of 
such weakly collusive basing-point pricing. Also see 
Federal Trade Commission v. National Lead Company 
et al. In 1933, while drafting a "code of fair competi- 
tion" for themselves under the National Industrial 
Recovery Act, several lead producers adopted a spatial 
price schedule that resembled that of the industry's 
largest firm. 

Beginning with a similar insight, Thomas Gilligan 
(1990) has independently undertaken a detailed theo- 
retical and empirical investigation of the plywood in- 
dustry. 

Note carefully, however, that the requisite condi- 
tions for this form of weakly collusive basing-point 
pricing closely resemble the ones that lead remote 
firms to absorb freight in conjunction with basing-point 
pricing in a rivalrous market. Consequently, from a 
purely theoretical standpoint, those collusive require- 
ments are no easier to satisfy than are those facing my 
rivalrous model. Whether or not the use of basing-point 
prices reflects collusion, then, remains an empirical 
issue that requires independent evidence of collusion 
for its resolution. 

‘BGN, p. 586. 

Though resembling limit-entry price theories, the 
Kaldor-Eaton-Lipsey hypothesis is on firmer ground 
when firms have site-specific long-lived investments as 
credible precommitments to “stand their ground” in 
the face of entry-induced losses, a point alluded to by 
Eaton arid Lipsey, and developéd by Dennis Capozza 
and Robert Vari Order (1980). 
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profitable systems may well induce competi- 
tive entry. ...[I]f base-point pricing... pro- 
tects existing rents, it may actually be the 
most desirable arrangement...[because]...a 
base-point system is much less costly to po- 
lice and enforce...” (p. 588). 

Kaldor, Eaton, and Lipsey have not gone 
unchallenged. The erosion of abnormal 
gains when the rights to them are ill-defined 
is well understood for other activities.’° 
Ronald Johnson and Allen Parkman (1983) 
argued that the same "tragedy. of the com- 
mons" applies to Kaldor-Eaton-Lipsey. Ex- 
pected future profits for incumbents induce 
premature investment. Profits aggregated 
over the investment's life will be normal, 
though the plant incurs losses during some 
periods and reaps profits during others. 

So entry cannot be deterred by Kaldor- 
Eaton-Lipsey techniques unless an industry 
is stagnant or declining. Indeed, Johnson 
and Parkman claim to refute the empirical 
implications of the hypothesis for the ce- 
ment industry. If basing-point prices will not 
deter entry, one would expect a cartel— 
whether weak or strong—simply to maxi- 
mize short-run profits, but that will usually 
require a pricing structure other than one 
using basing points. 

If property rights to above-normal gains 
are well defined, as through ownership of 
essential inputs or processes, or through 
regulation, the rights owners can control 
entry directly, if that is desirable. The own- 
ers can simply refuse to sell the input, 
license the process, or authorize the entry, 
and thus have no incentive to adopt basing 
points to exclude potential competitors. 


B. A Cartel's Ability to Detect Chiseling 


Not only do basing-point prices not maxi- 
mize cartel profits, entry possibilities being 
neglected, they actually enhance opportuni- 
ties to chisel. Contrary to the supposition of 
many observers, BGN included, basing-point 


I? H, Scott Gordon (1954) was an early expositor of 
the "property rights paradigm." See Terry Anderson 
and P. J. Hill (1990), and Richard Cornes, Charles 
Mason, and Todd Sandler (1986) for updated bibli- 
ographies. 
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prices interfere with cartel discipline. 
George Stigler (1949) is the most cited au- 
thor on collusive basing points. But Carlton 
argues that Stiglers own ^A Theory of 
Oligopoly" (1964) is inconsistent with 
Stigler's earlier theory of delivered prices. A 
cartel must first detect chiseling to control 
it. If prices are unobservable, Stigler's 
oligopoly theory shows that a second-best 
means to detect chiseling is to observe sales 
patterns. Changes that cannot plausibly re- 
flect stoachastic factors will signal chiseling. 

Basing points make price irrelevant for 
buyers’ choices among sellers, the sellers 
seem too homogeneous to buyers, so chisel- 
ing becomes more difficult to infer from a 
given shift in buying patterns. But many 
alternative pricing systems yield similar in- 
determinacy only along market boundaries 
between adjoining production sites. Chisel- 
ing elsewhere will create damning evidence. 
Even if all colluders at a single site quote 
identical prices to every customer because 
no other way to divide the local market has 
proven workable, they will retain market 
boundaries with neighboring cartel mem- 
bers. Market division is a better cartel strat- 
egy than is product homogenization. To ar- 
gue the contrary implies that firms prefer to 
be oligopolists rather than monopolists, 
or that they prefer to be n-firm oligopo- 
lists rather than m-firm oligopolists, where 
n>m. 

If, in contrast, prices are observable, bas- 
ing points are not inferior to other systems 
for detecting chiseling. But neither are they 
superior. Indeed, no pricing system is disad- 
vantaged for that purpose. But some alter- 
native system will typically yield higher car- 
tel profits than will basing points. So neither 
strong nor weak cartels would often use 
basing points, providing the cartels are 
industrywide. 


C. Procollusive Legal Attacks on 
Basing-Point Pricing 


Alternatively, suppose a cartel exists that 
does not include all the firms in the indus- 
try, or one that has difficulty policing its 
members’ prices. Contrary to BGN’s asser- 
tion, Carlton shows that the cartel would 
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like to divide market areas among cartel 
members, not adopt basing-point prices. As- 
sume that the cartel tries to institute a pric- 
ing scheme that divides markets.!! Since the 
cartel’s price quotations exceed marginal 
cost (if the cartel merits concern), a firm at 
one site can profitably invade all or part of 
another site’s “natural market” by matching 
(or slightly undercutting) cartel prices there. 
The invasion may reflect chiseling by a car- 
tel member, or simply opportunism by a 
nonmember. The chiseling/nonmember 
firm would, in effect, be creating a multi- 
base basing-point system if the cartel had 
adopted f.o.b. prices. 

But in that event the cartel itself may 
instigate a challenge to the multibase sys- 
tem by encouraging some buyer, the Fed- 
eral Trade Commission (FTC), or the Justice 
Department to sue the chiseling/nonmem- 
ber firm for "anticompetitive" behavior. If 
the suit is successful, the chiseling/non- 
member firm will be forced to limit its anti- 
cartel behavior to its own natural market 
area. In other words, antitrust suits against 
basing-point prices can be procollusive ac- 
tions. 


III. Welfare Effects of Rivalrous 
Basing-Point Pricing 


Even if freight-absorbing basing-point 
systems appear in rivalrous industries, BGN 
worry that consumers gain nothing. Perhaps 
the enhanced revenues become rents, loca- 
tional or otherwise, or (I might add) are 
dissipated in needless overhead. I agree with 
the spirit of that suggestion. Whether the 
discrimination increases welfare ultimately 
is another empirical matter, and another 
one that will not be resolved here. Never- 
theless, there are several additional points 
that must be underlined. 


UAn implication of an article by Arthur Smithies 
(1941) is that a cartel would often opt for some geo- 
graphically discriminatory schedule, though not 
basing-point prices, rather than f.o.b. prices. That dis- 
tinction is immaterial to Carlton's discussion; many 
price schedules other than f.0.b. would divide markets. 


SEPTEMBER 1990 


First, an “Austrian” economist might 
foresee economic advantage whether or not 
increased consumer surplus results. If all 
potential entrants are inferior to a firm that 
absorbs freight under a basing-point sched- 
ule, the firm may indeed earn rents. That 
the firm would necessarily survive if forced 
to adopt a less preferred pricing schedule 
does not follow, however. And producer 
surpluses count, too. 

And even neoclassical models offer po- 
tential sources of consumer surplus that 
ought not to be ignored. Arthur R. Burns 
(1936), no apologist for basing-point prices, 
nevertheless often observed near-basing- 
point prices, not precise ones; to capture 
the bulk of sales in its “natural market," the 
remote firm in my model must be superior 
to distant firms in the eyes of local buyers. 
But discounts provide buyers a benefit. BGN 
might think it a paltry benefit, but perhaps 
it is the greatest one possible if the firm is a 
high cost producer (transport cost aside). 


12This observation was briefly discussed in my 1982 
article at fn. 8. 

“Near-basing-point price" and " precise-basing-point 
price" are not terms coined by Burns, but accurately 
denote his findings. Burns was specifically interested in 
steel pricing but ample evidence indicates near- 
basing-point prices have been the rule rather than the 
exception. For example, in Federal Trade Commission 
v. Staley Mfg. Co. it was noted that for glucose sales the 
"[rlespondents, to gain access to the markets... made 
their sales 'by first quoting the same prices as were 
quoted by competizors and then making whatever re- 
duction in price...was necessary to obtain business’” 
(Supreme Court of the United States, quoting, in part, 
evidence gathered by the FTC, 324 U.S. 746, 751). 
Similarly, Greenhut (1987) remarks that for plywood 
“the new entrants in the South established a delivered 
pricing formula which assured the capture of the 
southern market. They accomplished this by quoting a 
Portland Base price at a slight differential ‘below’ the 
f.o.b. mill price on Doulgas Fir plus West Coast freight 
to southern destinations” (p. 202). 

One is reminded of the outrage that greeted 
Robert Fogel (1964) when he discovered that the U.S. 
rail system had yielded a social saving of “only” a 
percent or two of GNP. Fogel’s response, of course, 
was that one is hard-pressed to find any alteration in 
economic structure that results in really staggering 
benefits to the economy. If one dismisses seriatim every 
“small” increment in economic welfare, there is little 
left to count. 
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Further, the firm’s effort to offer only a 
modest discount will sometimes fail if the 
firm must rely on buyers’ representations to 
estimate competing prices, for buyers have 
a clear incentive to misinform sellers. Hence, 
some buyers will obtain discounts that are 
more than modest. 

Finally, suppose for the sake of argument 
that rivalrous freight-absorbing basing-point 
prices yield gains in neither consumer nor 


producer surpluses. Even a zero gain is 


preferable to a loss. There is little question 
that either regulatory actions or private liti- 
gation absorb a substantial amount of real 
resources and, worse yet, can often be 
turned opportunistically to nefarious, so- 
cially injurious purposes, as argued by Stigler 
(1971) and his followers. Antitrust actions 
should be reserved to remedy actual in- 
juries, not to attach parties that (some may 
think) have rendered merely modest bene- 
fits. It is unwise to insist that courts take a 
more active role in fine-tuning the opera- 
tions of complex commercial enterprises. 
Justices have JDs, not MBAs; in a healthy 
society courts are asked to render justice, 
not to administer firms. 

The welfare impact of freight-absorbing 
basing-point prices will not be discovered 
solely by theoretical means. Hard empirical 
work is called for at this juncture. 


V Recognizing buyers’ incentives to mislead, a firm 
will sometimes refuse to undercut a claimed offer. But 
if the claim is genuine, a failure to underbid will cause 
the order to be lost to a more distant rival; there will 
be both Type I and Type II errors, which is merely an 
implication of Stigler (1961). A seller's inability to 
perfectly separate genuine buyers’ reports from oppor- 
tunistically misleading ones can account for at least 
some intermingled sales in freight-absorbing industries, 
as reported by Fritz Machlup (1949); just as it accounts 
for intermingled sales across natural market bound- 
aries in f.o.b. industries. Additional intermingling, it 
was argued by Stigler (1949), arises from lags between 
orders and deliveries when prices vary. 

There are other, even more subtle, benefits to buy- 
ers that may enable the remote firm to dominate its 
local market. For example, the firm may be able to 
offer quicker delivery or more personalized service or, 
though seeming to quote precise basing-point prices, 
be less insistent on prompt payment. These are but a 
few possibilities. 
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On the Basing-Point System: Reply 


By Bruce L. BENson, MELVIN L. GREENHUT, AND GEORGE NORMAN* 


David D. Haddock’s (1990) contention is 
really that the Jacques F. Thisse and Xavier 
Vives (1988) and the Bruce L. Benson, 
Melvin L. Greenhut, and George Norman 
(1990) papers do not provide a collusive 
model that yields base-point pricing (BPP). 
He apparently believes his paper (1982) es- 
tablished competitive BPP and suggests (in 
1990) that the collusive model and hard 
empirical work alone remain. However, no 
collusive model is required since it has al- 
ready been well demonstrated in the litera- 
ture on plant location and spatial price the- 
ory that firms at feasible distance locations 
from a production center would accept BPP 
only because they fear that to do otherwise 
would subject them to retaliatory actions by 
the larger, more powerful firms at the pro- 
duction center. Moreover, the paper by 
Thisse-Vives (1988, henceforth TV) and 
Benson-Greenhut-Norman (1990, hence- 
forth BGN) indicated sufficiently that other 
systems would arise than Haddock’s com- 
petitive BPP, especially since the restrictive 
conditions necessary for the latter militate 
against its use. Haddock, in fact, appears to 
accept these conditions in his references to 
Arthur R. Burns’ proposals (1936) of near- 
base-point pricing (NBPP). Hard empirical 
analysis would indeed be in order in deter- 
mining the extent of NBPP, not competitive 
BPP. 

This paper responds to the above noted 
issues in three main sections. Section I 
briefly provides some historical data on the 
collusive uses of BPP for any future re- 
searcher who seeks empirically to uncover a 
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true competitive BPP system. Most impor- 
tantly from a theoretical standpoint, this 
section also explains why cartels will employ 
BPP notwithstanding the obvious and well- 
known thesis that other more profitable 
spatial pricing forms exist, ceteris paribus. 
Section II of this paper then distinguishes 
between profits and locational rents, in the 
process indicating in partial contradiction to 
Haddock (1990) that BPP does generate 
such rents. Though we have already con- 
tended that existing theory has demon- 
strated the use of different competitive 
forms of spatial pricing and locations under 
profit-maximizing conditions than competi- 
tive BPP, Section III of this paper provides 
specific basis for this contention as well as 
references. That same section discusses 
NBPP. To save space, we shall let the analy- 
sis on the subject of the TV (1988) and 
BGN (1990) papers stand by themselves, 
without repeat demonstrations or explana- 
tions of their results. 


I. What Is an Effective Cartel? 


Haddock’s references to weak or strong 
cartels and apparent suggestion (1990) that 
BPP would be used by weak cartels over- 
simplifies the subject. A cartel arises if it is 
able to cover the cost of organizing, and 
then survives if it establishes a sufficiently 
strong monitoring and enforcement (polic- 
ing) system that is capable of limiting com- 
petition (for example, price competition, en- 
try) to an acceptable level. Cartel costs vary 
with the characteristics of an industry (num- 
ber of firms, geographic distribution of firms 
and consumers, etc.). When firms and con- 
sumers are geographically dispersed, orga- 
nizational and policing costs tend to be high 
and the ability of the cartel to limit compe- 
tition is lessened. 

We echo many others when we suggest 
that BPP reduces policing costs for a geo- 
graphically dispersed group. Haddock is not 
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convinced by this argument. He contends 
that cheating will be detected more readily 
under f.o.b. pricing because prices (both 
f.o.b. and basing-point) are not easily ob- 
served. He proposes that effective monitor- 
ing must therefore involve observations of 
relative sales. He suggests that chiseling at a 
distance from a boundary “will create 
damning evidence" under f.o.b. pricing 
(1990, p. 959). We would propose, however, 
that prices are readily observed under many 
real basing-point pricing arrangements. Is 
Haddock suggesting that chiseling would not 
have been easily detected on the part of one 
or more of the eleven differentially located 
firms that offered cement to the U.S. gov- 
ernment at the identical delivered prices of 
$3.286854 (Aetna v. FTC, 1946)? Or what 
_ of the ten sealed bids priced at $253,633.80 
for reinforcing bars (New York Times, 
February 20, 1939), or the 59 steel pipe bids 
to the U.S. Navy Department, each for 
$6,001.83 (Annual Report of the Attorney 
General, 1937, pp. 37-38)? Or, consider the 
eight geographically dispersed cement com- 
panies that submitted bids to the Illinois 
Department of Highways for deliveries at 
102 sites in 102 Illinois counties, every 
bid for every delivery being identical in 
price (Congressional Record, May 31, 1959, 
p. 7961). 

The reason that basing-point prices are 
relatively easier to monitor is that they are 
determined by a simple formula that is 
known by every cartel member. For in- 
stance, between January 1982 and Septem- 
ber 1983 there were three regional bases for 
the pricing of cement in England, and deliv- 
ery increments were applied to every 5 miles 
of road. 'These increments reflected haulage 
rate charges computed from a particular 
base point, not the point of origin of the 
shipper; these rates varied over time from 
16p to 56p per ton per 5 miles of road. 
Prices certainly may not be readily observ- 
able under an f.o.b. system, but cooperative 
BPP makes them relatively more observ- 
able, and therefore reduces monitoring 
costs. 

The fundamental feature of BPP that 
Haddock overlooks is that the main conspir- 
ators are typically localized at (near) the 
base point(s), and other spatially dispersed 
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firms feel obliged to adhere completely to 
the system. A chiseler must be punished if 
caught, or there will be no effective deter- 
rent to cheating. When chiseling under some 
organized type of pricing system other than 
BPP affects only a single firm located at a 
distant site, the incentives and ability of the 
cartel to retaliate and punish the transgres- 
sor are quite weak. This is particularly the 
case if the affected firm is small and not 
part of the main group of powerful conspir- 
ators. When chiseling impacts all cartel 
members, including the largest and most 
powerful, as would be the case under BPP, 
the potential for and likelihood of retalia- 
tion is much greater. This could conceivably 
induce, in some countries, the procollu- 
sive type of antitrust action mentioned by 
Haddock; however, in a country such as the 
United States of the late twentieth century, 
any BPP cartel would be likely to pursue 
more subtle means of punishing a chiseler. 

We contend that even if monitoring costs 
are no lower under a basing-point system 
than they are under some undefined, orga- 
nized f.0.b. type of arrangement, total BPP 
cartel costs (which include enforcement as 
well as monitoring costs) will tend to be 
less. The imposition of credible punishment 
is an important part and cost of effective 
cartelization. For example, Clair Wilcox 
(1960) mentioned 49,000 pages of testimony 
along with 50,000 pages of exhibits on ce- 
ment industry prices that were presented to 
the Federal Trade Commission. After also 
discussing the equally detailed documents 
on record concerning the steel industry, he 
went on to review penalties. Those imposed 
by the respective cartels on member firms 
that failed to adhere to the system were 
strikingly severe (pp. 280-81). He con- 
cluded that “If basing point pricing were a 
spontaneous outgrowth of natural causes, as 
some economists have argued, it would 
scarcely have been necessary to go to such 
lengths to ensure that its requirements be 
observed” (1960, p. 281). 

Haddock is also incorrect when he sug- 
gests that sellers under BPP seem relatively 
more homogeneous to buyers than a cartel 
would wish (1990, p. 959). The fact of ho- 
mogeneous output is exactly one reason why 
the system is desired by the large firms 
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located at the base point. The steel, cement, 
and plywood firms that located at base 
points wanted and took advantage of homo- 
geneity via a spatial delivered price that 


allowed them to maintain their markets over _ 


substantial distances. Haddock ignores an 
important aspect of many of these indus- 
tries when he contends that a cartel would 
prefer geographic market division. As 
Wilcox (1960, p. 280) observed, changes in 
the geographic pattern of demand for steel 
(and presumably other construction mate- 
rial) took place rapidly in the United States. 
Thus, for these industries, producers’ loca- 
tions are fixed while consumers’ are not. 
When subject to geographic market division 
and f.o.b. pricing, firms would have to build 
new facilities to follow demand. This is not 
required as often under BPP since the large 
powerful firms under BPP can readily sell in 
new distant markets.” 


II. Profits or Rents? 


Haddock implies that cartelization must 
fulfill its objectives (for example, joint profit 
maximization, entry deterrence), or the car- 
tel is ineffective. He refers to Ronald N. 
Johnson and Allen M. Parkman’s (1983) 
demonstration that the cement industry did 
not earn supranormal profits, proposing this 
as evidence that any entry deterring efforts 
by the firms in the industry had to be inef- 
fective. It would then seem to follow that 
any noncooperative, noncontrolled system 
could also be ineffective with respect to 
entry deterrence.? However, long-run rents 


IBGN assume immobile consumers as well, but 
demonstrate that even in this case noncooperative pric- 
ing is very unlikely and market segmentation is the 
competitive result. In contrast, Haddock implies that 
market segmentation is strictly a collusive outcome. 

This is another consideration that would reinforce 
the TV (1988) conclusion that BPP becomes a relevant 
collusive practice. i 

*Haddock apparently accepts George J. Stigler's 
(1971) theory of economic regulation since he suggests 
that regulation and licensing may be a source of entry 
limits. According to this view, most regulatory actions 
provide cartel-like benefits for the regulated firms 
rather than benefiting consumers. Stigler's theory is 
based on the public choice paradigm in which bureau- 
crats and other public officials are driven by self-inter- 
est rather than public interest motives. Observe in this 
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in a true BPP system can appear in different 
form than higher net profits, while also re- 
sulting from entry deterring efforts. In par- 
ticular, competition for prime locations 
would bid up the value of these sites. These 
locational rents would be capitalized as part 
of land values, rather than appearing regu- 
larly on P&L statements as high profits. 
More fundamentally, cement industry prof- 
its and individual firm profits are very dif- 
ferent matters. This is especially the case 
because the distant small firm locates dif- 
ferently than a strictly competitive f.o.b. firm 
while, at the same instant, earning less un- 
der BPP than it otherwise would net (Melvin 
L. Greenhut, 1956). Finally, note that for 
risk-averse individuals, collusion that re- 
duces the behavioral uncertainty inherent in 
noncooperative oligopoly could easily make 
the affected firms better off, even if nominal 
profits do not increase in any measurable 
way. 


IH. Conclusions: What in Fact Is Haddock's 
Noncooperative Spatial Pricing System? 


Haddock's conclusions seem to agree with 
BGN that a true basing-point system would 
not exist under competitive conditions. Only 
its reflection would arise, with distant firms 
offering a modest discount.* He thus em- 
phasizes what Burns (1936) called “near- 
base-point pricing," NBPP. If this is what 
Haddock means by competitive BPP, then 


regard that the only examples of BPP which Haddock 
now appears to consider to be valid come from Europe 
"where the pricing structure is established and en- 
forced through governmentally established bureaucra- 
cies" (1990, fn. 5) While we do not agree with 
Haddock that these are the only prime examples (for 
example, the federal milk order system in the United 
States has many basing-point characteristics), the fact 
that the most obvious examples arise in governmentally 
regulated settings reinforces the long-standing view 
that nongovernmentally imposed BPP is associated with 
cooperative rather than competitive behavior. 

4A modest "discount" system could indeed prevail, 
and in a crude sense it often does in retailing where an 
uptown (suburban) department store and the uptown 
branches of downtown stores charge higher prices than 
the downtown price. Quite significantly, the differences 
in price correspond roughly to the time-distance 
“cost-saving” of suburban residents who shop at the 
stores located nearest them rather than having to go 
downtown. 
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there really is little disagreement between 
us except semantical. After all, competitive 
price discrimination, which just undercuts 
the base-point schedule, that is, NBPP, is 
precisely the noncooperative price equilib- 
rium that arises in the BGN extension (1990) 
of the TV (1988) paper. 

Competition in spatial markets, where 
both immobile buyers and sellers are spa- 
tially dispersed and transportation costs are 
significant, as modeled by Haddock (1982) 
and BGN (1990), leads naturally to a seg- 
mented market structure under which spa- 
tially separated firms, acting independent- 
ly, would increase their profits by setting 
prices that undercut the distant rivals, 
ceteris paribus. It has, indeed, been well es- 
tablished in the literature on spatial price 
theory that freight-absorbing discriminatory 
pricing over a geographic space, as depicted 
by TV (1988) and BGN (1990), is the natu- 
ral pricing form for noncooperative firms. 
Certainly, demand elasticities can be ex- 
pected on a priori grounds to differ at each 
buying site within a submarket (Edgar M. 
Hoover, 1936-37; Arthur F. Smithies, 1941; 
Greenhut, 1956). Note further that even the 
traditional view of £o.b. pricing as a com- 
petitive process comes into question when 
invasion of another firm's submarket 
through price discrimination is considered 
(Greenhut, Norman, and Chao-shun Hung, 
1987; Benson and Greenhut, 1989). Com- 
petitive spatial price discrimination en- 
hances consumer welfare relative to a 
basing-point system, which welfare conse- 
quence is a rather clear theoretical predic- 
tion, not the unpredictable empirical issue 
suggested by Haddock. Furthermore, BPP 


5For empirical evidence of this pricing in the United 
States, West Germany and Japan, see Greenhut (1981). 

Indeed, we find Haddock's discussion of welfare 
implications somewhat mystifying since the welfare 
benefits he discusses arise under NBPP (which is a 
form of spatial price discrimination) and BGN never 
suggested that such gains would be "modest." We are 
also surprised that Haddock turns to Austrian argu- 
ments to defend his position on competitive BPP that 
was originally based on a very non-Austrian static 
equilibrium model. Of course, producer surpluses are 
relevant, but the gains in consumer surplus from the 
breakdown of a basing-point system do not arise solely 
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is a hybrid price system characterized by 
freight absorption and phantom freight, as 
all firms at sites other than the base point 
are obliged to price discriminate while those 
located at the base point price f.0.b. mill. 

When a true basing-point pricing system 
arises, it is likely to have been imposed as a 
result of a cooperative process (Machlup, 
1940; Stigler, 1949) under which the distant 
sellers feel obliged (coerced) to follow the 
established system (Greenhut, 1956; Wilcox, 
1960). As such, it provides strong corrobora- 
tive evidence of cooperative pricing, particu- 
larly when accompanied by organized en- 
forcement efforts and punishment of those 
who cut price in violation of rate books, and 
so on. When Haddock's quasi-base-point 
price system arises, it is in the form of 
discounts offered to buyers located most 
proximate to a distant seller. 

Two final issues warrant mention. (1) 
Haddock’s statement (1990, p. 957) that “It 
seems peculiar to try to settle an essentially 
empirical issue through pure theory,..." 
(emphasis added) reveals a failure to appre- 
ciate the place of theory in understanding 
the world we live in. Specifically, what is the 
empirical issue? Surely not the issue of ri- 
valrous versus collusive behavior. That issue 
is a purely theoretical one that can be re- 
solved only through careful consideration of 
the theory of pricing behavior. A theoretical 
model explains the circumstances under 
which a firm will price f.o.b. or follow BPP. 
Then, and only then, does the empirical 
determination of the form of pricing, its 
circumstances and extent become relevant. 
(2) Based on theory, the reason for a distant 
firm’s adherence to BPP requires just one 
restrictive condition: fear of the impacts that 
would follow from competitive pricing. On 
the other hand, the reasons why firms at the 
production center use BPP to protect rents 
rather than what otherwise would appear to 
be more profitable systems are the homo- 
geneity/near homogeneity of their goods, 





from a surplus transfer: the non-base point firm is also 
better off. Surely Haddock is not suggesting that BPP is 
acceptable because the base-point firms are better off 
even though consumers and distant firms lose out? 
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the simplicity of BPP, and the low costs in 
implementing, monitoring, and enforcing 
the system. 
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The Adjustment of Expectations to a Change in Regime: 
- Comment | 


By Raymonp P. H. FisHe AND MARK WoHAR* 


In an article in this Review, Gregory 
Mankiw, Jeffrey Miron, and David Weil 
(1987, hereafter MMW) argue that “the 
founding of the Federal Reserve System in 
1914 led to a substantial change in the be- 
havior of nominal interest rates." They sub- 
stantiate this claim by using data on three- 
and six-month time loan rates available at 
New York City banks between 1890 and 
1933. 

We take issue with their conclusions for 
two reasons. First, the data that they use 
are subject to error and do not always rep- 
resent market transactions. There were 
many months in which no business was con- 
ducted in the time loan market, primarily 
because of financial panic or distress, and in 
these months only a "nominal" loan rate 
was reported. The nominal rate was arbi- 
trarily set at the usury ceiling in New York, 
which was 6 percent over this period. Thus, 
for many months, the true market rate was 
unknown. 

Second, their claim that World War I had 
little to do with the stochastic structure of 
interest rates is questionable. The outbreak 
of the war closed stock and bond exchanges 
in the United States and Europe for more 
than four months.! The closing of the mar- 


*Department of Economics, University of Miami, 
Coral Gables, FL 33124; and Department of Eco- 
nomics, University of Nebraska, Omaha, NE 68182, 
respectively. We wish to thank Louis De Alessi, Al 
Holtmann, and two anonymous referees for helpful 
comments, as well as Cigdem Ural for providing re- 
search assistance. We assume responsibility for any 
remaining errors. 

1On Monday, July 27, 1914, the Vienna and Brus- 
sels exchanges were closed. On Tuesday, the Paris 
Bourse and the Montreal and Toronto stock exchanges 
were closed. On Wednesday, stock and bond exchanges 
in Berlin, St. Petersburg, Amsterdam, Liverpool, 
Antwerp, Hamburg, and Frankfort suspended trading. 
The New York Stock Exchange and the London Stock 
Exchange closed on Friday, July 31, 1914. 
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kets because of the war precipitated a li- 
quidity crisis. The birth of the Federal Re- 
serve System helped mobilize resources to 
reduce this crisis, but the system’s initial 
endowment cf only $247 million was small 
in comparison to the $382 million in 
“emergency currency” that Secretary of the 
Treasury William McAdoo issued in re- 
sponse to the crisis. 

The reopening of the bond market on 
November 28, 1914, and of the stock market 
on December 12, 1914, offered additional 
relief to the liquidity crisis by allowing in- 
vestors and bankers to reorganize and value 
their portfolios in a formal market setting. 
Because of the many liquidity-creating ac- 
tions instituted after the onset of War World 
J, it is not possible to claim that interest 
rates were more affected by the birth of the 
Federal Reserve System than, say, the issue 
of emergency currency or the reopening of 
the markets. 

Our discussion of these points proceeds 
as follows. The data problems are docu- 
mented in Section I. In Section II, we offer 
a simple solution to the problem of “mis- 
sing” market loan rates. We find that the 
three-month series shows a break in late 
1914 or early 1915, but that the six-month 
series shows a break in the middle of 1912. 
We argue that the actual break in the 
three-month series coincides more closely 
with the reopening of the major stock ex- 
changes than the opening of the Federal 
Reserve System. A few brief closing re- 
marks are offered in Section III. 


I. Data Problems 


The data examined by MMW were col- 
lected from two sources. One was the 1910 
report of the National Monetary Commis- 
sion to Congress, compiled by A. Piatt An- 
drew and entitled "Statistics for the United 
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States: 1867-1909.” This volume contains 
weekly time loan rates from 1890 to 1909 
taken from the Commercial and Financial 
Chronicle (C&FC), a popular business mag- 
azine of the day. The data from 1910 to 
1933 were collected independently by MMW 
from the C&FC. For their empirical work, 
‘MMW created a monthly data series using 
time loan rates published in the first week 
of each month. 

In the C&FC, most of the time loan rates 
are market rates, but there are periods 
where the data are not reported (nr), re- 
ported as nominal (nom), or reported with 


the provision that a commission (com) is. 


paid to the lender. Andrew (p. 119) warned 
potential users of these data errors and 
omissions when he wrote: "It will be ob- 
served that at different times in the years 
prior to 1897 classified rates are lacking for 
time loans or for paper, or for both. These 
represent periods of disturbed conditions." 
A careful examination of the C&FC shows 
that this statement is also correct for a few 
years after 1897, particularly 1902, 1907, 
and 1918.? Table 1 provides a summary of 
these periods for the three- and six-month 
time loan rate series.? 


^Unfortunately the C&FC is inconsistent in its 
treatment of commission payments. Sometimes it would 
note the amount of commission paid, so that an effec- 
tive rate could be computed, and other times it would 
only report that commissions were involved. After 1905, 
however, it would generally, although not always, in- 
clude commissions in its quoted rates. An examination 
of the Wall Street Journal during the period reveals a 
similar reporting pattern. 

There is a story behind each of the dates identified 
in Table 1. For example, time loan rates were quoted 
as 6 percent nominally for four months during 1918. 
During this period, the government undertook two 
bond issues to finance its participation in World War I. 


These issues, known as the third and fourth Liberty 


Loan issues, were subscribed to by banks that resold 


them to their customers. There was significant concern ` 


surrounding the success of these issues because the 
Germans had begun a major offensive against Allied 
positions in an attempt to win the war. To guarantee 
that the Liberty bonds sold, the Liberty Loan Commit- 
tee, working with the Federal Reserve Board and the 
secretary of the treasury, pressured banks into limiting 
their issue of time loans. Accordingly, these time loan 
rates represent bids with almost no loans forthcoming 
at these rates. 
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` It is not an accident that loan rates are 
quoted at 6 percent or 6 percent plus a 
commission during these problem periods. 
The state of New York set the usury rate at 
6 percent between 1890 and 1933, and the 
National Banking Act required that all na- 
tional banks abide by the usury Jaws of the 
state in which they operated. National 
banks that violated usury ceilings could lose 
their charter, although none ever did. The 
comptroller of the currency was assigned 
the duty of monitoring national banks for 
usury ceiling violations. To avoid the usury 
limit, banks would charge a front-end com- 
mission to borrowers, but they would not 
quote an effective rate so as to avoid the 
unwanted attention of state authorities or 
the comptroller of the currency. 

Because of these reporting inconsisten- 
cies, the data series collected by MMW 
does not reflect true market rates. In gen- 
eral, MMW underestimate the true market 
rate by failing to allow for commission pay- 
ments. The important question, however, is 
whether these data errors will affect their 
claim that the beginning of the Federal Re- 
serve marked the beginning of a new 
stochastic structure of interest rates. This 
issue is addressed in the next section. 


II. New Estimates 


We have chosen the simplest approach 
possible to correct for the errors inherent in 
the time loan data; that is, we have defined 
the largest time interval whére the data are 
not censored to define our data series? Be- 


4See John James (1978, p. 80) for a discussion of 
usury laws during this period. The actual statute is 
reprinted in James Cahill (1923), Article 25, Section 
370. This statute exempts call loans over $5,000 from 
usury restrictions, but it does not exclude time loans. 

Other approaches to hese data errors are possible, 
but they are not free of problems. For example, one 
may simply exclude the “bad” data from the series and 
reestimate MMW's model. This creates a problem be- 
cause one does not know whether the excluded data 
define a break in the series, which is a possibility, 
particularly during the latter part of 1907 when call 
loan rates reached 125 percent while time loans were 
legally restricted to 6 percent. 'An alternative approach 
would be to develop a forecasting model to estimate 
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TABLE 1—Data ERRORS FOR THREE- AND S1ix-MontH Loan RATES 


Date 


Apr 12, 1890 
Apr 19, 1890 
Apr 26, 1890 
May 3, 1890 
May 10, 1890 
May 17, 1890 
May 24, 1890 
May 31, 1890 
Aug 16, 1890 
Aug 23, 1890 
Aug 30, 1890 
Sep 6, 1890 
Sep 13, 1890 
Sep 20, 1890 
Nov 15, 1890 
Nov 22, 1890 
Nov 29, 1890 
Dec 6, 1890 
Dec 13, 1890 
Jan 3, 1891 
May 23, 1891 
May 30, 1891 
Jun 6, 1891 
Jun 13, 1891 
Nov 14, 1891 
Jan 2, 1892 
Sep 24, 1892 
Oct 15, 1892 
Dec 3, 1892 
Apr 29, 1893 
Jul 1, 1893 
Jul 8, 1893 
Jul 15, 1893 
Jul 22, 1893 
Jul 29, 1893 
Aug 5, 1893 
Aug 12, 1893 
Aug 19, 1893 
Aug 26, 1893 
Apr 20, 1895 
Jan 4, 1896 
Jan 11, 1896 
Jan 18, 1896 
Jan 25, 1896 
Feb 1, 1896 
Feb 8, 1896 
Aug 8, 1896 
Aug 15, 1896 
Aug 22, 1896 
Aug 29, 1896 
Sep 5, 1896 
Sep 12, 1896 
Sep 19, 1896 
Oct 10, 1896 
May 26, 1899 
Oct 13, 1899 
Oct 27, 1899 
Dec 22, 1899 


Originally Reported 
(percent) 
Three-Month Six-Month 
nr 5.00 
nr 5.00 
nr 4.50 
nr 5.00 
nr 5.00 
nr 5.00 
nr 5.75 
nr 5.00 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6-- com 6-- com 
6-4- com 6+com 
6+com 6+com 
6-- com 6+com 
6+com 6+com 
nr 6.00 
nr 6.00 
5.75 nr 
nr 6.00 
nr 6.00 
5.25 nr 
4.00 nr 
4.50 nr 
6.00 nr 
nr 6.00 
6.00 nr 
6-4- com 6--com 
6+com 6+com 
6+com 6+com 
6+com 6+com 
nr - 6-4 com 
6+com 6+com 
6+com 6+com 
6+com 6+com 
6+com 6+com 
nr 4.25 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6 nom 6 nom 
6.00 6-- com 
nr 6+2 com 
6+2 com 6+2 com 
6+1 com 6+2 com 
6+1com ~ 6+2 com 
6+com 6+com 
6 nom 6 nom 
nr 3.75 
nr 6.00 
nr 6.00 
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MMW Reported 
(percent) 
Three-Month Six-Month 
ex ex 
ex ex 
ex ex 
4.50 5.00 
ex ex 
ex ex 
ex ex 
ex ex 
ex ex 
ex . ex 
ex ex 
6.02 6.00 
ex ex 
ex ex 
ex ex 
ex ex 
ex ex 
6.00 6.00 
ex ex 
6.00 6.00 
ex ex 
ex ex 
5.75 6.00 
ex ex 
ex ex 
4.00 4.75 
ex ex 
ex ex 
5.00 6.00 
ex ex 
6.00 6.00 
ex ex 
ex ex 
ex ex 
ex ex 
6.60 6.00 
ex ex 
ex ex 
ex ex 
ex ex 
6.00 6.00 
ex ex 
ex ex 
ex ex 
6.03 6.00 
ex ex 
ex ex 
ex ex 
ex ex 
ex ex 
6.00 6.00 
ex ex 
ex ex 


ex 
ex 
ex 
ex 
ex 
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TABLE 1 —(Continued) 








Originally Reported MMW Reported 
(percent) (percent) 

Date Three-Month Six-Month Three-Month Six-Month 
Jul 6, 1900 nr 4.25 3.25 4.25 
Nov 9, 1900 nr 4.75 ex i ex 
Feb 22, 1901 nr 4.00 . ex ex 
Mar 21, 1902 nr 4.50 ex ex 
Mar 28, 1902 nr 4.50 ex ex 
Apr 4, 1902 nr 4.00 4.25 4.00 
May 2, 1902 nr 4.25 4.50 4.25 
May 9, 1902 nr 4.87 ex ex 
Sep 19, 1902 6+1 com 6+1 com ex ex 
Sep 26, 1902 6+1 com 6+1 com ex ex 
Oct 3, 1902 6+com 6.00 6.25 6.00 ' 
Oct 10, 1902 6-- com 6-- com ex ex 
Oct 17, 1902 6--1com 6.00 ex ex 
Nov 21, 1902 6-- com . 0845 ex ex 
Dec 12, 1902 64- com 6.00 ex ex 
Jan 2, 1903 nr 6.00 5.25 5:25 
Mar 6, 1903 nr 5.25 5.25 5.25 
Apr 24, 1903 nr 4.67 ex ex 
May 22, 1903 nr 4.50 ex ex 
Oct 25, 1907 6.5 nom 6 nom ex ex 
Nov 1, 1907 14.00 nr 14.00 6.00 
Nov 8, 1907 13.50 nr ex ex 
Nov 15, 1907 13.50 nr ex ex 
Nov 22, 1907 13.50 nr ex ex 
Nov 29, 1907 13.50 nr ex ex 
Dec 6, 1907 nr nr 10.00 7.00 
Jul 9, 1909 2.37 nr ex ex 
Aug 5, 1910 3.87 nr 3.62 5.00 
Jul 31, 1914 6 nom 6 nom ex ex 
Mar 1, 1918 6 nom 6 nom 6.00 6.00 
Mar 8, 1918 6 nom 6 nom ex ex 
Mar 15, 1918 6 nom 6 nom ex ex 
Mar 22, 1918 6 nom 6 nom ex ex 
Mar 29, 1918 6.00 6 nom ex ex 
Apr 5, 1918 6.00 6 nom . 6.00 6.00 
Apr 12, 1918 6.00 6 nom ex ex 
Apr 19, 1918 6.00 6 nom ex ex 
Apr 26, 1918 6.00 6 nom ex ex 
May 3, 1918 6.00 6 nom 6.00 6.00 
May 10, 1918 6.00 6 nom ex ex 
May 17, 1918 6.00 6 nom ex ex 
May 24, 1918 6.00 6 nom ex ex 
May 31, 1918 6.00 6 nom ex ex 
Sep 6, 1918 6 nom 6 nom 6.00 6.00 
Sep 13, 1918 6 nom 6 nom ex ex 
Sep 20, 1918 6 nom 6 nom ex ex 
Sep 27, 1918 6 nom 6 nom ex ex 
Mar 11, 1933 Dr nr ex ex 


Source: Andrew (pp. 119-38); Commercial & Financial Chronicle, 1909-33. 

Notes: "ex" indicates that this week was excluded from the data collected by MMW; 
“nr” indicates that no rates were reported for this week; “nom” indicates that rates 
are quoted but no business is conducted; and "com" indicates that a commission is 
paid in addition to the legal rate of 6 percent. 
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TABLE 2—SwrrcH Dates Usinc MONTHLY DATA FOR INTEREST RATE MODEL 


Tip Kc pr, t Vua 


Three-Month Rate 
Jan 1908- Feb 1918 


Date -Log L 


1912: 131.4 
131.1 
130.8 
130.7 
130.2 
129.8 
129.5 
129.6 
131.7 
134.3 
137.2 
139.6 
1914: 7 139.2 
8 131.4 

9 132.7 

10 134.0 

134.9 
12 129.9 

1915: 1 130.0 
2 129.2 

3 130.0 

4 130.7 

2 131.5 
6 
7 


Lr 
NK Cw 00-10v UA 4 2 T2 S 


132.1 
132.9 


Six-Month Rate 
Sept 1910—Feb 1918 
Posterior Posterior 
Odds Ratio -Log L Odds Ratio 

0.111 63.4 0.277 
0.162 63.6 0.236 
0.212 63.0 0.415 
0.234 63.4 0.274 
0.372 62.9 0.469 
0.541 62.1 1.000 
0.757 63.2 0.340 
0.686 65.7 0.029 
0.087 66.0 0.020 
0.007 66.1 0.019 
0.000 66.0 0.020 
0.000 65.4 0.036 
0.000 67.5 0.047 
0.114 69.7 0.054 
0.031 67.5 0.047 
0.009 68.1 0.026 
0.004 68.3 0.020 
0.509 65.2 0.045 
0.485 65.5 0.035 
1.000 65.0 0.054 
0.456 65.5 0.034 
0.225 65.9 0.022 
0.105 66.3 0.015 
0.054 66.7 0.011 
0.026 67.0 0.075 


Notes: Log L is the log of the likelihood function. The posterior odds ratio is 


calculated assuming a diffuse prior. 


cause MMW selected the first week in each 
month, their three-month loan rate data are 
error-free for months between December 6, 
1907 and March 1, 1918. The six-month 
loan rate data are error-free for months 
between August 5, 1910 and March 1, 1918. 


the “unobserved” rates. This would create a problem 
when testing for a structural break, however, because 
the forecasting model is assumed to be correct over all 
of the data. If a break is found, the forecasting model 
must then be reestimated and another search for a 
break undertaken. This procedure is continued until 
the forecasting model and the structural break results 
are consistent with each other. We did not select the 
latter procedure because it introduces forecasting error 
into our tests and is not necessary given that our 
sample covered the relevant period. 


Other than the week ending July 31, 1914, 
the weekly data that we have collected are 
also error-free over these intervals. 

The monthly data may be analyzed be- 
tween these dates to search for a structural 
break. Alternatively, the weekly data may 
be used if a proxy can be found for July 31, 
1914. For this proxy, we use the first interest 
rate quoted during the following week. This 
proxy is quite reasonable because World 
War I effectively started on Friday, July 31, 
and the uncertainty concerning who would 
participate in the war continued into the 
following week. 

Table 2 reports the results of our tests for 
a structural break using monthly data. We 
estimate the step-switching model described 
by MMW (p. 366), and our results are pre- 
sented so that they are comparable to those 


VOL. 80 NO. 4 


reported in table 6 of their paper The 
switch dates are found when the logarithm 
of the likelihood function is maximized or, 
because we report the negative of the log 
likelihood to be comparable to MMW, when 
it is minimized. The three-month interest 
rate series produces a switch date in Febru- 
ary 1915, which is slightly later than that 
reported by MMW for the same regres- 
sions.’ In contrast, the switchpoint for the 
six-month series is June 1912, which is more 
than two years before the Federal Reserve 
System began operations.? 

Comparing the posterior odds ratios of 
the two periods shows that the June/July 
period during 1912 is a likely candidate for 
a break in the three-month series, but a 
similar claim may not be made for the pe- 
riod in late 1914 or early 1915 for the six- 
month series? Although these results 
suggest that the structure of three-month 
interest rates changed near the founding of 
the Federal Reserve, it is clear that the 
entire term structure was not similarly af- 


SThe models were also estimated with seasonal 
dummies included. The lack of degrees of freedom 
inhibited a complete search of the monthly and weekly 
samples, but when the regressions could be calculated, 
using all of the dummies, the switch point results were 
the same as those reported here. 

This is the second switch point found for the 
three-month data. The first switch point is August 
1909. Because the first switch occurs very early in the 
data, we searched for a second switch point Both 
switch points, however, are statistically significant, 
(F4, 121 = 13.98). 

8MMW do not report a switch date for the six-month 
series using the simple model with the rate lagged one 
period as the independent variable. Estimating this 
model for their data over the period 1890 to 1933 
reveals that the switch occurred in June 1901. 

?The switch point identified in 1912 may be due to 
the uncertain political climate at the time. President 
Taft was engaged in a tough battle for the Republican 
nomination with ex-President Roosevelt. The stock 
market appeared to favor the renomination of Taft, 
but the outcome of the fight was not known until the 
convention in June. After Governor Woodrow Wilson 
of New Jersey won the Democratic nomination on the 
46th ballot, taken July 2, 1912, and adopted a “radical” 
platform, Roosevelt reentered the race as a third-party 
candidate. The markets were alarmed at the turn of 
events because Wilson’s platform was viewed as anti- 
business. 
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fected. This is an important result because 
if the Federal Reserve System changed the 
stochastic behavior of interest rates perma- 
nently, as suggested by Gregory Mankiw 
and Jeffery Miron (1986), then one would 
expect that this change would be present in 
both the three-month and six-month rates. 

Table 3 refines the estimation of these 
switch points by using weekly data on three- 
and six-month interest rates. The three- 
month series produces a switch point during 
the first week in December 1914, while the 
six-month series suggests that the switch 
point occurs during the second week of June 
1912. Both of these results are in general 
agreement with the switch points found us- 
ing the monthly data series, although the 
six-month switch point is precisely where it 
is expected to be whereas the three-month 
switch point is two months away from the 
date predicted using monthly data. 

These findings offer only limited support 
for MMW’s conclusion that there is a struc- 
tural break in interest rates after the Fed- 
eral Reserve began operations on Novem- 
ber 16, 1914. The weekly results for the 
three-month series, which are the closest to 
supporting MMW’s hypothesis, suggest that 
the break occurred sometime during the 
first week of December 1914, not in the 
week the Federal Reserve went into busi- 
ness. This presents a causal dilemma be- 
cause there were other events affecting fi- 
nancial markets during the weeks before 
and after the opening of the Federal Re- 
serve System, and these events may be re- 
sponsible for the break in three-month rates. 

The event that immediately preceded the 
structural break identified in Table,3 for the 
three-month series is the reopening of bond 
trading on the New York Stock Exchange. 
Both the bond and stock markets provided 
ready sources for liquidity during 1914. Be- 
fore the onset of the war, the bond market 
traded nearly $61 million per month and 
the New York Stock Exchange averaged 
$574 million per month.? With the closing 


V CF&C, Bank & Quotation Section, September 5, 
1914, p. 19. 
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TABLE 3—Switcu Dates Usinc WEEKLY DATA FOR INTEREST RATE MODEL 


tipi 7 KT prot Vua 














Three-Month Rate 
Dec 13, 1907 to 


Six-Month Rate 
Aug 12, 1910 to 








Feb 22, 1918 Feb 22, 1918 
Posterior Posterior 
Date —LogL Odds Ratio Date —LogL Odds Ratio 
1914: Oct 2 217.9 0.013 1912: Apr 5 ~16.4 0.046 
Oct 9 216.5 0.054 Apr 12 — 18.8 0.071 
Oct 16 216.0 0.092 Apr 19 ~17.4 0.123 
Oct 23 216.4 0.059 Apr 26 — 16.2 0.040 
Oct 30 217.1 0.030 May 3 — 16.6 0.055 
Nov 6 217.8 0.015 May 10 ~ 16.9 0.074 
Nov 13 215.9 0.104 May 17 -174 0.127 
Nov 20 214.7 0.330 May 24 — 17.8 0.187 
Nov 27 214.9 0.268 May 31 — 18.3 0.325 
Dec 4 213.6 1.000 Jun 7 — 18.9 0.568 
Dec 11 214.2 0.569 Jun 14 —19.5 1.000 
Dec 18 214.7 0.325 Jun 21 —19.1 0.677 
Dec 25 215.3 0.187 Jun 28 —19.3 0.859 
Dec 31 215.8 0.107 Jul5 — 18.7 0.473 
1915: Jan 8 216.5 0.053 Jul 12 — 18.0 0.231 
Jan 15 216.1 0.084 Jul 19 ~17.2 0.105 
Jan 22 216.2 0.072 Jul 26 -—-17.7 0.168 
Jan 29 216.8 0.041 Aug 2 —16.9 0.077 
Feb 5 217.5 0.020 Aug 9 —174 0.128 
Feb 11 218.2 0.010 Aug 16 —17.9 0.221 
Feb 19 219.9 0.005 Aug 23 —184 0.361 
Feb 26 219.6 0.003 Aug 30 ~17.2 0.102 


Notes: Log L is the log of the likelihood function. The posterior odds ratio is 


calculated assuming a diffuse prior. 


of these markets, investors had to find li- 
quidity from other sources, primarily banks, 
thereby increasing interest rates. The bond 
market reopened immediately before the 
structural break identified for the three- 
month series in Table 3, on November 28, 
and immediately after this break, on De- 
cember 12, the New York Stock Exchange 
reopened. Because of the volume of trans- 
actions involved, these events may be partly 
responsible for the rapid decrease in inter- 
est rates during December 1914 and Jan- 
uary 1915 and thus may have precipitated 
the structural break in the three-month data 
series.!! 


UIF the reopening of the markets was important, 
then their closing is expected to be important too. We 
reestimated a switch point for the three-month data 
using the period June 21, 1912, to December 4, 1914. 
This period is just after the switch observed for the 
six-month data (the posterior odds ratio suggested the 


Using the monthly figures above, the 
transaction volume that was loss due to the 
closing of the stock and bond markets to- 
taled nearly $2.5 billion. This amount liter- 
ally swamps the transactions that were facil- 
itated by the Federal Reserve System in 
November and December of 1914. The Fed- 
eral Reserve System began operations with 
$247 million in assets, of which about $228 
milion represented reserve deposits from 
member banks and $18 million represented 
paid-in capital. The Federal Reserve did 
not act to inject these resources into the 
economy during 1914. As Table 4 shows, 
the amount of bills discounted and loans 


possibility of a switch in the three-month data around 


this date) and just before the switch observed for the 
three-month data. A significant break was found on the 
week ending July 31, 1914, which marks the beginning 
of World War I and the closing of the markets. 
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TABLE 4—AsSETS OF THE FEDERAL RESERVE SYSTEM, 1914 























(1,000s) 
Ratio of 
Bills Bills and Loans 
Total Discounted Total to Total Assets 
Date Cash and Loans Assets (percent) 
Nov 20, 1914 $241,408 $5,626 $247,158 23 
Nov 27, 1914 262,470 7,383 270,018 2.7 
Dec 4, 1914 262,932 9,844 273,084 3.6 
Dec 11, 1914 260,243 10,257 272,476 3.8 
Dec 18, 1914 258,287 9,043 269,990 3.4 
Dec 24, 1914 258,316 8,552 271,683 3.2 
Dec 31, 1914 255,647 10,593 277,844 3.8 











Source: Exhibit K, Annual Report, Federal Reserve Board, 1914. 
Notes: Total cash includes gold coin and certificates, legal-tender notes, silver certifi- 
cates, and subsidiary coin. The difference between total assets and cash plus bills and 


loans equals miscellaneous assets. 


made represented less than 4 percent of the 
resources of the system. The Federal Re- 
serve literally kept nearly 96 percent of its 
resources idle during crucial months at the 
beginning of World War I.!* 

Secretary of the Treasury William 
McAdoo did more to provide liquidity to 
the banking system and facilitate commerce 
than did the Federal Reserve System. With 
the stock exchanges closed, public panic led 
to hoarding of currency. By moving cuickly 
in early August 1914, Secretary McAdoo, 
with the assistance of Congress, authorized 
the issue of $500 million in emergency cur- 
rency as provided for by the Aldrich-Vree- 
land Act of 1908. The cost of emergency 
currency was only 3 percent initially, where- 
as the New York clearinghouse charged 6 
percent for clearinghouse loan certificates 
and the New York Federal Reserve bank 
charged 6 percent to discount loans with a 


The performance of the Federal Reserve System 
was not much different during 1915. At the end of 
December 1915, only 12.1 percent of the system’s as- 
sets were committed to bills and securities (Federal 
Reserve Board, Banking and Monetary Statistics: 
1914 — 1941, p. 330). 

“The emergency currency was identical to national 
bank notes issued after 1908, and thus wes easily 
introduced into circulation. National banks fcund the 
emergency currency appealing because it met their 
demand for liquidity, which was hampered by public 
hoarding, and it was a cheap source of capital. 


maturity greater than 30 days.!^? It is no 
wonder then that the discount operations of 
the Federal Reserve were limited to about 
$10 million during 1914, while the U.S. 
Treasury had “sold” over $382 million in 
emergency currency, an amount that repre- 
sented 10.2 percent of the existing money 
stock. 

Although one may debate whether or not 
the events discussed above resulted in a 
permanent change in the stochastic struc- 
ture of interest rates, it is clear that they 
cast doubt on the claim that the founding of 
the Federal Reserve System was the pri- 
mary factor. 


III. Conclusions 


The data on time loan rates used by 
MMW are subject to error, primarily be- 
cause of usury laws and reporting inconsis- 


“The Wail Street Journal, August 12, 1914, com- 
mented on the issue of emergency currency at 3 per- 
cent: ““Emergency currency’ under present conditions 
is a misnomer. What it really is, is a makeshift and 
temporary rediscount market.” 

The discount rate at the New York Federal Re- 
serve Bank was 6 percent from November 16, 1914, to 
December 18, 1914. On December 18, 1914, the dis- 
count rate dropped to 5.5 percent (Exhibit M, Annual 
Report, Federal Reserve Board, 1914). 

Comptroller of the Currency, Annual Report, 1915, 
Vol. I, p. 90. 
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tencies. Using a subset of these monthly 
data, which is error-free, and a weekly data 
set, we find that the six-month series does 
not support their claim that there was a 
regime change in 1914 and that the three- 
month series offers only superficial support. 
These findings suggest that the economic 
events during the period did not affect the 
term structure similarly. 

A careful examination of the events sur- 
rounding the break point in the three-month 
series suggests that the reopening of the 
stock and bond markets and the issue of 
emergency currency may have affected the 
behavior of three-month rates. In addition, 
as an operating entity, the Federal Reserve 
was largely benign during 1914, using less 
than 4 percent of its resources to stimulate 
the economy. In total, these findings suggest 
that the birth of the Federal Reserve Sys- 
tem was less important than MMW have 
claimed. 
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The Adjustment of Expectations to a Change in Regime: 
‘Reply 


By N. GREGORY Mankiw, JEFFREY A. Miron, AND Davin N. Wern” 


In their comment on our paper (1987), 
Raymond Fishe and Mark Wohar (1990) 
dispute our conclusion that the founding of 
the Federal Reserve caused a change in the 
monetary regime shortly after 1914. They 
make two unrelated points. First, they note 
that our interest rate data are contaminated 
by measurement error, suggesting that our 
results on the timing of the regime change 
may be an artifact of this measurement er- 
ror. Second, assuming that we were correct 
that the change in regime occurred in late 
1914 or early 1915, they argue that the 
Federal Reserve. could not have been re- 
sponsible for the change. 

In this reply we first show that alternative 
ways of dealing with the measurement error 
problem, including the method proposed by 
Fishe and Wohar, do not substantially alter 
the results in our original paper. We then 
argue that their attribution of the change in 
regime to factors other than the founding of 
the Fed is implausible because the change 
in the behavior of interest rates was perma- 
nent, while these other factors were transi- 
tory. In the third section we examine an 
anomaly raised by the Fishe and Wohar 
results. l 


I. Measurement Error 


Fishe and Wohar focus on the issue dis- 
cussed by Jeffrey Miron (1989) that usury 
laws imposed a ceiling of 6 percent on the 
reported interest rate. We agree that this 
ceiling presents a problem with the data by 
making the reported interest rate differ from 
a true market interest rate for certain ob- 
servations. In particular, for about 2 percent 
of the observations in our sample, our data 


* Departments of Economics, Harvard University, 
Cambridge, MA, 02128; Boston University, Boston, 
MA, 02215; Brown University, Providence, RI 02912. 
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include a reported interest rate of 6 percent 
when the true market interest rate was 
above 6 percent.! 

When using the data to estimate the tim- 
ing of the change in regime, there are alter- 
native ways to treat the questionable obser- 
vations. In our original paper, we simply 
included all the data. Because the data near 
to the estimated switch date do not suffer 
from this measurement error problem, there 
is no obvious reason to suspect the estimate 
of the switch date. 

Fishe and Wohar suggest that one should 
restrict the sample to the longest stretch 
around the founding of the Fed in which 
there are no questionable observations. The 
problem with this method is that the re- 
maining sample period is extremely short, 
implying that one is excluding a large num- 
ber of perfectly good observations. Yet even 
using this procedure, Fishe and Wohar ob- 
tain essentially the same result that we orig- 
inally reported for the switch date for the 
stochastic process followed by the three- 
month interest rate. 

Another way to deal with measurement 
error is to exclude those observations for 
which the reported interest rate is suspect. 
This technique allows one to use most of 
the observations. When we implement this 
procedure for the three-month interest rate 
process, we find that the switch date is 
December 1914 when we exclude month 
dummies and February 1915 when we in- 
clude month dummies. For the Modigliani- 
Sutch equation that relates the six-month 
rate and the three-month rate, the switch 
date is June 1915 when we exclude or in- 


When comparing the data we used with the data 
collected by Fishe and Wohar, we also discovered 
several coding errors in our data. These errors are 
usually very minor and do not influence any of our 
reported results. The corrected data are available upon 
request. 
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clude month dummies. These results are 
almost identical to those in our original 
paper. 

None of these methods of dealing with 
the measurement error in the data is ideal. 
Yet the results we reported are robust to 
the alternative methods. There is no evi- 
dence that any of the results we reported 
are attributable to measurement error. 


IL The Cause of the Change in Regime 


Fishe and Wohar argue that the founding 
of the Fed was not the cause of the change 
in stochastic process driving interest rates. 
They point out that many other economic 
events, such as the outbreak of World War 
I, the closing and reopening of the bond 
and stock markets, and the issue of emer- 
gency currency, were occurring at about the 
same time as the opening of the Fed. They 
suggest that one of these other events may 
have been responsible for the change in the 
behavior of interest rates. 

We recognized this possibility in our orig- 
inal paper, where we wrote: 


The year 1914 also saw the outbreak 
of World War I. Our estimates of the 
stochastic process of the short-term 
interest rate indicate that the short 
rate followed essentially the same pro- 
cess in the 1915-18 period as in the 
1919-33 period. It appears, therefore, 
that the war was not itself the major 
factor of the regime change. 


All of the events that Fishe and Wohar 
document were transitory in nature and 
therefore cannot explain a permanent 
change in regime. 

An alternative explanation for the perma- 
nent change in regime is the abandonment 
of the Gold Standard, as suggested by Tru- 
man Clark (1986). We examined this possi- 
bility in Robert Barsky et al. (1988), where 
we discussed theoretical and empirical rea- 
sons to doubt the gold standard explana- 
tion. The resolution of this question, how- 
ever, is not crucial to the central conclusion 
of our original paper. In that paper, we 
noted that “while our econometric results 


below point to the founding of the Fed 
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rather than the abandonment of the Gold 
Standard as the likely cause of the regime 
change, our analysis of the adjustment of 
expectations does not rely on the Fed being 
the source of the change.” 


III. The Change in the Process of the 
Six-Month Rate 


Fishe and Wohar point out that the switch 
date for the univariate process for the six- 
month interest rate is different from the 
switch dates for the three-month interest 
rate process and for the Modigliani-Sutch 
equation. This anomaly is present using our 
original data and using either their sug- 
gested correction for measurement error or 
the correction that we prefer. 

All three techniques do produce local 
peaks in the likelihood function around the 
founding of the Fed. This finding leads us to 
believe that while one of the factors deter- 
mining the stochastic process of the six- 
month rate (that is, the process for the 
three-month rate) did change when the Fed 
was founded, other determinants of the pro- 
cess for the six-month rate changed at other 
times. To make this point more concretely, 
consider the model for the six-month rate 
presented in our earlier paper, 


(1) R,=1/2 (1+ p5)r, + 6,, 


where R, is the six-month rate, r, is the 
three-month rate, p is the autoregressive 
parameter of the three-month rate, and 6, 
is the term premium. Our analysis focused 
on whether there was a change in the coef- 
ficient on the short rate in this equation. 
The univariate process for R, changes 
when either the process for r, or the pro- 
cess for 0, changes. If these processes 
change at different times, and one estimates 
a switching regression for the univariate 
process for the long rate, either switch date 
may be found. The focus of our original 
paper was the behavior of the short rate 
and the relationship between the long and 
the short rate. We found that these two 
equations changed in late 1914 or early 1915. 
We interpreted the results as a change in 
the actual value and the public’s perception 
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of p. The finding that the univariate process 
for the six-month rate switched at a differ- 
ent point in time suggests that there also 
may have been changes in the process fol- 
lowed by 8, Yet this finding in no way 
vitiates our result that the behavior of the 
term structure changed to reflect the new 
process followed by the short rate around 
the time that the Fed was founded. 


IV. Conclusion 


We have learned two things by reflecting 


on Fishe and Wohar’s comment on our 1987 . 


paper. First, the results in our original pa- 
per on the timing of the change in the 
behavior of interest rates are not affected by 
the measurement error that arises because 
of state usury laws. Indeed, we are now 
even more confident in our results, because 
Fishe and Wohar have shown that using 
weekly rather than monthly data on three- 
month interest rates produces the same 
findings on the nature and timing of the 
regime change. Second, the univariate be- 
havior of six-month interest rates appears 
anomalous when compared to the behavior 
of three-month interest rates and the behav- 
ior of the Modigliani-Sutch term structure 
equation relating six-month and three- 
month rates. 

On the question of what caused the be- 
havior of interest rates to change so dramat- 
ically in 1914, we remain convinced that the 
Fed is the most likely culprit. Since the 
elimination of transitory movements in 
interest rates in 1914, short-term interest 
rates have been close to a random walk 
(N. Gregory Mankiw and Miron, 1986), Be- 
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cause this behavior of interest rates has 
persisted throughout the Fed’s 75-year his- 
tory despite many other changes in eco- 
nomic policy, the Fed seems the most likely 
cause of the change in the behavior of inter- 
est rates. | 
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KATHERINE ScurPPER, University of Chicago 
Wayne Mann, Tulane University 


8:00 A.M. LATIN IMMIGRATION AND THE U.S. Economy IN THE 1990's (Joint Session with Hispanic Professors of 
Economics and Business) 
Presiding: JonGE SALAZAR-CaRILLO, Florida International University 
Papers: Francisco Rivera-Batiz, Rutgers University-New Brunswick 
Puerto Rican Migration and Its Impact 
GiLBert CARDENAS, Pan-American University 
Mexican Immigration and U.S, Labor Markets 
ANTONIO JoRGE AND Raut Moncarz, Florida International University 
Cuban Immigration into the United States and Minority Business Development 
Discussants: CoRpELIA Reimers, Hunter College 
ROGER Betancourt, University of Maryland 


10:15 A.M. Women 1N Economics 
Presiding: NANCY M. Gorpon, Congressional Budget Office 
Papers: Ivy E. Broper, National Science Foundation and American University 
New Evidence on Wage Differentials in Economics 
REBECCA BLANK, Northwestern University 
The Effect of Blind Refereeing in Economics 
June O'NEILL, Baruch College, anp NACHUM SICHERMAN, Ruigers University-New Brunswick 
Is the Gender Gap in Economics Declining? 
Discussants: CLAUDIA GorpiN, Harvard University 
ARLENE Ho en, U.S. Office of Management and Budget 
Marvin Kosters, American Enterprise Institute 


10:15 A.M. Wuy ARE Prices Sticky? EARLY RESULTS FROM AN INTERVIEW STUDY 
Presiding: RoBERT M. SoLow, Massachusetts Institute of Technology 
Papers: ALAN S. BLINDER, Princeton University 
Why Are Prices Sticky? Early Results from an Interview Study 

Discussants; ROBERT SHILLER, Yale University 

Rosert J. GonpoN, Northwestern University 

Rosert E. Hatt, Stanford University 

Rosert M. Sotow, Massachusetts Institute of Technology 


10:15 A.M. THE MEASUREMENT OF INPUT QUALITY AND PRODUCTIVITY 
Presiding: ERNsT BERNDT, Massachusetts Institute of Technology and National Bureau of Economic 
Research 
Papers: CHARLES R. Hutren, University of Maryland and National Bureau of Economic Research 
Embodied versus Disembodied Technical Change in Two-Digit U.S. Manufacturing Industries 
Larry RosENBLUM, Epwin Dean, Mary JABLONSKI, AND Kent Kunze, U.S. Bureau of Labor 
Statistics ; 
Heterogeneous Labor Inputs and the Measurement of Prodactivity 
Ernst R. BERNDT, Massachusetts Institute of Technology and National Bureau of Economic 
Research, CATHERINE J. Morrison, Tufts University and National Bureau of Economic Re- 
search, AND Davip O. Woop, Massachusetts Institute of Technology 
Assessing the Productivity of Information Technology Equipment in U.S, Manufacturing 
Industries 
Discussants: MICHAEL IntRILIGATOR, University of California-Los Angeles 
Finis WELCH, University of California-Los Angeles 
Martin N; Barry, University of Maryland 
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10:15 A.M. Tort Law As A REGULATORY SYSTEM 


Presiding: CHRISTOPHER DEMurH, American Enterprise Institute 
Papers: SUSAN RosE-ACKERMAN, Yale University 
Tort Law in the Regulatory State 
Rosert Lrran, Brookings Institution 
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The Impact of Tort Liability on Safety and Innovation: Lessons from the Scientific Community 


GLENN BLACKMON AND RICHARD ZECKHAUSER, Harvard University 
The Regulation of Automobile Liability Insurance: Lessons from Massachusetts 
Discussants: Rocer Nott, Stanford University 
GroncE Eaps, General Motors Corporation 
ROBERT CRANDALL, Brookings Institution 


10:15 A.M. DIFFUSION or DEVELOPMENT 


10:15 A.M. 


10:15 A.M. 


10:15 A.M. 


10:15 A.M. 


Presiding: EpwARD Wo rr, New York University 
Papers: RicHarp R. Netson, Columbia University 
Diffusion of Development: Western Europe and North America 
MARSHALL GoLpMaN, Wellesley College and Harvard University 
Diffusion of Development: Eastern Europe 


ALICE H. AMSDEN, New School for Social Research and Massachusetts Institute of Technology 


Diffusion of Development: East Asia 
Discussants: PAUL Davin, Stanford University 
Donard Hannis, Stanford University 
Henry Rosovskv, Harvard University 


SocIAL VALUES AND THE TRANSFORMATION OF EASTERN Europe (Panel Discussion) 
Presiding: ABRAM BERGSON, Harvard University 
Panel: KENNETH Arrow, Stanford University 
Janos Kornal, Harvard University 


SEMINARS ON RECENT Topics AND TECHNIQUES* 
Speaker: JEAN TiRoLE, Massachusetts Institute of Technology 
New Developments in Industrial Organization 


WHY IS THE INTELLECTUAL DEBATE ON MACROECONOMICS SucH A SHAMBLES? (Roundtable) 
Presiding: Francis M. Bator, Harvard University 
Panel: CHARLES SCHULTZE, Brookings Institution 
Jonn TAYLOR, Stanford University 
JOHN Berry, Washington Post 
ALAN Murray, Wall Street Journal 


THe Economics or NUCLEAR POWER 
Presiding: Jonn L. SoLow, University of Iowa 
Papers: James Hew ett, U.S. Department of Energy 
The Cost of Nuclear Power: Evidence from the United Kingdom 
Mark McCase AND RicHARD Lester, Massachusetts Institute of Technology 


Principals, Agents, and the Learning Curve: The Case of Nuclear Power Plant Construction 


GEOFFREY ROTHWELL AND J. BRADFORD JENSEN, Stanford University 
Information Structures for Maintaining and Operating Nuclear Reactors 
Discussants: EDWARD KOKKELENBERG, State University of New York-Binghamton 

ANTHONY KRAUTMANN, DePaul University i 
Jonn MARSHALL, University of California-Santa Barbara 


10:15 A.M. Tue ROLE or TRADE IN NORTH AMERICAN INTEGRATION (Joint Session with the North American 


Economics and Finance Association) 
Presiding: Manmoop A. Zaim, University of Minnesota 
Papers: Luis Ernest DeRBEz, World Bank 
Trade Liberalization Policies in the North America Context 


Harry P. Hurzina, Stanford University, anp K. C. Fona, University of California-Santa Cruz 


The Free Trade Agreement and Union Wages in Canada 
CoriN I, BRApFoRD, World Bank 
A Global Trade and Growth Strategy for Correcting U.S. Imbalances 
Discussants: RuptaeR W. DonNBUSCH, Massachusetts Institute of Technology 
GEORGE VON FunsrENBERO, Indiana University 
RicHARD B. Freeman, Harvard University 
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10:15 A.M. CowrLicr AND Peace Economics II: Disarmament EFFECTS AND NATIONAL Prorecrion (Joint 
Session with the Peace Science Society International) 
Presiding: WALTER Isarn, Cornell University 
Papers: GERALD F. Apams, University of Pennsylvania 
Macroeconomic Effects of Disarmament in the Global Economy 
Jon HAVEMAN, ALAN V. DEARDORFF, AND Rosert M. Stern, University of Michigan 
The Economic Effects of Unilateral and Multilateral Reductions in Military Expenditures in 
the Major Industrialized and Developing Countries 
WiLLtAM G. SHEPHERD, University of Massachusetts 
Efficient Levels of Post-Military Spending on National Prctection 
Discussants: PETER H. Pauty, University of Toronto 
SHERMAN Rosinson, University of California-Berkeley 


10:15 A.M. Wat REMAINS OF COMPARATIVE ADVANTAGE? (Roundtable) 
Presiding: DoMINICK SALVATORE, Fordham University 
Papers: JacpDisH BuaGcwati, Columbia University 
Level Playing Field and Comparative Advantage 
PauL Krucman, Massachusetts Institute of Technology 
Do New Trade Theories Require New Trade Policies? 
WILLIAM Branson, Princeton University 
Heckscher-Ohlin is Alive and Well and Living in the Pacifiz! 
RoBERT BaLpwin, Univeristy of Wisconsin 
Where Do We Stand on Comparative Advantage? . 


12:30 P.M. AEA/AFA Joist LUNCHEON 
Presiding: Tuomas C. ScHELLING, University of Maryland 
Speaker: (To BE ANNOUNCED) 


2:30 P.M. Patent RACES AND TECHNOLOGY TRANSFER 
Presiding: DEBRA Aron, Northwestern University 
Papers: Reiko Aoxt, Ohio State University 
,R&D Competition for Product Innovation: A Race Without End 
RICHARD JENSEN, University of Kentucky, ANp Marte THURsBY, Purdue University 
Patent Races, Product Standards, and International Competition 
Beru ANNE TERCEK, Boston College 
North-South Technology Transfer in the Context of International Returns to Scale 
Discussants: JosepH E. STiGLITZ, Stanford University 
JENNIFER F. REINGANUM, University of Iowa 
PAuL Romer, Center for Advanced Studies in Behavioral Sciences 


2:30 P.M. SrABILIZATION'AND REFORM IN EASTERN EUROPE AND LATIN AMERICA 
Presiding: KENNETH FRoor, Massachusetts Institute of Technology 
Papers: RoNALD McKinnon, Stanford University 
Financial Control During the Transition from a Centrally Planned to a Market Economy 
Jonn CocHRANE, University of Chicago, anp Barry W. Ickes, Pennsylvania State University 
Stopping Inflation in Reforming Socialist Economies 
Sepastian Epwarps, University of California-Los Angeles 
Stabilization Experience in Latin America: Lessons for Eastern Europe 
Tuomas Wo tr, International Monetary Fund 
Stabilization and Adjustment in Eastern Europe 
Discussants: RUDIGER DonNBUscH, Massachusetts Institute of Technology 
Errana Carposo, Tufts University 
Jan Svernar, University of Pittsburgh 
Lawrence SUMMERS, Harvard University 


2:30 P.M. Can Sanctions SAVE FREE TRADE? 
Presiding: C. MicuaEL Ano, Council on Foreign Relations 
Papers: THomas O. Bayarp, Institute for International Economics 
Reciprocity and Retaliation: Should Might Make Right? 
Mark Levinson, Journal of Commerce 
Can Sanctions Save Free Trade? 
KENNETH Ove, Swarthmore College and Massachusetts Institute of Technology 
Economic Discrimination and Political Effectiveness: In Defense of Bilateral Sanctions © 


al? 
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Discussants: AviNAsH Dixrr, Princeton University 
C. MicHAEL Axo, Council on Foreign Relations 


2:30 P.M. THe DEMAND For Money REVISITED 
Presiding: James Boucuron, International Monetary Fund 
Papers: Davip LAIDLER, University of Western Ontario 
The Demand for Money: An Empirical Overview 
Bennett McCarLuM, Carnegie Mellon University 
The Demand for Money: A Theoretical Overview 
James BouGHTON, International Monetary Fund 
The Demand for Money: International Comparisons 
Discussants: STEPHEN GOLDFIELD, Princeton University 
Dare Henperson, Federal Reserve Board 
Ray Farr, Yale University 


2:30 P.M. CuiNEsE Economic Rerorms, 1979-1989: LESSONS FOR THE FUTURE 
Presiding: GREGORY C. Cuow, Princeton University 
Papers: Rocer H. Gorpon, University of Michigan 
Chinese Enterprise Behavior under the Reforms 
Barry NAuGHTON, University of California-San Diego 
Why Has Economic Reform Led to Inflation? 
^ Terry Sicucar, Harvard University 
Moving Toward Market Allocation in Agriculture: Potential Problems 
Discussants: LAWRENCE J. Lau, Stanford University 
RicHARD D. Portes, Birkbeck College 
GERSHON Feber, World Bank 


2:30 P.M. TEACHING COLLEGE ECONOMICS 
Presiding: STEPHEN Bucks, Joint Council on Economic Education 
Papers: Joun Srecrriep, Vanderbilt University, Rosin Bartiett, Denison University, W. LEE 
Hansen, University of Wisconsin-Madison, ALLEN C. Ketrey, Duke University, DoNALD 
McCLoskey, University of Iowa, AND THOMAS TIETENBERG, Colby College 
The B^ Economics Major: Can and Should We Do Better? 
WirLiAM Becker, Indiana University, RoperT HiGHsMrTH, Joint Council on Economic Educa- 
tion, PErER Kennepy, Simon Fraser University, anb WILLIAM WALsTAD, University of Nebraska 
A New Agenda for Research on Teaching College Economics 
Puur SAUNDERS, Indiana University 
The Newly Revised Test of Understanding College Economics 
Discussants: DAvip COLANDER, Middlebury College 
Craupia Gorom, Harvard University 
ALAN BLINDER, Princeton University 
Eric HaNusHEk, University of Rochester 


2:30 P.M. LABOR SuPPLv 
Presiding: Davin BLoom, Columbia University 
Papers: SHELLY LUNDBERG, Princeton University, ANp RoBert PLotnick, University of Washington 
Teenage Childbearing Decisions: Do “Opportunity Costs” Matter? 
Lori BoLLINGER, University of Pennsylvania 
Diffusion and Adoption of Contraceptive Technology in Developing Countries 
MANOUCHEHR Mokurani, University of Maryland-College Park, AND PAUL R. GREGORY, Univer- 
sity of Houston 
Backward Bends, Quantitative Constraints, and Labor Supply: Evidence from the Soviet 
Interview Project 
Fints Werc, Unicon Research Corporation, anp Kevin Murpuy, University of Chicago 
Wages and Participation in the 1980's 
Discussants: SANDERS KORENMAN, Princeton University 
Davip Loam, University of Michigan 
Gary Burtcess, Brookings Institution 
Davip BLoom, Columbia University ` 


2:30 P.M. SMOKING, NUTRITION, AND HEALTH 
Presiding: Nancy Gorpon, Congressional Budget Office 
Papers: GERARD Russo, University of Hawaii 
The Demand for Physicians' Services and the Price of Cigarettes 
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W. Kip Viscusi, Duke University 
Perception of Smoking Risks and Cigarette Smoking Behavior 
PaAuriNE M. Ippotrro AND Aran D. Matuios, Federal Trade Commission 
Information, Advertising, and Health Choices: A Study of the Cereal Market 
Discussants: JANET B. MiTcHELL, Center for Health Economics 
WitLARD G. Manning, University of Michigan 
SugLLEv I. Wurre-MaaNs, Memphis State University 


2:30 P.M. OPTIONS AND BUBBLES: EMPLOYING EMERGING METHODOLOGIES TO EXAMINE QUESTIONS IN FINAN- ,. 
CiAL Economics (Joint Session with the Economic Science Association) 
Presiding: Jonn DickHaut, University of Minnesota 
Papers: JOHN O'BRIEN AND SANJAY SRIVASTAVA, Carnegie Mellon University 
Writing Options in the Laboratory 
Hersu M. Suerrin, University of Santa Clara 
Prospect Theory as a Basis for Examining Behavior with Call Options 
VERNON Smita, University of Arizona 
Bubble Crashes in the Laboratory: A Summary of What We Know Plus Some New Evidence 
Discussants: CHESTER Spatr, Carnegie Mellon University 
DaNIEL FRIEDMAN, University of California-Santa Cruz 


2:30 P.M. MAINTAINING INFRASTRUCTURE AND Fiscat CREDIBILITY IN URBAN AMERICA (Joint Session with the 
National Tax Association) " à 
Presiding: Roy BAHL, Georgia State University Mus 
Papers: MicHAEL E. BELL, Johns Hopkins University 
Infrastructure and Environmental Quality Requirements for Sustainable Growth 
Heren F. Lapp, Duke University and Lincoln Institute of Land Policy 
Fiscal Implications of Growth and Decline in U.S. Cities 
Rosert A. Bonm AND DEBORAH VAUGHN Nestor, University of Tennessee 
Characteristics of the Solid Waste Crisis and Incentives to Create Capacity 
Discussants: GEORGE Presko, Northeastern University 
SaLLv WarLAcE, U.S. Department of the Treasury 
Joun M. Quicey, University of California-Berkeley 


4:45 P.M. RicHarp T. ELY LECTURE 
Presiding: Tomas C. ScuELLING, University of Maryland 
Speaker: GEORGE A. AKERLOF, University of California-Berkeley 
Procrastination, Indoctrination, and Obedience 


Saturday, December 29, 1990 


8:00 A.M. GENDER AND PRODUCTIVITY 
Presiding: FRANCINE Blau, University of Illinois-Urbana-Champaign 
Papers: SOLOMON POLACHEK, State University of New York-Bingkamton 
An Analysis of Recent Trends in the Male-Female Wage Gap 
Jon: HEnscu, University of Wyoming 
Gender Differences in Wages: The Role of Human Capital, Working Conditions, and House- 
work 
Joun MurLauv, Trinity College, AND Jopy L. SiNpELAR, Yale University 
Substance Abuse and Gender Differences in Productivity 
Discussants: FRANCINE BrAu, University of Illinois-Urbana-Champaign 
Laurie Bassi, Georgetown University 
JEAN MITCHELL, Florida State University 


8:00 A.M. EFFECTS OF TAXATION : 
Presiding: Jane G. GRAVELLE, Congressional Research Service 
Papers: WitLiIAM M. Sose, University of North Carolina 
The Effects of Taxation on Capital Gains Realizations: Theory and Evidence 
WiLLIAM GALE, University of California-Los Angeles, AND Jonn Kari ScHorz, University of 
Wisconsin-Madison 
The Effects of IRA's on Household Savings 
JAMES ANDREONI AND JOHN Kanr Scuorz, University of Wisconsin- Madison 
An Econometric Analysis of Charitable Giving and Interdependent Preferences 
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Discussants: LEONARD BURMAN, Congressional Budget Office 
BanBARA Mann, U.S. Department of the Treasury 
DANIEL FEENBERG, National Bureau of Economic Research 


8:00 A.M. ENERGY INVESTMENTS IN DEVELOPING COUNTRIES: ENVIRONMENTAL IssUES AND POLICY 
APPROACHES . : 
Presiding: Corazon M. Sippayao, World Bank 
Papers: Marcia Gowen, Winrock 
Investments in Biomass Fuels: Environment and Policy Issues 
Canor Daut, Louisiana State University 
Energy Demand Elasticities in the Developing World and Their Implications for Global 
Environmental Issues 
RALPH W. HUENEMANN, University of Victoria 
Cost/ Benefit Analysis of a Power Project in the Three Gorges Dam in China: Environmental 
Implications of a Persistent Error 
GuNTER ScHRAMM, World Bank 
Incorporating Uncertainty in Addressing Environmental Issues in Power Section Investments 
Jonn SHEERIN, U.S. Department of State, Kirk R. Smita, East-West Center, anb Corazon M. 
SippAvAo, World Bank 
The Global Warming Issue: Policy Approaches to Developing Countries’ Problems 
Discussants: MOHAN MUNASINGHE, World Bank 
Uziet Nocuerra, Inter-American Development Bank 


8:00 A.M. ALTRUISM: EMPIRICAL STUDIES 
Presiding: RoBERT Linpsay, New York University 
Papers: RoBERT H. FRANK, Cornell University 
Are Economists Good Citizens? f 
Jonn Curis AND ALAN Lewis, University of Bath, United Kingdom 
United Kingdom Ethical Investments: A Case Study 
CHARLOTTE D. PHELPs, Temple University 
Money, Love, and Happiness 
Discussants: MICHAEL WALDMAN, University of California-Los Angeles 
RANDOLPH WESTERFIELD, University of Southern California 
SHosHONA A. GROSSBARD-SHECHTMAN, San Diego State University 
PauL L. WacHTEL, City University of New York 


8:00 A.M. Economics IN SPACE 
Presiding: LiNpA Conen, University of California-Irvine 
Papers: MoLLrE MACAULEY AND MICHAEL Toman, Resources for the Future 
Eye' in the Sky: The Economics of Remote Sensing 
Harvey Levin, Hofstra University 
Joint Ventures, Acquisitions, and Market-Type Transactions in Trading Orbit Spectrum 
Assignments in the Space Satellite Industry 
NriL DoueErty, University of Pennsylvania 
Risk-Bearing and Moral Hazard in Space 
LinpA Conen, University of California-Irvine, Susan EDELMAN, Columbia University, AND 
Rocer Not, Stanford University 
American Commercial Aerospace Transport: A Political Economic Perspective 
Discussants: ARTHUR DeVany, University of California-Irvine 
Nancy Rose, Massachusetts Institute of Technology 
JAMES DEARDEN, Lehigh University 


8:00 A.M. ArcoHoL AND PuBLic PoLicy 
Presiding: Joun Mu iauy, Trinity College and Resources for the Future 
Papers: FRANK .J. CHALOUPKA, University of Illinois-Chicago and National Bureau of Economic 
Research, MicHAEL Grossman, City University of New York Graduate Center and National 
Bureau of Economic Research, anD Henry SarrEnR, Kean College of New Jersey and National 
Bureau of Economic Research 
Alcohol, Regulation, and Motor Vehicle Mortality 
Putur J. Cook AND MICHAEL J. Moore, Duke University 
Drinking and Earnings 
Jopy SINDELaR, Yale University, anp Jonn MuLiany, Trinity College and Resources for the 
Future 
Alcohol Abuse, Alcohol Dependence, and Productivity 
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Discussants: CHARLES PHELPs, University of Rochester 
WitLARD G. Mannino, University of Michigan 
PauL GERTLER, Rand Corporation 


8:00 A.M. Loca. Pusiic FINANCE 
Presiding: Jonn YINGER, Syracuse University 
Papers: Ropert M. ScHwAB AND WALLACE E. Oates, University of Maryland 
Community Composition and the Provision of Local Public Goods: A Normative Analysis 
Diir BHATTACHARYYA AND ROBERT W. WassMER, Wayne State University 
Taxation and Expenditure Decisions of Elected Local Officials 
CHARLES A. M. DE BARTHOLOME, New York University 
The Fiscal Effect of Community Composition on Public Expenditure and Welfare 
Discussants: GARY J. Reip, University of Southern California 
Davin J. Ssoquist, Georgia State University 


8:00 A.M. U.S.~Korga Trane Rzrariows (Joint Session with the Korea-America Economic Association) 
Presiding: E. Kwan CHor, Iowa State University 
Papers: Joosunc Jun, Yale University 
Tax Policy and Korean Direct Investment in the U.S. 
Danny M. Lerpzicer, World Bank 
New Issues in Korean Trade Policy 
Won W. Koo, North Dakota State University ` 
Trade Liberalization for Agricultural Products in Korea and Its Implications on U.S. Agricul- 
tural Exports 
Marcus Norawp, Institute for International Economics 
The Impact of Prospective Changes in the Korean Trade Pattern on the U.S. Economy 
Cuong Liew, University of Oklahoma 
The Effects of Labor Disputes on Korea-U.S. Trade Structure 
Discussants: Marcus NoLAND, Institute for International Economics 
Joosunc Jun, Yale University 
E. Kwan Cuor, Iowa State University 
Ker Mu Lee, Rice University 
Won W. Koo, North Dakota State University 


8:00 A.M. CURRENT DEVELOPMENT OF CHINESE FINANCIAL MANAGEMENT AND PLANNING (Joint Session with the 
Society on Economics and Management in China) 
Presiding: CHENG-FEW Lee, Rutgers University-New Brunswick 
Papers: Janson CHANG AND Bos C. Li, World Bank 
Enterprise Investment and Financing Policy Review for Guangdong Agriculture Reclamation 
Myron J. Gorpon, University of Toronto 
The Agency Problem and Its Solution in China’s State Enterprises 
CHENG-FEW Lee, Rutgers University-New Brunswick, AND Git Yen, National Central University 
Financial and Economic Relationship Between China and Taiwan 
Discussants: D. GALE Jounson, University of Chicago 
FRANK C. JEN, State University of New York-Buffalo 
CHARLES Ou, U.S. Small Business Administration 


8:00 A.M. REassEssiNG THE ROLE OF FINANCE IN DEVELOPMENT (Joint Session with the Union for Radical 
Political Economists) 
Presiding: CARMEN DIANA DEERE, University of Massachusetts-Amherst 
Papers: BEN Crow, Open University 
Credit Conditions and the Context of Credit: The Case of the Bangladesh Grain Trade 
Kerru GRIFFIN, University of California-Riverside ` 
Foreign Finance and Domestic Savings: What Have Twenty Years Taught Us? 
LAURENCE Hannis, Open University 
Public Finance and Financing Development 
Discussants: STANLEY FISCHER, World Bank 


10:15 A.M. TusonETICAL IO wrrH AppPriCATIONS TO R&D 
Presiding: Ex1zABETH Horrman, University of Arizona 
Papers: BETH ALLEN, University of Pennsylvania 
Choosing R&D Projects: An Informational Approach 
KAREN PALMER, Resources for the Future 
Diversification by Regulated Monopolies and Incentives for R&D 


MÜ 
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SuzANNE ScoTCHMER, University of California-Berkeley 
Cooperation in R&D and the Breadth of Patent Protection 
Discussants: TARA VisHWANATH, Northwestern University 
PAULA-ÀNN Cech, Northwestern University 
EsrHER GaL-Or, University of Pittsburgh 


10:15 A.M. Tux PoLrricAr. Economy or CAPITAL MOBILITY 
Presiding: Eric Nirssow, Tufts University 
Papers: Timotuy Korcuun, Skidmore College 
The Responsiveness of Domestic Investment to Foreign Economic Conditions: An Analysis of 
Seven OECD Countries 
GERALD EPSTEIN AND HERBERT GiNrIs, University of Massachusetts-Amherst 
An Asset Balance Model of International Capital Market Equilibrium 
Gary DvMski, University of Southern California, ANb MANUEL Pastor, Occidental College 
Misleading Signals: Bank Lending and the Latin American Debt Crisis 
Discussants: RICHARD MacIntyre, University of Rhode Island 
JEFFREY A. FRANKEL, University of California-Berkeley 
Jonn WitLIAMSON, Institute for International Economics 


10:15 A.M. Economics oF GRowrH AND STAGNATION - i 
Presiding: BaniuL A. MAJUMDAR, Washington State University 
Papers: Deepak K. Lar, University College, London 
The Political Economy of Poverty, Equity, and Growth in 21 Depi Countries—A 
Summary of Findings 
BapiuL A. Mayumpar, Washington State University 
Economics of Growth and Stagnation: Korea Versus Bangladesh 
Gustav F. Papanek, Boston University 
Growth, Equity, and Politics: Contrasting Development Strategies in India and Pakistan 
Discussants: ANNE O. KRUEGER, Duke University 
E. Wayne Narzicer, Kansas State University 
Douc as O. WALKER, United Nations 


10:15 A.M. TRANSFERS AND EXCHANGES OUTSIDE THE MARKETPLACE 
Presiding: ODED Starx, Harvard University and Bar-Ilan University 
Papers: JAMES ANDREONI AND JOHN Kanr ScHorz, University of Wisconsin-Madison 
An Econometric Analysis of Charitable Giving with Interdependent Preferences 
DoNar» Cox, Boston College, anb EMMANUEL JIMENEZ, World Bank 
Motives for Private Transfers over the Life Cycle 
THEODORE BERGSTROM, University of Michigan 
Systems of Benevolent Utility Interdependence 
Opep Stark, Harvard University and Bar-Ilan University 
Nonmarket Transfers: The Role of Altruism 
Discussants: KENNETH J. ARRow, Stanford University 
Gary S. BECKER, University of Chicago 


10:15 A.M. PatH-DEPENDENCE IN Economics: THE INVISIBLE HAND IN THE GRIP OF THE PAST 
Presiding: PAUL A. Davin, Stanford University 
Papers: STEVEN Dur.aur, Stanford University 
Persistence and Multiple Equilibria in Aggregate Fluctuations 
James Heckman, Yale University 
Identifying the Hand of the Past: Distinguishing State Dependence from Heterogeneity 
PauL R. KruGMan, Massachusetts Institute of Technology 
History as a Determinant of Location and Trade 
PauL MiLcRom, Stanford University 
Complementarities and the Evolution of Manufacturing Organizations 
Discussants: W. BRIAN ARTHUR, Stanford University 
PAur A. Davin, Stanford University 
PauL Romer, University of California-Berkeley 
Rosert M. SoLow, Massachusetts Institute of Technology 


10:15 A.M. REFORM, REGULATION, AND THE EcoNoMiIc BEHAVIOR OF SOCIALIST FIRMS 
Presiding: Joser Brapa, Arizona State University 
Papers: Istvan AngL, Karl Marx University, Budapest, AND Jonn P. Bonin, Wesleyan University 
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Janez PRASNIKAR, University of Ljubljana, Yugoslavia, anD JAN SvEJNAR, University of Pitts- 
burgh 

The Objectives and Employment Behavior of Yugoslav Firms 
NADEZHDA MiKHEEVA, Academy of Sciences USSR, ano Jupitru THORNTON, University of 
Washington 

Structural Change and Integration of the Soviet Far East into the Pacific Rim Economy 

Discussants: IsvAN Dosozi, Institute for World Economy, Budapest, and Colorado School of Mines 

PETER MurrELL, University of Maryland 


10:15 A.M. PATTERNS or MiNonITY PROGRESS f 
Presiding: MARGARET C. Simms, Joint Center for Political Studies 
Papers: HARRIET Oncurr Du eer, U.S. Commission on Civil Rights, AnD SETH SANpEns, Univer- 
sity of Chicago 
Accounting for the Narrowing Wage Gap Between American-Born Asians and Whites 
NADJA ZALOKAR, U.S. Commission on Civil Rights 
Sources of Black Women's Economic Progress Since 1940 
HARRIET Oncurr DULEEP AND Mark C. Recets, U.S. Commission on Civil Rights 
Selectivity and Changes in the Relative Earnings of Hispanics: Evidence from Diverse Groups 
June O’NEILL, Baruch College 
Educational Achievement and Racial Differences in Economic Status 
Discussants: WALTER Y. O1, University of Rochester 
Wayne Vroman, Urban Institute 


10:15 A.M. Seminars on RECENT Topics AND TECHNIQUES* 
Speaker: James PoTERBA, Massachusetts Institute of Technology 
New Developments in Public Finance 


10:15 A.M. THe Tax REFORM Act or 1986: APPRAISAL AND PROSPECTS 
Presiding: Henry J. Aaron, Brookings Institution 
Papers: JANE G. GRAVELLE, Congressional Research Service 
The Tax Reform Act of 1986 and Tax Equity 
JoeL SLEMROD, University of Michigan 
Did the Tax Reform Act of 1986 Simplify Matters? 
Gary BunTLESS AND BARRY BoswoRTH, Brookings Institution 
The Supply Side to Tax Reform: Saving, Investment and Labor Supply 
Discussants: J. GREGORY BALLENTINE, KPMB Peat Warwick 


10:15 A.M. THE CONTRIBUTIONS OF FRIEDRICH A. HAYEK (Joint Session with the History of Economics Society) 
Presiding: Bruce J. CALDWELL, University of North Carolina-Greensboro 
Papers: Bruce J. CALDWELL, University of North Carolina-Greensboro 

Hayek’s Methodology 

MeEGunaD Desa, London School of Economics 
Hayek’s Early Contributions to Economic Theory 

WnuLiAM Buros, Trinity College 
Hayek's Theory of Psychology 

JEREMY SHEARMUR, Institute for Human Studies 
Hayek's Political Economy 

Karen Vaucun, George Mason University 
Hayek's Theory of Culture 


10:15 A.M. Economic CONSEQUENCES OF RAISING THE MINIMUM Wace (Joint Session with the Industrial ' 
Relations Research Association) 
Presiding: FRANCINE D. BrAu, University of Illinois-Urbana-Champaign 
Papers: Davip Carp, Princeton University 
Using Regional Variation in Wages to Measure the Effect of Minimum Wages 
ALIDA CasTILLO-FREEMAN, National Bureau of Economic Research, AND RICHARD B. FREEMAN, 
Harvard University 
The Effect of the U.S. Minimum Wage: The United States and Puerto Rico 
Lawrence F. Katz, Harvard University, AND ALAN B. Krusc=r, Princeton University 
Does the Minimum Wage Improve Minimum Wage Jobs? 
BRADLEY R. SCHILLER, American University 
Training and Advancement of Minimum Wage Workers 
Discussants: CHARLES C. Brown, University of Michigan 
Lisa M. Lyncu, Massachusetts Institute of Technology 


f. 
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2:30 P.M. EwPriRICAL ANALYSES OF R&D AND PRODUCTIVITY GROWTH 
Presiding: BRoNwvN Hatt, University of California-Berkeley 
Papers: Linpa M. Epwarps AND Betsy Fie-p-HEeNnprREy, Quens College, City University of New 
York 
Unions and Productivity in the Public Sector: The Case of Solid Waste Collection 
Janet W. Tittincer, Texas A&M University 
Dividend Taxation and Dividend Signalling: An Analysis of the Effects on Investment in 
Research and Development 
Sanau J. Lane, Boston University 
New Diggins in Bituminous Coal—The Structure of the Output Market as a Determinant of 
Investment in New Technology 
CATHERINE Morrison, National Bureau of Economic Research 
The Impacts of Markups, Capacity Utilization, and Scale Economies on Productivity Growth: 
A Reevaluation Using Productive Theory 
Discussants: Lisa M. LvNcH, Massachusetts Institute of Technology 
Epwin MaNsriELD, University of Pennsylvania 


2:30 P.M. DIRECT INVESTMENT IN THE UNITED STATES: WHAT ARE THE Issues? 
Presiding: Martin Konui, New York State ICC 
Papers: Norman J. GrickMaN, Rutgers University-New Brunswick, AnD Doucras P. WOODWARD, 
University of South Carolina 
Strategy and Structure of Japanese Manufacturing Investment 
PauL R. KnucMAN, Massachusetts Institute of Technology, ANp Epwarp M. GRAHAM, Institute 
for International Economics 
Discussants: RoBERT KuTTNER, The New Republic 
Auice H. Amspen, New School for Social Research 
Les SurrH, New York State ICC 


2:30 P.M. Can Feminism Finn A Home IN Economics? 
Presiding: RoBERT PoLtak, Washington University and University of Pennsylvania 
Papers: PAULA ENGLAND, University of Arizona 
How Should Feminism Change Economic Theory? 
Craupia Gotpin, Harvard University 
The Pollution Theory of Discrimination 
Diana STRASSMAN, Rice University 
Feminism and Economic Knowledge 
Nancy Fo.sre, University of Massachusetts 
The Male Domain of Reason: Androcentrism in Classical Political Economy 
Discussants: REBECCA BrANK, Northwestern University 
PAULETTE OLSEN, Wright State University 


2:30 P.M. New RESEARCH ON THE UNDERCLASS 
Presiding: IsaBeL V. SawniLL, Urban Institute 
Papers: RoNALD B. Mincy, Urban Institute 
Where the Underclass Lives: Evidence from Cross-Sectional Data 
EpwaRD M. Gramuica, University of Michigan 
Geographic Mobility and the Underclass 
JoNATHAN CRANE, Harvard University 
Economic Models of Noneconomic Motivations for Underclass Behavior 
Discussants: Gary BunrrEss, Brookings Institution 
Rosert H. Haveman, University of Wisconsin-Madison 
RosBERT D. REISCHAUER, Congressional Budget Office 


2:30 P.M. THE Economic Impact oF RELIGIOUS FUNDAMENTALISM 
Presiding: FREDERIC L. Pryor, Swarthmore College 
Papers: DEEPAK LAL, University College, London 
The Economic Impact of Hindu Fundamentalism 
Timur Kuran, University of Southern California 
Has Islamic Economics Made a Difference? 
Laurence R. Iannaccone, University of Santa Clara 
Heirs to the Protestant Ethic? The Economics of American Fundamentalists 
Discussants: T. N. SRINIVASAN, Yale University 
Dyavab SALEHI-IsFAHANI, Virginia Polytechnic Institute & State University 
KENNETH BourpiNo, University of Colorado 
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2:30 P.M. RECOMMENDATIONS AND Finpincs oF AEA COMMISSION ON GRADUATE EDUCATION IN ECONOMICS 
Presiding: RoBERT Etsner, Northwestern University 
Papers: ANNE O, KRUEGER, Duke University 
The Commission’s Recommendations 
W. Lee Hansen, University of Wisconsin-Madison 
The Commission’s Findings 
Discussants: MARK BLAuc, University of London 
Davip C. Cotanper, Middlebury College 
DaNrEL New on, National Science Foundation 


2:30 P.M. EMPIRICAL IMPLICATIONS OF MARKET STRUCTURE FOR PRICING BEHAVIOR 
Presiding: Joun LONDREGAN, Carnegie Mellon University 
Papers: Craig M. Newmark, North Carolina State University 
Do High Prices Indicate Collusion? A Critical Reiew of Price- Concentration Studies 
WiLLIAM N. Evans AND Ioannis N. Kessipes, University of Maryland 
The Relationship Between Market Share and Price in the Airline Industry 
ANDREW S. Joskow, GREGORY WERDON, AND RICHARD Jounson, U.S. Department of Justice 
Empirical Analysis of Entry and Exit in Airline Markets 
Discussants: Sucato BHATTACHARYYA, Carnegie Mellon University 


2:30 P.M. THe Goat oF PRICE STABILITY 
: Presiding: Davin J. Stockton, Board of Governors of the Federal Reserve System 
Papers: LAURENCE BALL, Princeton University 
Real Effects of Disinflation with Credible Policy 
Jack SELopv, Bank of Canada 
The Costs and Benefits of Price Stability: An Empirical Assessment 
Davip Lesow, Jonn ROBERTS, AND Davip J. Srockrow, Board of Governors of the Federal 
Reserve System 
Economic Performance under Price Stability 
Discussants: STANLEY FiscHER, World Bank and Massachusetts Institute of Technology 
BENNETT McCarLuM, Carnegie Mellon University 
CHaRLES ScHULTZE, Brookings Institution 


2:30 P.M. Economic DEVELOPMENTS AND PROSPECTS IN CZECHOSLOVAKIA, YUGOSLAVIA, AND GERMANY (Joint 
Session with the Association for Comparative Economic Studies) 

Presiding: Jan SvEmNAR, University of Pittsburgh 

Panel: Viapimir Dioucny, Deputy Prime Minister of Czechoslovakia 
KareL Dysa, Czechoslovak Academy of Sciences 
JAN SvEiNAR, University of Pittsburgh 
ZivKO PREGL, Deputy Prime Minister of Yugoslavia 
Janez PRAsNIKAR, University of Ljubljana, Yugoslavia 
Horst SIEBERT, Kiel Institute of the World Economy 
InwiN Correr, University of Houston 


2:30 P.M. Peace Economics: Score, NATURE AND FUTURE Directions (Joint Session with Economists Against 
the Arms Race) : 
Presiding: SoLoMoN W. PoLAcHEK, State University of New York-Binghamton : 
Papers: WaLTER Isarp, Cornell University, anD CHARLES H. ANDERTON, Holy Cross College 
A Survey of Peace Economics and Its Future 
Discussants: MARTIN C. McGuire, University of Maryland 
Jack HinsHrgIFER, University of California-Los Angeles 
Murray WorrsoN, California State University-Fullerton 
Mancur Orson, University of Maryland 


2:30 P.M. How ROBUST ts THE INTERNATIONAL FINANCIAL SvsrEM (Joint Session with the International Trade 
and Finance Association) 
Presiding: CATHERINE MANN, Federal Reserve System 
Papers: RicHARD J. HEnRING, University of Pennsylvania 
Dogs that Didn't Bark: What Do They Tell Us about Systematic Risk? 
RoNarp I. McKinnon, Stanford University 
Exchange-Risk and Interest Rate Volatility Since Bretton V/oods 
Hyman P. Minsky, Jerome Levy Economics Institute 
Fragility and Resilience of the International Financial System 
H. Peter Gray, Rensselaer Polytechnic Institute 
A Clear and Present Danger? 
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4:45 P.M. PRESIDENTIAL ADDRESS AND BUSINESS MEETING 
Presiding: THomas C. SCHELLING, University of Maryland 
Speaker: GERARD DEBREU, University of California-Berkeley 


Sunday, December 30, 1990 


8:00 A.M. MonriLITY, WAGES, AND GENDER 
Presiding: KATHARINE ABRAHAM, University of Maryland 
Papers: Lisa M. Lyncn, Massachusetts Institute of Technology 
The Role of Off-the-Job versus On-the-Job Training for Wage Growth and Mobility of Women 
Workers 
Tueresa J. Devine, Pennsylvania State University 
Gender Differences in Job Exit Behavior: An Empirical Analysis Using SIPP 
CoNsTANCE Rump, Congressional Budget Office 
Retirement in the Dual Worker Family 
Discussants: MARK Gritz, University of Washington 
KarHRYN SHAW, Carnegie Mellon University 
JANE SioGREN, Simmons College 


8:00 A.M. OPTIMAL PRICING AND ADJUSTMENT CosTs 
Presiding: Davip I. RosENBAUM, University of Nebraska-Lincoln 
Papers: Jonn A. Carson, Purdue University 
Customer Sensitivity to Relative Prices and Frequency of Price Changes 
Menc-Hua Ye, George Washington University, ann Davip I. RosENBAUM, University of 
Nebraska-Lincoln 
Optimal (s, $) Pricing Boundaries Under Stochastic Inflation Rates 
Jerzy D. Konreczy, University of Western Ontario 
Variable Price Adjustment Costs 
Discussants: KENNETH D. Boyer, Michigan State University 
Lawrence W. Martin, Michigan State University 
Jurio RorEMBERG, Massachusetts Institute of Technology 


8:00 A.M. THE Economic IMPACT OF IMMIGRATION 
Presiding: Dante. HAMERMESH, Michigan State University 
Papers: GEORGE J. Borsas, University of California-Santa Barbara 
Immigrants in the U.S. Labor Market: 1940-1980 
Davin Carp, Princeton University 
Immigration and the Earnings of Natives . 
Ropert J. LALONDE AND RoBERT H. Topet, University of Chicago 
The Assimilation of Immigrants 
Discussants: Larry Katz, Harvard University 
SrEPHEN J. Treso, University of California-Santa Barbara 
Jonn ABowp, Cornell University 


8:00 A.M. AFRICA’s IMMISERIZATION: WuicH Way FoRWARD? 
Presiding: E. Wayne NarzicER, Kansas State University 
Papers: ELLIOT Berc, Development Alternatives, Inc. 
Structural Adjustment and Its Critics: Are We On the Right Path? | 
Frances STEWART, Oxford University 
Adjustment with a Human Face 
E. Wayne NAFZIGER, Kansas State University, AnD HowARD STEIN, Roosevelt University 
Africa's Economic Development: A Critique of the World Bank and the Lagos Plan of Action 
Discussants: ROLF VAN DER Hoeven, UNICEF 
Baptut A. Majumpar, Washington State University 
Fantu CHERU, American University 


8:00 A.M. Tur DISTRIBUTION or THE BENEFITS OF NONPROFIT INSTITUTIONS 
Presiding: CHARLES T. CLOTFELTER, Duke University 
Papers: Dick Netzer, New York University 
Arts and Cultural Institutions 
Jerr Bipprg, Michigan State University 
Religious Congregations 
Sanpra Baum, Skidmore College, anD SAUL ScHwartz, Tufts University 
Educational Institutions 
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Discussants: HELEN Lapp, Duke University 
EMMETT Carson, Ford Foundation 
EsTELLE JAMES, State University of New York-Stony Brook 


8:00 A.M. BEHAVIORAL FINANCE 
Presiding: CoLin CAMERER, University of Pennsylvania 
Papers: LAURIE Simon BAGWELL, Northwestern University 
Dutch Auction Repurchases: An Analysis of Shareholder Heterogeneity 
KENNETH French, University of Chicago, AND JAMES PoTERBA, Massachusetts Institute of 
Technology 
The World Cross-Border Investment Puzzle 
Joser LAKONISHOK, University of Illinois-Urbana-Champaign, ANDREI SHLEIFER, University of 
Chicago, RicHarD THALER, Cornell University, AnD RoBerT VisuNv, University of Chicago 
Do Portfolio Managers Window Dress? 
RicHARD ZECKHAUSER, JAY PATEL, AND DARRYLL HENDRICKS, Harvard University 
What Investment Flows Reveal About Investory Behavior 
Discussants: SUSAN Corus, Council of Economic Advisors 
HowARD KuNREUTHER, University of Pennsylvania 
CoriN CAMERER, University of Pennsylvania 


8:00 A.M. DISCRIMINATION 
Presiding: Harry HorzzR, Michigan State University 
Papers: Barry T. HinscH, Florida State University, AND EDWARD J. ScHuMACHER, R. L. Banks & 
Associates, Inc. 
Labor Earnings, Discrimination, and the Racial Composition of Jobs 
Ronald D’Amico, SRI International, anb Nan L. MAxwe t, California State University-Hay- 
ward 
Employment During the School-to-Work Transition: An Explanation for Subsequent 
Black-White Wage Differentials and Bifurcation of Black Income 
Micuaet A. Leeps, Haverford College 
Determinants of Success in the AEA Summer Minority Program 
Discussants: WAYNE VROMAN, Urban Institute 
CHARLES C. Brown, University of Michigan 
RuoNDA WitLiAMs, University of Maryland 


8:00 A.M. LirE-CvcrE ANALYSIS OF Tax INCIDENCE 
Presiding: Rosemary Marcuss, Congressional Budget Office 
Papers: Diana FurcuTcotr-Rotu, American Petroleum Institute 
The Measurement of Regressivity: The Case of the Mctor Fuels Tax 
ANDREW B. Lyon anp Rosert M. Scuwas, University of Maryland 
Estimated Tax Incidence in a Life-Cycle Framework 
Don FuLterton, University of Virginia, AND Diane Lim Rocers, Pennsylvania State University 
A General Equilibrium Study of Lifetime Tax Incidence 
Discussants: JANET FURMAN SPEYRER, University of New Orleans 
RoBERT Luckz, Price Waterhouse 
Joun Hakken, Congressional Budget Office 


8:00 A.M. , EcoNoMY IN TRANSITION: Issues AND Poticies (Joint Session with the Chinese Economic Association 
in North America) 
Presiding: BEN-cHIEH Liu, Chicago State University 
Papers: Curr J. Huang, Vanderbilt University, AND Jin-Taw Liu, Academia Sinica, R.O.C. 
Manufacturing Production Frontier and Efficiency: A Case Study of Electronic Industry in 
Taiwan 
Henry Wan, Cornell University 
Technology Transfer and New Growth Theory 
Pinc Wane, Pennsylvania State University, Maw-Lin Lee, University of Missouri, AND BEN-CHIEH 
Liv, Chicago State University 
Growth Versus Equity: A Comparative Study for Taiwan and South Korea 
Gary H. Jerrerson, Brandeis University, anp Xu Wenyi, Wuhan University and Brandeis 
University 
The Convergence of Factor Returns in Transitional Economies: The Case of Chinese Industry 
Discussants: GALE JonNsoN, University of Chicago 
Grecory C. Cuow, Princeton University 
Rosert Eisner, Northwestern University 
ANTHONY Y. C. Koo, Michigan State University 
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8:00 A.M. Economerric SruprEs or HosprrAL Facror Costs (Joint Session with the Health Economics 


Research Organization) 
Presiding: Donan E. YetT, University of Southern California 
Papers: MicHaEL Grossman, City University of New York Graduate School and National Bureau of 
Economic Research, FRED GOLDMAN, Susan W. Nesnarrr, New School for Social Research and 
National Bureau of Economic Research, AND PAMELA Mosi.ia, Brooklyn College, City University 


of New York and National Bureau of Economic Research 
Determinants of Interest Rates on Hospital Bonds 

Joun A. Rizzo, Agency for Health Care Policy and Research 
Investment and Capital Costs in Proprietary and Not-for-Profit Hospitals 

Jack HapLEv, Georgetown University, AND STEPHEN ZUCKERMAN, Urban Institute 
Dynamic Estimates of Multiproduct Hospital Cost Functions: 1980-1986 

Discussants: FRANK SLOAN, Vanderbilt University 
Gerarp J. Wenig, University of Pennsylvania 


8:00 A.M. Comparative Economic Pouicies (Joint Session with the Society for Policy Modeling) 
Presiding: Dominick SALVATORE, Fordham University 
Papers: Virrorio Corso, World Bank 

Economic Policies in Latin America 

Deena KHATKHATE, World Development 
Economic Policies in India 

ADEBAYO ADEDEJI, Economic Commission for Africa 
Economic Policies in Africa 

Discussants: Jonn M. PAGE, World Bank 
ANDARU Ray, World Bank 
SrEPHEN O'Brian, World Bank 


10:15 A.M. LEARNING AND ADAPTIVE Economic BEHAVIOR 
Presiding: KENNETH J. Arrow, Stanford University 
Papers: W. BRIAN ARTHUR, Stanford University 
On Calibrated Rationality 
RicHARD J. HERRNSTEIN, Harvard University 
Experiments on Stable Suboptimality in Individual Behavior 
Jonn HotLaNp, University of Michigan, AND Jonn MILLER, Santa Fe Institute 
Just Don't Eat the Typewriter: Artificial Adaptive Agents in Economic Theory 
Discussants: KEN BiNMonE, University of Michigan 
Drew FupENBERG, Massachusetts Institute of Technology 
Jonn Geanoxop tos, Yale University 


10:15 A.M. Women, Work, AND CHILDREN IN CANADA AND THE UNITED STATES 
Presiding: Wit.LiAM T. ALPERT, William H. Donner Foundation and University of Connecticut 
Papers: ARLEEN LEIBOWITZ AND LiNDA Warre, Rand Corporation 
Employment Behavior Surrounding the First Birth 
SOLOMON PorAcHEK, State University of New York-Binghamton 
Effects of Employment Interruptions on Female Wage Rates 
ALICE NAKAMURA AND Masao Nakamura, University of Alberta 
Effects of Children on Female Earnings: Employment Behavior and Wage Effects in 
Intertemporal Context 
Discussants: MARTIN BROWNING, McMaster University 
T. PauL ScuurrZz, Yale University 


10:15 A.M. TEENAGE PREGNANCY AND WELFARE DEPENDENCY 
Presiding: REBECCA BLANK, Northwestern University . f 
Papers: WitLiAM N. Evans, WALLACE OATES, AND RoserT Scuwas, University of Maryland 
Community Characteristics and Teenage Pregnancy 
GEORGE CAVE AND DANIEL FRIEDLANDER, Manpower Demonstration Research Corporation 


an 


How Do Welfare Employment Programs Achieve Their Impacts? Evidence from Cross-Pro- 


gram Comparisons 
SHARON Lona, Urban Institute 


Analysis of Welfare Dependency: The Dynamics of Participation in the AFDC and Food 


Stamp Programs 
Discussants: ROBERT PLotnicx, University of Washington 
Laurie Bassi, Georgetown University 
June O’NEILL, Baruch College 
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10:15 A.M. Tus Locic oF CoLtective AcTION: Twenty-Five YEARS LATER 
Presiding: BURTON WEIsBROD, University of Wisconsin-Madison and Northwestern University - 
Papers: Topp SANDLER, Iowa State University 

The Logic of Collective Action: A Retrospective Look 
Martin McGuire, University of Maryland 
Empirical Methods and Collective Action 
Mancur Orson, University of Maryland 
Unanswered Questions 
Discussants: Jonn Tscuiruart, University of Wyoming 
RoBert ToLLisoN, George Mason University 
DwicHT Lee, University of Georgia 


10:15 A.M. ExcHaANGE Rare Poricv 
Presiding: KATHRYN Dominguez, Harvard University 
Papers: Haut J. Epison, Board of Governors of the Feceral Reserve System, AND GRACIELA 
Kaminsky, University of California-San Diego 
Target Zones and Exchange Rate Variability 
Virrorio GRILL AND NounigL RouniNi, Yale University 
Liquidity, Exchange Rates, and the International Transmission of Monetary Policy 
PauL R. KRuGMAN, Massachusetts Institute of Technology 
Speculative Attack and Regime Collapse 
KaTHRYN Dominguez, Harvard University, AND JEFFREY A. FRANKEL, University of California- 
Berkeley 
Exchange Rate Policy Announcements—When Do They Matter? 
Discussants: RoBERT Cumsy, New York University 
Witttam H. Branson, Princeton University 
KENNETH A. Fnoor, Massachusetts Institute of Technology 
Rupicer Dornsuscu, Massachusetts Institute of Technology 


10:15 A.M. PATTERNS of FAcULTY RETIREMENT 
Presiding: ALBERT Rees, Princeton University 
Papers: G. GREGORY LoztER AND MicHAEL Doonis, Pennsylvania State University 
Projecting Faculty Retirement: Factors Influencing Individual Decisions 
SHARON P. Smrru, Princeton University 
Ending Mandatory Retirement in the Arts and Sciences 
ALAN L. Gustman, Dartmouth College, anp THOMAS STEINMEIER, Texas Technological Univer- 
si 
The Effects of Pensions and Retirement Policies on Retirement in Higher Education 
Discussants: W. LEE Hansen, University of Wisconsin-Madison 
RICHARD BuRKHAUSER, Syracuse University 


10:15 AM. Economics or Drucs 
Presiding: PauL J. TauBMAN, University of Pennsylvania and National Bureau of Economic 
Research 
Papers: GARY BECKER, University of Chicago 
Legalization of Drugs 
JEFFREY Miron, National Bureau of Economic Research 
The Effects of Drug Criminalization on Drug Use: Some Cross-Regime Evidence 
Rosin Sickes, Rice University and National Bureau of Economic Research, AND Paut J. 
TAUBMAN, University of Pennsylvania and National Bureau of Economic Research 
Who Uses Illegal Drugs? 
Discussants: CLaupia GorpiN, Harvard University 
Mark Keman, Harvard University 
PETER REUTER, Rand Corporation 


10:15 A.M. Seminars ON RECENT Topics AND TECHNIQUES* 
Speaker: Joun W. CAMPBELL, Princeton University 
New Developments in the Analysis of Stock Markets 


10:15 A.M. LABOR MARKET INSTITUTIONS AND OUTCOMES 
Presiding: Lisa M. LvNcH, Massachusetts Institute of Technology 
Papers: Topp L. Ipson anp Puiuir K. Rosins, University of Miami 
Labor Turnover and Worker Mobility in Small and Large Firms 
Takao Kato AND Mark Rocker, Colgate University 
Managerial Internal Labor Markets and Executive Compensation: A Comparison of Japan and 
the United States 
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JANET CurRIE, University of California-Los Angeles, AND SHEENA MCCONNELL, Mathematical 
Policy Research 
The Right to Strike versus Compulsory Arbitration in the Public Sector: Effects on Wages and 
Dispute Costs 
Linpa C. CALvALLUZZO, Union College, AND James M. MacDona p, Rensselaer Polytechnic . 
Institute 
The Effects of Rail Deregulation on Labor 
Discussants: JAMES REBITZER, Massachusetts Institute of Technology 
SHERRY GLIED, Columbia University 
Nancy Rose, Massachusetts Institute of Technology 


10:15 A.M. WHEN AND Wuy Do INsrrrUTIONs REALLY Marrer? New EVIDENCE FROM Economic History 
(Joint Session with the Economic History Association) 
Presiding: JEREMY ATACK, University of Illinois-Urbana-Champaign 
Papers: MICHAEL Borno AND EUGENE Wurrzg, Rutgers University-New Brunswick 
A Tale of Two Currencies: British and French Finances During the Napoleonic Wars 
WiLLIAM Darity, University of North Carolina-Chapel Hill 
Technical Change in Agriculture, Population Growth, and Theories of Economic History 
ELIZABETH HOFFMAN AND Gary LiBEcaP, University of Arizona 
The History and Functioning of a Government-Sponsored Cartel: Agricultural Marketing 
Orders for Citrus ' 
Discussants: LARRY D. NEAL, University of Illinois-Urbana-Champaign 
GREGORY CLank, University of Michigan 
Tuomas S. ULEN, University of Illinois-Urbana-Champaign 


10:25 A.M. REREGULATION FOR THE PROMOTION OF COMPETITION? (Joint Session with the, Transportation and 
Public Utilities Group) 
Presiding: BENIAMIN J. ALLEN, Iowa State University \ 
Papers: RODNEY STEVENSON, University of Wisconsin-Madison 
Regulation, Sharing Rules, and the Quality of Competition 
RoBERT WINDLE AND MarTIN DnESSNER, University of Maryland 
Competition at “Competitive” Hubs in the U.S. Domestic Air Transport Industry 
EpwarD MonasH AND GEORGE WAGGENHEIM, Michigan State University 
The Impact of Federal Deregulation on State Regulatory Attempts 
Discussants: DENIS BREEN, Federal Trade Commission 
Jor KERKVLIET, Oregon State University 


Note: Although the following sessions will be included in the final program, it was not possible at press time to 
schedule day and time assignments because of incomplete information: 


Economic Reform in Poland and Hungary History of Economic Thought 
Applications of Contract Design Greenhouse Warming 
What Remains of Comparative Advantage? Demand Uncertainty, Inventory Investment, 
Intertemporal Choice and Seat Belts 
Innovation and Transaction-Cost Economics Asian Economic Regimes 
Western Europe, Eastern Europe, and the Economics and Public Policy 
World Economy Information 
Topics in Macroeconomics Bidding and Negotiation 
Consumption, Saving, and Interest Rates Depository Institutions 
Output, Employment, Prices The Political Economy and Monetary Policy 
The Economics of Health Care Economic Effects of Government Credit 
Teenage Pregnancy and Welfare Dependency Market Policies 
Mitigation of and Adaptation to Global Tropical Forest Protection (joint with AERE) 
Environmental Change Frontiers of Cliometric Research (joint with CS) 
Protection of the Individual ODE Chapter Advisers and Regional Directors 
Political Economy Session (joint with ODE) 





"The American Economic Association will offer a new series of seminars on recent topics and techniques at the 1990 
annual meeting. At these seminars, a member of the Association will lecture on a recently developed technique or research 
area. Reading lists will be rade available so that participants can further develop.their interest. The aim of these seminars 
is to reduce the time between development of new knowledge and its availability in textbooks or survey articles. Seminars 
will last approximately two hours. 


“THE CHOICE 
OF THE EXPERTS 


GAUSS 


*6 The speediest and most 
versatile of the mathematical 
programming languages’? 


Barry Simon - PC Magazine 





You get more work done when you use the right language. 





Fast and Flexible 

Now you can discover why scientists, engi- 
neers and statisticians agree that GAUSS is the 
language of choice when you want the most in 
speed and flexibility from your PC. The built-in 


matrix routines 
are written in 
assembly lan- 
guage and are 
faster than even 
the fastest C or 
Fortran compil- 
ers. 

It's not just 
execution speed that makes GAUSS the better 
choice. GAUSS is a matrix language, using 
familiar syntax for expressing mathematical - 
relationships. The result is you will shorten your 
development time and write less code. GAUSS's 
programming environment has all the tools you 
will need to write large programs, including an 
on-line debugger and an on-line help system 
that can handle the functions you write. 


Expandable 


Create your own libraries of functions or call 
FORTRAN, C, or assembly language routines. 


Publication Quality Graphics 

The GAUSS system includes 2-D and 3-D 
publication quality graphics with an underlying 
resolution of 4096 x 3120. 


Applications Modules 
The optional SimGauss module simulates 
nonlinear differential equations, state-space 


and discrete systems. It's powerful, flexible and 
supports state/parameter vectors for increased 
speed and ease. Modules are also available for 
numerical optimization, solving simultaneous 
nonlinear equations, generating basic descrip- 
tive statistics, maximum likelihood, logit, probit, 
loglinear analysis, and others. 


Economical 

For just $495 you get the basic system 
including a library of mathematical functions 
and publication quality graphics. 


Guaranteed 

If you don't agree that GAUSS is the speediest 
and most versatile of the mathematical pro- 
gramming languages, simply return it for a full 
refund. No questions asked. Call (206) 631-6679 
to order. 


GAUSS-faster, easier, smarter... guaranteed. 


System includes 700 page 
manual and 11 program and 
application disks for IBM 
PC-XT-PS/2 á compatibles 
with math coprocessor. 

386 protected mode version 
also available. 


GAUSS GAUSS 


BY APTECH BY APTECH SYSTEMS, INC.” INC. 

26250 - 196th Place SE, Kent Washington 98042 
Phone (206) 631-6679 FAX (206) 630-1220 
CALL OR WRITE FOR FREE BROCHURE 
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Here is what Axum users are saying: ` 


**Fantastic..." 
WordPerfect...’ i 


Rates of Change 
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Analytical Drawings 
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“In a matter. of n minütes I took my Lotus daía, created a graph, and used it in 
“Axum is exactly what I've been waiting for.” 


<T created publication-quality 


graphs without even using the manele “other graphics packageé he: Stone Age compared 


with Axum! po? 


Introducing Aims £ 


Axum, å new technical graphics and 
data analysis program for PC's, 
combines the ease-of-use of the best- 
selling business graphics packages 
with the advanced capabilities of 
expensive mainframe technical 
graphics programs. 
Even if you have never used a graphics 
program before, with Axum you will 
be analyzing and graphing your data 
within minutes, 
Publication-quality 
technical graphs 
Take advantage of Axum's extraordi- 
nary display capabilities. 
Produce publication-quality, color 
graphs 


‘Choose from numerous graph types: 
2D, 3D, Contour & more 


- Choose from 21 fonts including 
Greek and scientific; use unlimited 
super and subscripts 


- Graph arbitrary 2D and 3D functions 
Interactively create and customize 





oV 


XN M. 


-æ 0) 
Place multiple graphs on pagë 
rotate, shrink, or expand ' 
Automatic regression plots 

Scale plots to multiple y axes and, 
logarithmic axes” 

Change the viewpoint of your 3D 
graph . 

Features ultra-high-resolution 
output from GraphiC?" by Scien- 
tific Endeavors 


State-of-the-art 
. , data editor 
Use Axum to perform both simple and 
advanced data analysis with ease. 
Easily import, edit and transform data 
Explore and analyze your data quickly 
and easily using multiple, interrelated 
'*data sheets” 
Use unlimited-sized data sets 
Do statistics, curve fitting, and 
smoothing 
* Create macros using Axum's built-in 
advanced programming language 
Choose from over 100 functions and 
operators for manipulating and 
transforming your data 


Works with the software 
you already have 
Use your graphs in WordPerfect, 
Word, Ventura, PageMaker 
- Export to GEM, Lotus PIC, HPGL, 
' Tektronix, and Encapsulated Post- 
Script files 


Import ASCII, Lotus, and dBase files 
- Use virtually any monitor, printer 
(including PostScript) and plotter 


60 day money back 
guarantee 


Try Axum without risk, If you are not 


` completely satisfied, retum it within 60 


days for a refund. 








Axum” 
Call toll free for FREE brochure 


1-800-548-5653 ext. 029 
TRIMETRIX 


344 NB Ravenna Blvd Suite 210-AE 
Seanle, WA 98115 
TEL. WA 206-527-1801 


FAX 206-522-9159 
For the IBM PC, XT, AT, PS/2 and compatibles 














